91直播

‘Data Mining’ Gains Traction in Education

By Sarah D. Sparks, December 13, 2010

The new and rapidly growing field of educational data mining is using the chaff from data collected through normal school activities to explore learning in more detail than ever before, and researchers say the day when educators can make use of Amazon-like feedback on student learning behaviors may be closer than most people think.

Educational data mining uses some of the typical data included in state longitudinal databases, such as test scores and attendance, but researchers often spend more time analyzing the detritus cast off during normal classroom data-collection practices, such as student interactions in a chat log or the length of responses to homework assignments, information that researchers call “data exhaust.”

Analysis of massive databases isn’t new to fields like finance and physics, but it has started to gain traction in education only recently, with the first international conference on the subject held in 2008 and the first academic journal launched a year later. Experts say such data mining allows faster and more fine-grained answers to education questions and ultimately might change the way students are tested and taught.

“Data resources you wouldn’t necessarily think would be useful can turn out to be very powerful for making inferences,” said Ryan S. J. d. Baker, an assistant professor of psychology and learning sciences at Worcester Polytechnic Institute in Massachusetts. For example, research from the Pittsburgh-based Carnegie Mellon University found small changes in the length of time a student took to answer individual test questions signaled the student was struggling, cheating, or had given up in favor of filling in answers randomly.
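
To make the response-time idea concrete, here is a minimal Python sketch. It is not the Carnegie Mellon researchers' method, which the article does not describe; the function name `flag_unusual_times`, the 3x threshold, and the sample timings are all hypothetical, chosen only to show how answers far from a student's own typical pace might be picked out for a closer look.

```python
from statistics import median

def flag_unusual_times(times_sec, ratio=3.0):
    """Return the indices of answers that took far less or far more time
    than this student's median answer time.

    `ratio` is an illustrative threshold (an assumption, not a published
    value): an answer is flagged if it is more than `ratio` times faster
    or slower than the median.
    """
    if not times_sec:
        return []
    typical = median(times_sec)
    flagged = []
    for i, t in enumerate(times_sec):
        too_fast = t * ratio < typical   # possible random guessing
        too_slow = t > typical * ratio   # possible struggling
        if too_fast or too_slow:
            flagged.append(i)
    return flagged

# Made-up timings, in seconds, for one student's eight answers.
times = [42, 38, 45, 40, 3, 4, 41, 150]
print(flag_unusual_times(times))  # [4, 5, 7]
```

A flag here says only that an answer was unusual for that student; deciding whether it reflects struggling, cheating, or giving up would take the much larger data sets the researchers describe.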

“I can easily imagine just a little bit of classroom observation data could do a lot to contextualize the other information about student achievement” in state accountability databases, Mr. Baker said.

Expanding Data Universe

In centers like the Pittsburgh Science of Learning Center’s DataShop, researchers can use advanced computers to analyze 238 data sets of online and classroom data, comprising 49 million individual student actions.

“You might be collecting thousands of data points for a single student (in some areas virtually millions), whereas the traditional qualitative methods in education psychology might have dozens or even a hundred measures,” said Arthur C. Graesser, a psychology professor at the University of Memphis and editor of the Journal of Educational Psychology.

“That changes the quantitative methods enormously,” Mr. Graesser said. “Not only can you look at unique learning trajectories of individuals, but the sophistication of the models of learning goes up enormously.”

This data hasn’t been studied in such depth before because it’s only possible to find significant results when researchers can study a huge number of data points. For example, Mr. Baker studied a topic that has frustrated teachers for generations: students who try to get through a task without actually learning the material.

“Students spend on average 3 percent of the time gaming the system, maybe 15 [percent of students] will do it at least once,” Mr. Baker said. With only a few dozen students, it’s almost impossible to tell exactly when and how it happens, he explained, “but when you have data from thousands of students, you can.”

Studying hundreds of thousands of data points on students working through an online tutoring program, Mr. Baker created a model to allow the program to recognize when a student was attempting to complete a task without mastering the material, and then present the missed material again in a new way.
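
The general shape of such a detect-and-respond loop can be sketched in a few lines of Python. This is a toy rule, not Mr. Baker's model, which was built from hundreds of thousands of data points and is not detailed in the article. The `Attempt` record, the two-second cutoff, and the run length of three are hypothetical assumptions for illustration only.

```python
from dataclasses import dataclass

@dataclass
class Attempt:
    seconds: float   # time the student spent before answering
    correct: bool    # whether the answer was right

def looks_like_gaming(attempts, fast_cutoff=2.0, run_length=3):
    """Return True if the log contains `run_length` consecutive attempts
    that were both very fast and wrong.

    Both thresholds are illustrative assumptions, not published values.
    """
    run = 0
    for a in attempts:
        if a.seconds < fast_cutoff and not a.correct:
            run += 1
            if run >= run_length:
                return True
        else:
            run = 0  # a slow or correct attempt breaks the streak
    return False

def next_step(attempts):
    """Choose what a hypothetical tutoring program does next."""
    if looks_like_gaming(attempts):
        return "re-present material in a new way"
    return "continue"

# Made-up logs: one student clicking through, one working slowly.
guessing = [Attempt(1.0, False), Attempt(0.8, False), Attempt(1.1, False)]
working = [Attempt(20.0, False), Attempt(1.0, False), Attempt(30.0, True)]
print(next_step(guessing))  # re-present material in a new way
print(next_step(working))   # continue
```

A fixed rule like this would misfire on fast but honest mistakes, which is one reason a model fitted to data from thousands of students can do better than a hand-set threshold.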

Research that draws on educational data mining may also compress the lag time between undertaking a study and getting usable results, addressing a common critique from educators.

“I think this is escalating the speed of research on many problems in education,” Mr. Graesser said. “In the past, somebody runs an efficacy study where they spend five years trying to study a sample that may include more than one classroom, and it takes a lot of time and a lot of money, whereas [an] EDM [educational data mining] study provides a far richer set of data on students in a matter of weeks or months. It’s a whole different style.”

Imitating Amazon

For practicing educators, the question educational data mining raises is: Does this mean researchers could create tools for teachers that collect information in the same way that Amazon.com, the online retailer, collects information on customers’ buying habits? Could systems be developed that can track whether a student is excited about some topics but not others, struggling with decimals but not long division, and suggest interventions accordingly?

“Oh yeah, no problem! We have done that already,” said Greg Chung, the co-principal investigator of the Center for Advanced Technology and Schools at the University of California at Los Angeles. In the early 2000s, his team developed a program for the U.S. Marines that tested which soldiers were likely to have trouble with different aspects of marksmanship based on their understanding of trigger-control and then automatically assigned soldiers study materials. By the end of one week on the program, the participating Marines developed better marksmanship skills. Dr. Bror Saxberg, chief learning officer at Kaplan, Inc., said at a Dec. 7 discussion at the Washington D.C.-based think tank Education Sector that his firm is piloting similar rapid-feedback systems.

In fact, Mr. Chung and other researchers said, the technology and research can be developed faster than it takes to teach practitioners how to use it.

“Actually trying to do this in the classroom, it’s like, ugh,” Mr. Chung said. He recalled giving teachers electronic clickers that would allow every student in a class to answer a question (as opposed to only two or three in a classroom) and would allow the teacher to analyze their responses. But the sudden flurry of responses, and their range, quickly overwhelmed the teachers. “The teachers said, ‘Yeah, this is interesting, this is cool, and we learned a lot about our students, but what do you do in a class with so many different levels?’” Mr. Chung said. “They couldn’t address every kid.”

As data systems and the tools to analyze them become more ubiquitous, experts say we will need more research into how much and what kind of data are most helpful to teachers trying to improve their classroom instruction. Mr. Baker envisions that within a generation, preservice teachers will study data analysis as a matter of course, and researchers will develop easier-to-use tools to help them compare their own students’ behavior and performance to models based on hundreds of thousands of similar students.

Several states, including Louisiana and New York, are already experimenting with data tools that allow teachers and principals to track daily attendance, behavior and academic performance of each student.

In fact, a 2009 study by a team of researchers from Carnegie Mellon and Worcester Polytechnic found in the process of creating an online tutoring program that its underlying data model for tracking student progress could predict students’ year-end academic performance better than scores on the state’s standardized test.

“If we could show that a student’s work over time was a better predictor of student success than these state exams that everyone complains about anyway, wouldn’t that help us get a lot farther along?” said John C. Stamper, a systems scientist in the Carnegie Mellon Human-Computer Interaction Institute and technical director of the DataShop.

Moving Forward

Educational data mining is catching federal attention, too. The National Science Foundation this month opened a new $30 million grant for studying cyberlearning that is intended in part to expand computer-based educational data mining projects, said Joan Ferrini-Mundy, the acting assistant director for NSF’s Directorate for Education and Human Resources. “It’s fascinating and potentially very productive,” she said.

Likewise, Aneesh P. Chopra, the nation’s first federal chief technology officer, argued at the EdSector panel that new types of data and analysis will allow researchers to use more than “static” standardized test scores to identify best practices.

“Having a debate about whether that single data point moves here or here or here sounds like a silly conversation in the face of millions of data points,” Mr. Chopra said. “We need to understand at far more granulated levels of performance what works and what doesn’t.”

A version of this article appeared in the January 12, 2011 edition of 91直播 as ‘Data Mining’ Gains Traction in Education
