Data Mining for Academic Success

Purdue’s academic analytics correlate data from the course management and student information systems, to create predictive models that can support student retention strategies.

StatsIN A PROJECT begun in 2005, researchers at Purdue University (IN) are developing models to predict academic success: academic analytics that will eventually be used to create interventions for at-risk students. Their first step was to identify data that could be mined from the course management system (CMS) and from the student information system (SIS), and demonstrate which factors are most significant.

Researchers studied an initial sample of about 1,500 students during the Fall ’05 semester, and quickly expanded their work to reflect the entire range of WebCT supported classes at Purdue in Spring ’06. Analyses now include data on some 130,000 seats in the CMS (individual students may be counted more than once if they take more than one course), representing more than 30,000 students.

Exploring the Factors

Project lead John Campbell, Purdue’s associate VP for Teaching and Learning Technologies, explains how the study looks at the factors influencing academic success: “Academic success is really based on two different components: aptitude and effort. You can be the smartest person in the world, but if you don’t put in any effort, you’re not going to be successful. And people with less aptitude, who put a lot of effort into it, can be very successful.” So the researchers are rigorously examining indicators of aptitude and effort, by mining historical data such as SAT scores and GPA from the SIS (reflecting aptitude), and data on student use of the CMS from the Oracle back-end database connected to their WebCT system (reflecting effort).

The example in the graph above is a representative sample of 600 students across a range of classes and departments at Purdue. The chart shows the number of WebCT logins (where the fourth quartile is high and relative to the given class), the SAT scores (where the fourth quartile is high and relative to student SAT records for the given class), and the earned grade for the course (where A=4.0). This analyis demonstrates that the number of WebCT logins tends to impact the final grade—more dramatically in the case of students with a history of lower SAT scores and fewer WebCT logins.

StatsPredicting Is in the Future

Ultimately, the end goals are to develop intelligent agents that will automatically take actions (such as alerting the instructor that a student is likely in trouble, or notifying the student about help sessions that are available), and to provide trend data to administrators with an interest in retention. Campbell explains: “We have a lot of retention initiatives; the biggest challenge is getting the right people to the right initiative.” He points out that early intervention can be critical to success—and interventions may be more timely when triggered by academic analytics.

Editor’s Note: John Campbell and a team from Purdue will present their work on academic analytics at Campus Technology 2006 in Boston. For more information, go to www.campus-technology.com/conf.

Featured

  • Abstract AI symbols, including neural network diagrams and circuit-like patterns, blending seamlessly with a blue sky filled with soft white clouds

    UC Berkeley Announces Sky-T1-32B Open Source AI Model, Offering High Performance at a Fraction of the Cost

    UC Berkeley researchers have introduced Sky-T1-32B, an open source, reasoning-focused language model that delivers high performance at an unprecedented cost of under $450 for training.

  • glowing digital brain made of blue circuitry hovers above multiple stylized clouds of interconnected network nodes against a dark, futuristic background

    Report: 85% of Organizations Are Using Some Form of AI

    Eighty-five percent of organizations today are leveraging some form of AI, according to the latest State of AI in the Cloud 2025 report from Wiz. While AI's role in innovation and disruption continues to expand, security vulnerabilities and governance challenges remain pressing concerns.

  • AI robot with cybersecurity symbol on its chest

    Microsoft Adds New Agentic AI Tools to Security Copilot

    Microsoft has announced a major expansion of its AI-powered cybersecurity platform, introducing a suite of autonomous agents to help organizations counter rising threats and manage the growing complexity of cloud and AI security.

  • The AI Show

    Register for Free to Attend the World's Greatest Show for All Things AI in EDU

    The AI Show @ ASU+GSV, held April 5–7, 2025, at the San Diego Convention Center, is a free event designed to help educators, students, and parents navigate AI's role in education. Featuring hands-on workshops, AI-powered networking, live demos from 125+ EdTech exhibitors, and keynote speakers like Colin Kaepernick and Stevie Van Zandt, the event offers practical insights into AI-driven teaching, learning, and career opportunities. Attendees will gain actionable strategies to integrate AI into classrooms while exploring innovations that promote equity, accessibility, and student success.