TY - GEN
T1 - Exploring the value of different data sources for predicting student performance in multiple CS courses
AU - Liao, Soohyun Nam
AU - Zingaro, Daniel
AU - Alvarado, Christine
AU - Griswold, William G.
AU - Porter, Leo
N1 - Publisher Copyright:
© 2019 Association for Computing Machinery.
PY - 2019/2/22
Y1 - 2019/2/22
N2 - A number of recent studies in computer science education have explored the value of various data sources for early prediction of students' overall course performance. These data sources include responses to clicker questions, prerequisite knowledge, instrumented student IDEs, quizzes, and assignments. However, these data sources are often examined in isolation or in a single course. Which data sources are most valuable, and does course context matter? To answer these questions, this study collected student grades on prerequisite courses, Peer Instruction clicker responses, online quizzes, and assignments, from five courses (over 1000 students) across the CS curriculum at two institutions. A trend emerges suggesting that for upper-division courses, prerequisite grades are most predictive; for introductory programming courses, where no prerequisite grades were available, clicker responses were the most predictive. In concert, prerequisites and clicker responses generally provide highly accurate predictions early in the term, with assignments and online quizzes sometimes providing incremental improvements. Implications of these results for both researchers and practitioners are discussed.
AB - A number of recent studies in computer science education have explored the value of various data sources for early prediction of students' overall course performance. These data sources include responses to clicker questions, prerequisite knowledge, instrumented student IDEs, quizzes, and assignments. However, these data sources are often examined in isolation or in a single course. Which data sources are most valuable, and does course context matter? To answer these questions, this study collected student grades on prerequisite courses, Peer Instruction clicker responses, online quizzes, and assignments, from five courses (over 1000 students) across the CS curriculum at two institutions. A trend emerges suggesting that for upper-division courses, prerequisite grades are most predictive; for introductory programming courses, where no prerequisite grades were available, clicker responses were the most predictive. In concert, prerequisites and clicker responses generally provide highly accurate predictions early in the term, with assignments and online quizzes sometimes providing incremental improvements. Implications of these results for both researchers and practitioners are discussed.
KW - Architecture
KW - CS1
KW - CS2
KW - Data Structures
KW - Low-performing students
KW - Machine learning
KW - Prediction
KW - Student outcomes
UR - http://www.scopus.com/inward/record.url?scp=85064391128&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85064391128&partnerID=8YFLogxK
U2 - 10.1145/3287324.3287407
DO - 10.1145/3287324.3287407
M3 - Conference contribution
AN - SCOPUS:85064391128
T3 - SIGCSE 2019 - Proceedings of the 50th ACM Technical Symposium on Computer Science Education
SP - 112
EP - 118
BT - SIGCSE 2019 - Proceedings of the 50th ACM Technical Symposium on Computer Science Education
PB - Association for Computing Machinery, Inc
T2 - 50th ACM Technical Symposium on Computer Science Education, SIGCSE 2019
Y2 - 27 February 2019 through 2 March 2019
ER -