TY - JOUR
T1 - Inferring student comprehension from highlighting patterns in digital textbooks
T2 - 2nd International Workshop on Intelligent Textbooks, iTextbooks 2020
AU - Kim, David Y.J.
AU - Winchell, Adam
AU - Waters, Andrew E.
AU - Grimaldi, Phillip J.
AU - Baraniuk, Richard
AU - Mozer, Michael C.
N1 - Funding Information:
This research is supported by NSF awards DRL-1631428 and DRL-1631556. We thank Christian Plagemann and three anonymous reviewers for their helpful feedback on earlier drafts of this manuscript.
Publisher Copyright:
Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2020
Y1 - 2020
N2 - We investigate whether student comprehension and knowledge retention can be predicted from textbook annotations, specifically the material that students choose to highlight. Using a digital open-access textbook platform, Openstax, students enrolled in Biology, Physics, and Sociology courses read sections of their introductory text as part of required coursework, optionally highlighted the text to flag key material, and then took brief quizzes as the end of each section. We find that when students choose to highlight, the specific pattern of highlights can explain about 13% of the variance in observed quiz scores. We explore many different representations of the pattern of highlights and discover that a low-dimensional logistic principal component based vector is most effective as input to a ridge regression model. Considering the many sources of uncontrolled variability affecting student performance, we are encouraged by the strong signal that highlights provide as to a student’s knowledge state.
AB - We investigate whether student comprehension and knowledge retention can be predicted from textbook annotations, specifically the material that students choose to highlight. Using a digital open-access textbook platform, Openstax, students enrolled in Biology, Physics, and Sociology courses read sections of their introductory text as part of required coursework, optionally highlighted the text to flag key material, and then took brief quizzes as the end of each section. We find that when students choose to highlight, the specific pattern of highlights can explain about 13% of the variance in observed quiz scores. We explore many different representations of the pattern of highlights and discover that a low-dimensional logistic principal component based vector is most effective as input to a ridge regression model. Considering the many sources of uncontrolled variability affecting student performance, we are encouraged by the strong signal that highlights provide as to a student’s knowledge state.
KW - Knowledge retention
KW - Student modeling
KW - Textbook annotation
UR - http://www.scopus.com/inward/record.url?scp=85093839130&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85093839130&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85093839130
SN - 1613-0073
VL - 2674
SP - 67
EP - 79
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
Y2 - 6 July 2020
ER -