When Is Deep Learning the Best Approach to Knowledge Tracing?

Intelligent tutoring systems (ITSs) teach skills using learning-by-doing principles and provide learners with individualized feedback and materials adapted to their level of understanding. Given a learner's history of past interactions with an ITS, a learner performance model estimates the curr...

Full description

Saved in:
Bibliographic Details
Published inJournal of educational data mining Vol. 12; no. 3; pp. 31 - 54
Main Authors Gervet, Theophile, Koedinger, Ken, Schneider, Jeff, Mitchell, Tom
Format Journal Article
LanguageEnglish
Published International Educational Data Mining 27.10.2020
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Intelligent tutoring systems (ITSs) teach skills using learning-by-doing principles and provide learners with individualized feedback and materials adapted to their level of understanding. Given a learner's history of past interactions with an ITS, a learner performance model estimates the current state of a learner's knowledge and predicts her future performance. The advent of increasingly large scale datasets has turned deep learning models for learner performance prediction into competitive alternatives to classical Markov process and logistic regression models. In an extensive empirical comparison on nine real-world datasets, we ask which approach makes the most accurate predictions and in what conditions. Logistic regression--with the right set of features--leads on datasets of moderate size or containing or containing a very large number of interactions per student, whereas Deep Knowledge Tracing leads on datasets of large size or where precise temporal information matters most. Markov process methods, like Bayesian Knowledge Tracing, lag behind other approaches. We follow this analysis with ablation studies to determine what components of leading algorithms explain their performance and a discussion of model calibration (reliability), which is crucial for downstream applications of learner performance prediction models.
AbstractList Intelligent tutoring systems (ITSs) teach skills using learning-by-doing principles and provide learners with individualized feedback and materials adapted to their level of understanding. Given a learner's history of past interactions with an ITS, a learner performance model estimates the current state of a learner's knowledge and predicts her future performance. The advent of increasingly large scale datasets has turned deep learning models for learner performance prediction into competitive alternatives to classical Markov process and logistic regression models. In an extensive empirical comparison on nine real-world datasets, we ask which approach makes the most accurate predictions and in what conditions. Logistic regression--with the right set of features--leads on datasets of moderate size or containing or containing a very large number of interactions per student, whereas Deep Knowledge Tracing leads on datasets of large size or where precise temporal information matters most. Markov process methods, like Bayesian Knowledge Tracing, lag behind other approaches. We follow this analysis with ablation studies to determine what components of leading algorithms explain their performance and a discussion of model calibration (reliability), which is crucial for downstream applications of learner performance prediction models.
Author Gervet, Theophile
Schneider, Jeff
Koedinger, Ken
Mitchell, Tom
Author_xml – sequence: 1
  fullname: Gervet, Theophile
– sequence: 2
  fullname: Koedinger, Ken
– sequence: 3
  fullname: Schneider, Jeff
– sequence: 4
  fullname: Mitchell, Tom
BackLink http://eric.ed.gov/ERICWebPortal/detail?accno=EJ1273917$$DView record in ERIC
BookMark eNpNjEFLwzAYhoNMcM5dvQn5A5350qRJDiJzTp0WvBQ8jjT5slZmWtKC6K-3oAffy_McHt5zMotdREIuga0k13D9jbHz3UqAyAsQJ2TOQaqMA2Ozf35GlsPwzqZJoYzSc3Lz1mCku4HeI_a0RJtiGw90bJDe4TDSdd-nzrqGjh19id3nEf0BaZWsm7LbC3Ia7HHA5R8XpHrYVpunrHx93G3WZYaGj1mBvOaMmdpbGSQEh8qJoJXm6FB7AHCgna-FtT7UXhuup1x6USBaZUK-IFe_t5hat-9T-2HT1377DFzlBlT-A2rNSmc
ContentType Journal Article
DBID ERI
GA5
DOI 10.5281/zenodo.4143614
DatabaseName ERIC
ERIC - Full Text Only (Discovery)
DatabaseTitle ERIC
DatabaseTitleList ERIC
Database_xml – sequence: 1
  dbid: ERI
  name: ERIC
  url: https://eric.ed.gov/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Education
EISSN 2157-2100
ERIC EJ1273917
ExternalDocumentID EJ1273917
GroupedDBID AAHSB
ABOPQ
ALMA_UNASSIGNED_HOLDINGS
ERI
FRS
GA5
OK1
ID FETCH-LOGICAL-e92t-6e2b2009bda5f51fce7c4f8782ece8d111c18cdb4aadfbd89282005d46eea79f3
IEDL.DBID ERI
ISSN 2157-2100
IngestDate Tue Sep 02 19:32:48 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 3
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-e92t-6e2b2009bda5f51fce7c4f8782ece8d111c18cdb4aadfbd89282005d46eea79f3
ORCID 0000-0002-5080-9073
0000-0001-8851-533X
0000-0001-7373-0301
OpenAccessLink http://eric.ed.gov/ERICWebPortal/detail?accno=EJ1273917
PageCount 24
ParticipantIDs eric_primary_EJ1273917
PublicationCentury 2000
PublicationDate 2020-10-27
PublicationDateYYYYMMDD 2020-10-27
PublicationDate_xml – month: 10
  year: 2020
  text: 2020-10-27
  day: 27
PublicationDecade 2020
PublicationTitle Journal of educational data mining
PublicationYear 2020
Publisher International Educational Data Mining
Publisher_xml – name: International Educational Data Mining
SSID ssj0000547978
Score 2.1315365
Snippet Intelligent tutoring systems (ITSs) teach skills using learning-by-doing principles and provide learners with individualized feedback and materials adapted to...
SourceID eric
SourceType Open Access Repository
StartPage 31
SubjectTerms Academic Achievement
Bayesian Statistics
Comparative Analysis
Data Analysis
Feedback (Response)
Intelligent Tutoring Systems
Knowledge Level
Learning Processes
Markov Processes
Prediction
Title When Is Deep Learning the Best Approach to Knowledge Tracing?
URI http://eric.ed.gov/ERICWebPortal/detail?accno=EJ1273917
Volume 12
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV25TgMxELUgTWgQV8QtF7SG2PGu7QqFIwpBUAUpXeRjTLeJyNLw9Yx3TUiDRG3Lkg-9N288fibkSijLTcTDi7JLMul5ZMmVjukIXICIod8k9F9ey_GbnMyK2W_q4r8VlROOtItKYzs9YTQbqqe19JbKNBiMfKYYipp-a9dYCM1vvqBCwXctMVBAauqS7nqwXPK8wS-jPbKbA0M6bHdyn2xBdZD-VM71F4ck4WZFn1b0AWBJsy_qO8UAjt4htNNhdgen9YI-_2TKKHKRx263R2Q6epzej1n-_ICBETUrQbh0ceGCLWLBowflJa6pFuBBB0Qoz7UPTlobogvaoHTCiQdZAlhl4qBHOtWigmNCbX9gG-Myq5UE73TpbMm90aVxXsVwQnpp2vNla28xX6_H6V8NZ2RHJNmJEC7UOenUH59wgdxcu8tmJ74BVAiJEQ
linkProvider ERIC Clearinghouse on Information & Technology
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=When+Is+Deep+Learning+the+Best+Approach+to+Knowledge+Tracing%3F&rft.jtitle=Journal+of+educational+data+mining&rft.au=Gervet%2C+Theophile&rft.au=Koedinger%2C+Ken&rft.au=Schneider%2C+Jeff&rft.au=Mitchell%2C+Tom&rft.date=2020-10-27&rft.pub=International+Educational+Data+Mining&rft.issn=2157-2100&rft.eissn=2157-2100&rft.volume=12&rft.issue=3&rft.spage=31&rft_id=info:doi/10.5281%2Fzenodo.4143614&rft.externalDocID=EJ1273917
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2157-2100&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2157-2100&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2157-2100&client=summon