acronym,model,year,authors,title,venue,model_family,one_sentence,datasets,data_context,sample_size,metrics,reported_performance,split_protocol,notes,source_url,model_summary,data_summary,performance_summary,evidence_links
BKT,Bayesian Knowledge Tracing,1994.0,Corbett & Anderson,Knowledge Tracing: Modeling the Acquisition of Procedural Knowledge,UMUAI 1994,HMM (per-skill mastery),"Hidden Markov Model with learn/forget, slip, guess; updates mastery after each response.",Early Cognitive Tutor datasets,K–12 math,Small–medium,AUC/ACC,Foundational baseline; widely used comparator.,Chronological (varies),Classic baseline.,https://act-r.psy.cmu.edu/wp-content/uploads/2012/12/893CorbettAnderson1995.pdf,"Hidden Markov Model with learn/forget, slip, guess; updates mastery after each response.",Early Cognitive Tutor datasets; K–12 math; Small–medium,Metrics: AUC/ACC. Foundational baseline; widely used comparator.,https://act-r.psy.cmu.edu/wp-content/uploads/2012/12/893CorbettAnderson1995.pdf
AFM,Additive Factors Model,2006.0,"Cen, Koedinger & Junker",Learning Factors Analysis,ITS 2006,Logistic (opportunity slope),Logistic model with per-skill intercept and practice slope.,DataShop tutors,K–12,Varies,Accuracy/RMSE,Strong linear baseline; interpretable.,Per paper,Precursor to PFA.,https://link.springer.com/chapter/10.1007/11774303_58,Logistic model with per-skill intercept and practice slope.,DataShop tutors; K–12; Varies,Metrics: Accuracy/RMSE. Strong linear baseline; interpretable.,https://link.springer.com/chapter/10.1007/11774303_58
PFA,Performance Factors Analysis,2009.0,"Pavlik, Cen & Koedinger",Performance Factors Analysis—A New Alternative to KT,AIED 2009,Logistic (wins/fails),Logistic regression on successes/failures per skill.,"ASSISTments, Cognitive Tutors",K–12 math,Varies,AUC/RMSE,Outperformed AFM/LFA; competitive with BKT.,Per paper,Recency variants (R‑PFA).,https://doi.org/10.3233/978-1-60750-028-5-531,Logistic regression on successes/failures per skill.,"ASSISTments, Cognitive Tutors; K–12 math; Varies",Metrics: AUC/RMSE. Outperformed AFM/LFA; competitive with BKT.,https://doi.org/10.3233/978-1-60750-028-5-531
R-PFA,Recent-Performance Factors Analysis,2011.0,"Gong, Beck & Heffernan",How to construct more accurate student models,IJAIED 2011,Logistic (recency),Recency-weighted wins/fails for better nonstationarity handling.,ASSISTments,K–12,Varies,AUC/RMSE,Often outperforms PFA/BKT.,Per paper,Recency emphasis.,https://link.springer.com/article/10.1007/s40593-011-0001-1,Recency-weighted wins/fails for better nonstationarity handling.,ASSISTments; K–12; Varies,Metrics: AUC/RMSE. Often outperforms PFA/BKT.,https://link.springer.com/article/10.1007/s40593-011-0001-1
LKT,Logistic Knowledge Tracing,2021.0,Pavlik & Eglington,Logistic Knowledge Tracing,IEEE TLT 2021,Generalized logistic framework,Unifies AFM/PFA/IRT-style features for KT.,Various DataShop/ASSIST logs,K–12 to higher ed,Varies,AUC/NLL,Competitive with deep KT in several datasets; interpretable.,Per paper,Open-source implementations.,https://arxiv.org/abs/2005.00869,Unifies AFM/PFA/IRT-style features for KT.,Various DataShop/ASSIST logs; K–12 to higher ed; Varies,Metrics: AUC/NLL. Competitive with deep KT in several datasets; interpretable.,https://arxiv.org/abs/2005.00869
KTM,Knowledge Tracing Machines,2019.0,Vie & Kashima,Knowledge Tracing Machines,AAAI 2019,Factorization Machines,"FM over sparse features (student, item, skill, time).",Multiple public datasets,Mixed,Varies,AUC/LogLoss,Matches/exceeds DLKT in some settings with transparency.,Per paper,Flexible featureization.,https://arxiv.org/abs/1904.10603,"FM over sparse features (student, item, skill, time).",Multiple public datasets; Mixed; Varies,Metrics: AUC/LogLoss. Matches/exceeds DLKT in some settings with transparency.,https://arxiv.org/abs/1904.10603
SPARFA,SPARFA,2015.0,"Chandrasekaran, Lan & Baraniuk",Sparse Factor Analysis for Learning & Content Analytics,IEEE TSP 2015,Sparse factor (IRT-like),Learns latent concepts and item mappings from graded responses.,MOOCs/practice logs,Higher ed,Varies,AUC/NLL,Interpretable; basis for temporal SPARFA-Trace.,Per paper,Content analytics.,https://arxiv.org/abs/1211.3387,Learns latent concepts and item mappings from graded responses.,MOOCs/practice logs; Higher ed; Varies,Metrics: AUC/NLL. Interpretable; basis for temporal SPARFA-Trace.,https://arxiv.org/abs/1211.3387
SPARFA-Trace,SPARFA-Trace,2014.0,Lan et al.,SPARFA-Trace: A Unified Framework for Learning & Content Analytics,KDD 2014,Temporal SPARFA,Tracks time-varying concept mastery via message passing.,MOOC/tutor logs,Higher ed,Varies,AUC/NLL,Improved prediction vs static SPARFA.,Chronological,Temporal extension.,https://people.umass.edu/~andrewlan/papers/14kdd-sparfatrace.pdf,Tracks time-varying concept mastery via message passing.,MOOC/tutor logs; Higher ed; Varies,Metrics: AUC/NLL. Improved prediction vs static SPARFA.,https://people.umass.edu/~andrewlan/papers/14kdd-sparfatrace.pdf
Elo-KT,Elo-based Learner Modeling,2017.0,Pelánek et al.,Elo-based learner modeling for adaptive practice of facts,UMUAI 2017,Elo/rating-based,Online Elo updates for ability/difficulty with decay.,Adaptive practice datasets,K–12,Large,Accuracy/AUC/RMSE,"Simple, robust online updates.",Online,Memory/forgetting via decay.,https://link.springer.com/article/10.1007/s11257-016-9198-6,Online Elo updates for ability/difficulty with decay.,Adaptive practice datasets; K–12; Large,"Metrics: Accuracy/AUC/RMSE. Simple, robust online updates.",https://link.springer.com/article/10.1007/s11257-016-9198-6
Multi-Elo,Multivariate Elo,2019.0,Abdi et al.,A Multivariate Elo-based Learner Model,EDM 2019,Elo/rating-based (multi-skill),Extends Elo to multiple concepts for personalization.,University adaptive systems,Higher ed,Varies,Accuracy/AUC,Improves adaptivity across multi-skill content.,Online,Multivariate extension.,https://files.eric.ed.gov/fulltext/ED599177.pdf,Extends Elo to multiple concepts for personalization.,University adaptive systems; Higher ed; Varies,Metrics: Accuracy/AUC. Improves adaptivity across multi-skill content.,https://files.eric.ed.gov/fulltext/ED599177.pdf
DKT,Deep Knowledge Tracing,2015.0,Piech et al.,Deep Knowledge Tracing,NeurIPS 2015,RNN/LSTM,Encodes interaction sequences to predict next response.,"ASSISTments, KDD Cup, Statics",K–12/college,Large,AUC,Large gains over BKT/PFA.,Per paper,Spawned many variants.,https://stanford.edu/~cpiech/bio/papers/deepKnowledgeTracing.pdf,Encodes interaction sequences to predict next response.,"ASSISTments, KDD Cup, Statics; K–12/college; Large",Metrics: AUC. Large gains over BKT/PFA.,https://stanford.edu/~cpiech/bio/papers/deepKnowledgeTracing.pdf
DKT+,DKT+ (regularized),2018.0,Yeung & Yeung,Addressing Two Problems in DKT via Prediction-Consistent Regularization,L@S 2018,RNN + regularization,Adds reconstruction and smoothness regularizers to DKT.,ASSISTments,K–12,Varies,AUC,Stabilizes predictions; similar or better AUC.,Per paper,Reduces 'waviness'.,https://arxiv.org/abs/1806.02180,Adds reconstruction and smoothness regularizers to DKT.,ASSISTments; K–12; Varies,Metrics: AUC. Stabilizes predictions; similar or better AUC.,https://arxiv.org/abs/1806.02180
DKT-DSC,DKT with Dynamic Student Classification,2018.0,Minn et al.,Deep Knowledge Tracing and Dynamic Student Classification,arXiv 2018,RNN + dynamic grouping,Clusters students during training to condition predictions.,Benchmarks,K–12,Varies,AUC,Improved personalization vs vanilla DKT.,Per paper,Dynamic cohorts.,https://arxiv.org/abs/1809.08713,Clusters students during training to condition predictions.,Benchmarks; K–12; Varies,Metrics: AUC. Improved personalization vs vanilla DKT.,https://arxiv.org/abs/1809.08713
DKVMN,Dynamic Key-Value Memory Networks,2017.0,Zhang et al.,Dynamic Key-Value Memory Networks for KT,WWW 2017,Memory-augmented,Static concept keys with dynamic value memory.,"ASSISTments, Synthetic, Statics",Mixed,Large,AUC,Outperforms DKT on several datasets.,Per paper,Concept-level states.,https://arxiv.org/abs/1611.08108,Static concept keys with dynamic value memory.,"ASSISTments, Synthetic, Statics; Mixed; Large",Metrics: AUC. Outperforms DKT on several datasets.,https://arxiv.org/abs/1611.08108
SKVMN,Sequential Key-Value Memory Networks,2019.0,Abdelrahman et al.,Knowledge Tracing with Sequential Key-Value Memory Networks,SIGIR 2019,Memory-augmented (sequential),Adds sequential auxiliary memory for long-range dependencies.,Benchmarks,K–12,Varies,AUC,Beats DKVMN/others.,Per paper,Long-term memory.,https://arxiv.org/abs/1910.13197,Adds sequential auxiliary memory for long-range dependencies.,Benchmarks; K–12; Varies,Metrics: AUC. Beats DKVMN/others.,https://arxiv.org/abs/1910.13197
TCN-KT,Temporal Convolutional KT,2021.0,Ait Khayi et al.,Deep Knowledge Tracing using Temporal Convolutional Networks,FIE 2021,Temporal CNN,Uses TCNs to capture long-range dependencies efficiently.,Educational logs,K–12/higher ed,Varies,AUC,Competitive vs RNN/Transformer with efficiency.,Per paper,Convolutional alternative.,https://par.nsf.gov/biblio/10290861,Uses TCNs to capture long-range dependencies efficiently.,Educational logs; K–12/higher ed; Varies,Metrics: AUC. Competitive vs RNN/Transformer with efficiency.,https://par.nsf.gov/biblio/10290861
SAKT,Self-Attentive KT,2019.0,Pandey & Karypis,A Self-Attentive Model for KT,arXiv 2019,Self-attention,Selects relevant past interactions per query.,"ASSISTments, Statics",K–12/college,Varies,AUC,Improves over RNN baselines.,Per paper,Sparse attention.,https://arxiv.org/abs/1907.06837,Selects relevant past interactions per query.,"ASSISTments, Statics; K–12/college; Varies",Metrics: AUC. Improves over RNN baselines.,https://arxiv.org/abs/1907.06837
AKT,Attentive KT,2020.0,"Ghosh, Heffernan & Lan",Context-aware Attentive KT,KDD 2020,Attention (monotonic; IRT priors),Monotonic attention with forgetting and difficulty priors.,"ASSISTments, EdNet",K–12,Large,AUC/LogLoss,Outperforms prior attention/RNN baselines.,Student-wise/chronological,Context + forgetting.,https://dl.acm.org/doi/10.1145/3394486.3403282,Monotonic attention with forgetting and difficulty priors.,"ASSISTments, EdNet; K–12; Large",Metrics: AUC/LogLoss. Outperforms prior attention/RNN baselines.,https://dl.acm.org/doi/10.1145/3394486.3403282
SAINT,SAINT,2020.0,Choi et al.,Separated Self-AttentIve Neural KT,AIED 2020,Transformer (enc-dec),Separate encoders for items and responses; temporal features.,"EdNet (massive), others",K–12,Very large,AUC,Strong on EdNet; beats SAKT/DKT.,Per paper,Temporal features salient.,https://arxiv.org/abs/2007.04864,Separate encoders for items and responses; temporal features.,"EdNet (massive), others; K–12; Very large",Metrics: AUC. Strong on EdNet; beats SAKT/DKT.,https://arxiv.org/abs/2007.04864
simpleKT,simpleKT,2023.0,Liu et al.,A Simple But Tough-to-Beat Baseline for KT,arXiv 2023,Rasch-informed attention,Models question variation with dot-product attention.,7 public datasets,Mixed,Large,AUC,Top-3 across many settings; 57 wins/3 ties/16 losses vs 12 DLKT baselines.,Standardized,Strong baseline.,https://arxiv.org/abs/2302.06881,Models question variation with dot-product attention.,7 public datasets; Mixed; Large,Metrics: AUC. Top-3 across many settings; 57 wins/3 ties/16 losses vs 12 DLKT baselines.,https://arxiv.org/abs/2302.06881
KTST,KT Set-Transformers,2024.0,Neubauer et al.,Toward Principled Transformers for KT,OpenReview 2024,Set Transformer,Simpler set-based architecture with principled evaluation.,Standard corpora,Mixed,Varies,AUC/ACC,Competitive with simpler design.,Protocol-controlled,Critiques flawed eval.,https://openreview.net/forum?id=4dtwyV7XyW,Simpler set-based architecture with principled evaluation.,Standard corpora; Mixed; Varies,Metrics: AUC/ACC. Competitive with simpler design.,https://openreview.net/forum?id=4dtwyV7XyW
LPKT,Learning Process-consistent KT,2021.0,Shen et al.,Learning Process-consistent KT,KDD 2021,Process-consistent attention,Constraints enforce plausible knowledge progression.,Multiple benchmarks,Mixed,Varies,AUC/LogLoss,Improves AUC while keeping consistency.,Per paper,Process-aware.,https://dl.acm.org/doi/10.1145/3447548.3467237,Constraints enforce plausible knowledge progression.,Multiple benchmarks; Mixed; Varies,Metrics: AUC/LogLoss. Improves AUC while keeping consistency.,https://dl.acm.org/doi/10.1145/3447548.3467237
stableKT,Length-Generalization Attention KT,2024.0,IJCAI authors,Enhancing Length Generalization for Attention-Based KT,IJCAI 2024,Attention training (linear biases),Strategies to generalize to longer test sequences.,Public datasets,Mixed,Varies,AUC,Improves length generalization.,Per paper,Training strategy.,https://www.ijcai.org/proceedings/2024/654,Strategies to generalize to longer test sequences.,Public datasets; Mixed; Varies,Metrics: AUC. Improves length generalization.,https://www.ijcai.org/proceedings/2024/654
extraKT,Length Extrapolation for Attention KT,2024.0,pyKT authors,Extending Context Window via Length Extrapolation,pyKT 2024,Attention extrapolation,Extends effective context window for attention models.,Public datasets,Mixed,Varies,AUC,Better long-sequence performance.,Per paper,pyKT news/model.,https://pykt.org/news/index.html/,Extends effective context window for attention models.,Public datasets; Mixed; Varies,Metrics: AUC. Better long-sequence performance.,https://pykt.org/news/index.html/
HiTSKT,Hierarchical Transformer Session-aware KT,2024.0,Ke et al.,HiTSKT: A Hierarchical Transformer for Session-aware KT,KBS 2024,Hierarchical Transformer,Models within-session and cross-session dynamics.,Sessioned logs,Mixed,Varies,AUC/ACC,Improves over flat Transformers.,Chronological,Session structure.,https://arxiv.org/abs/2212.12139,Models within-session and cross-session dynamics.,Sessioned logs; Mixed; Varies,Metrics: AUC/ACC. Improves over flat Transformers.,https://arxiv.org/abs/2212.12139
SAINT+,SAINT+,2021.0,Shin et al.,Integrating Temporal Features for EdNet Performance Prediction,Preprint 2021,Transformer (enc-dec + temporal),Extends SAINT with temporal features.,"EdNet, others",K–12,Very large,AUC,Improved AUC over SAINT on EdNet.,Per paper,Temporal integration.,https://www.rtest.ai/pdf/2010.12042.pdf,Extends SAINT with temporal features.,"EdNet, others; K–12; Very large",Metrics: AUC. Improved AUC over SAINT on EdNet.,https://www.rtest.ai/pdf/2010.12042.pdf
GIKT,Graph-based Interaction KT,2020.0,Yang et al.,GIKT: A Graph-based Interaction Model for KT,ECML-PKDD 2020,GCN + interaction matching,GCN over question–skill graphs with history matching.,Three datasets,K–12/online,Standard,AUC,≥1% AUC gains vs baselines.,Per paper,Graph relations.,https://arxiv.org/abs/2009.05991,GCN over question–skill graphs with history matching.,Three datasets; K–12/online; Standard,Metrics: AUC. ≥1% AUC gains vs baselines.,https://arxiv.org/abs/2009.05991
DGEKT,Dual Graph Ensemble KT,2024.0,Cui et al.,DGEKT: Dual Graph Ensemble for KT,ACM TOIS 2024,Dual graph ensemble,Hypergraph exercise–concept + directed transition graph.,Multiple benchmarks,Mixed,Varies,AUC,Outperforms attention/DKT variants.,Per paper,Dual graphs.,https://dl.acm.org/doi/10.1145/3638350,Hypergraph exercise–concept + directed transition graph.,Multiple benchmarks; Mixed; Varies,Metrics: AUC. Outperforms attention/DKT variants.,https://dl.acm.org/doi/10.1145/3638350
GRKT,Graph-based Reasonable KT,2024.0,Cui et al.,Graph-based Reasonable KT,KDD 2024,Graph + pedagogical stages,Retrieval→strengthening→learning/forgetting stages with GNNs.,Standard datasets,Mixed,Varies,AUC,Improved accuracy & plausibility.,Per paper,Stage-wise cognition.,https://arxiv.org/abs/2406.12896,Retrieval→strengthening→learning/forgetting stages with GNNs.,Standard datasets; Mixed; Varies,Metrics: AUC. Improved accuracy & plausibility.,https://arxiv.org/abs/2406.12896
DyGKT,Dynamic Graph KT,2024.0,Cheng et al.,DyGKT: Dynamic Graph Learning for KT,KDD 2024,Dynamic graph + time encoders,Evolving student–question–concept graph with dual time encoders.,Large logs,Mixed,Large,AUC,SOTA/competitive vs strong baselines.,Chronological,Continuous-time.,https://arxiv.org/abs/2407.20824,Evolving student–question–concept graph with dual time encoders.,Large logs; Mixed; Large,Metrics: AUC. SOTA/competitive vs strong baselines.,https://arxiv.org/abs/2407.20824
SGKT,Session Graph KT,2022.0,Wu et al.,Session graph-based KT for student performance prediction,ESWA 2022,Session graph + GGNN,Models intra-session dependencies via GGNN + attention.,Sessioned logs,Mixed,Varies,AUC,Gains on session-aware setups.,Per paper,GGNN.,https://www.sciencedirect.com/science/article/abs/pii/S0957417422009770,Models intra-session dependencies via GGNN + attention.,Sessioned logs; Mixed; Varies,Metrics: AUC. Gains on session-aware setups.,https://www.sciencedirect.com/science/article/abs/pii/S0957417422009770
HHN-KT,Heterogeneous Hypergraph KT,2023.0,Wu & Ling,Self-supervised heterogeneous hypergraph network for KT,Information Sciences 2023,Hypergraph + SSL,Heterogeneous hypergraph with intra/inter attentions and self-supervision.,KT datasets,Mixed,Varies,AUC,Improved reps via SSL.,Per paper,Heterogeneous graphs.,https://www.sciencedirect.com/science/article/abs/pii/S0020025522015717,Heterogeneous hypergraph with intra/inter attentions and self-supervision.,KT datasets; Mixed; Varies,Metrics: AUC. Improved reps via SSL.,https://www.sciencedirect.com/science/article/abs/pii/S0020025522015717
MAKT-MP,MAKT (MetaPath + Attention),2022.0,Various,MAKT: Meta Path and Attention Mechanism,CSSE 2022,MetaPath + attention,Uses meta-paths over heterogeneous graphs with attention.,Heterogeneous KT datasets,Mixed,Varies,AUC/ACC,Gains vs baselines.,Per paper,Not same as 2025 MAKT.,https://www.researchgate.net/publication/366450725_MAKT_A_Knowledge_Tracing_Model_Based_on_Meta_Path_and_Attention_Mechanism,Uses meta-paths over heterogeneous graphs with attention.,Heterogeneous KT datasets; Mixed; Varies,Metrics: AUC/ACC. Gains vs baselines.,https://www.researchgate.net/publication/366450725_MAKT_A_Knowledge_Tracing_Model_Based_on_Meta_Path_and_Attention_Mechanism
MAHKT,Multi-association Heterogeneous Graph KT,2025.0,Yang et al.,KT with multi-association heterogeneous graph and knowledge transfer theory,KBS 2025,Heterogeneous graph + transfer,Five associations + transfer theory to enhance KT.,Public KT datasets,K–12/online,Varies,AUC/LogLoss,Improved accuracy vs strong baselines.,Per paper,Relation-rich.,https://www.sciencedirect.com/science/article/abs/pii/S0950705125000061,Five associations + transfer theory to enhance KT.,Public KT datasets; K–12/online; Varies,Metrics: AUC/LogLoss. Improved accuracy vs strong baselines.,https://www.sciencedirect.com/science/article/abs/pii/S0950705125000061
HawkesKT,Temporal Cross-Effects KT,2021.0,Wang et al.,Temporal Cross-Effects in KT,WSDM 2021,Hawkes processes + KT,Models mutual excitation across exercises using point processes.,Three benchmarks,K–12/online,Varies,AUC,Outperforms SOTA on timing-sensitive data.,Chronological,Precise timing.,https://dl.acm.org/doi/10.1145/3437963.3441802,Models mutual excitation across exercises using point processes.,Three benchmarks; K–12/online; Varies,Metrics: AUC. Outperforms SOTA on timing-sensitive data.,https://dl.acm.org/doi/10.1145/3437963.3441802
DAS3H,DAS3H,2019.0,Choffin et al.,Modeling Student Learning and Forgetting for Optimizing Spaced Repetition,EDM 2019,Logistic + time windows,Counts wins/fails in multiple windows + IRT terms.,Spacing/memorization datasets,Mixed,Large,AUC/NLL,Beats AFM/PFA esp. with item difficulty.,Chronological,Spacing-aware.,https://arxiv.org/abs/1905.06873,Counts wins/fails in multiple windows + IRT terms.,Spacing/memorization datasets; Mixed; Large,Metrics: AUC/NLL. Beats AFM/PFA esp. with item difficulty.,https://arxiv.org/abs/1905.06873
TCFKT,Transformer Convolutional Forgetting KT,2023.0,Zhang et al.,Transformer-based convolutional forgetting KT,Scientific Reports 2023,Transformer + forgetting,Convolutional attention with explicit forgetting factor.,Public datasets,Mixed,Varies,AUC,Outperforms several Transformer baselines.,Chronological,Forgetting factor.,https://www.nature.com/articles/s41598-023-45936-0,Convolutional attention with explicit forgetting factor.,Public datasets; Mixed; Varies,Metrics: AUC. Outperforms several Transformer baselines.,https://www.nature.com/articles/s41598-023-45936-0
FKT,Response Speed Enhanced Fine-grained KT,2024.0,Various,Response speed enhanced fine-grained KT,KBS 2024,Time/response-speed-aware,Uses response time to refine fine-grained KT.,KT datasets,Mixed,Varies,AUC/LogLoss,Improved AUC vs baselines.,Per paper,Response speed signals.,https://www.sciencedirect.com/science/article/abs/pii/S0950705124009862,Uses response time to refine fine-grained KT.,KT datasets; Mixed; Varies,Metrics: AUC/LogLoss. Improved AUC vs baselines.,https://www.sciencedirect.com/science/article/abs/pii/S0950705124009862
OKT,Open-Ended KT,2022.0,"Liu, Wang, Baraniuk & Lan",Open-Ended Knowledge Tracing,EMNLP 2022 (arXiv 2022),LLM/seq2seq for open-ended,"Predicts exact open-ended responses (e.g., code).",Programming datasets,Higher ed/CS,Varies,AUC/ACC,Enables non-MCQ KT; improved performance on open-ended tasks.,Per paper,Text/code modality.,https://arxiv.org/abs/2203.03716,"Predicts exact open-ended responses (e.g., code).",Programming datasets; Higher ed/CS; Varies,Metrics: AUC/ACC. Enables non-MCQ KT; improved performance on open-ended tasks.,https://arxiv.org/abs/2203.03716
LLMKT,LLM-based Dialogue KT,2025.0,"Scarlatos, Baker & Lan",Exploring KT in Tutor–Student Dialogues using LLMs,LAK 2025,LLM-assisted dialogue KT,"LLMs label skills/correctness from dialogue, then apply KT.",Tutor–student dialogue datasets,K–12,Varies,AUC/ACC,Significantly outperforms existing KT on dialogue tasks.,Per paper,Dialogue modality.,https://dl.acm.org/doi/10.1145/3636555.3636928,"LLMs label skills/correctness from dialogue, then apply KT.",Tutor–student dialogue datasets; K–12; Varies,Metrics: AUC/ACC. Significantly outperforms existing KT on dialogue tasks.,https://dl.acm.org/doi/10.1145/3636555.3636928
LKT (PLM),Language Model Can Do KT,2024.0,Lee et al.,Language Model Can Do Knowledge Tracing,arXiv 2024,PLM-enhanced KT,Integrates PLMs with KT to use textual semantics.,Text-rich KT datasets,Mixed,Varies,AUC/LogLoss,Improves over numeric-only baselines.,Per paper,Text-aware KT.,https://arxiv.org/abs/2406.02893,Integrates PLMs with KT to use textual semantics.,Text-rich KT datasets; Mixed; Varies,Metrics: AUC/LogLoss. Improves over numeric-only baselines.,https://arxiv.org/abs/2406.02893
SINKT,Structure-Aware Inductive KT,2024.0,Fu et al.,SINKT: Structure-Aware Inductive KT with LLMs,CIKM 2024,LLM + heterogeneous graph (inductive),Infers structure and handles unseen questions.,Benchmarks with unseen items,Mixed,Varies,AUC/ACC,Boosts performance on unseen questions.,Inductive splits,Inductive generalization.,https://arxiv.org/abs/2407.01245,Infers structure and handles unseen questions.,Benchmarks with unseen items; Mixed; Varies,Metrics: AUC/ACC. Boosts performance on unseen questions.,https://arxiv.org/abs/2407.01245
CLST,Cold-Start LLM KT,2024.0,Jung et al.,Aligning a Generative LM as a Students' Knowledge Tracer,arXiv 2024,LLM fine-tuning for KT,Mitigates cold-start across subjects by aligning a generative LM.,Multi-subject datasets,K–12,Varies,AUC/ACC,Improves cold-start prediction.,Cold-start protocols,NLP framing.,https://arxiv.org/abs/2406.10296,Mitigates cold-start across subjects by aligning a generative LM.,Multi-subject datasets; K–12; Varies,Metrics: AUC/ACC. Improves cold-start prediction.,https://arxiv.org/abs/2406.10296
QDCKT,Question Difficulty Consistent KT,2024.0,Liu et al.,Question Difficulty Consistent KT,WWW 2024,Difficulty-aware,Replaces raw QIDs with difficulty-consistent signals.,Multiple datasets,Mixed,Varies,AUC,Better AUC than ID-based KT on all reported datasets.,Student-wise,Unseen-item robustness.,https://dl.acm.org/doi/10.1145/3589334.3645582,Replaces raw QIDs with difficulty-consistent signals.,Multiple datasets; Mixed; Varies,Metrics: AUC. Better AUC than ID-based KT on all reported datasets.,https://dl.acm.org/doi/10.1145/3589334.3645582
QCKT,Question-based Comprehensive KT,2025.0,Shen et al.,Enhancing KT with question-based comprehensive features,KBS 2025,Feature augmentation,Richer question features to enhance multiple KT methods.,Multiple datasets,Mixed,Varies,AUC/ACC,Improved results across KT families.,Per paper,Generalizable features.,https://www.sciencedirect.com/science/article/abs/pii/S0950705125009451,Richer question features to enhance multiple KT methods.,Multiple datasets; Mixed; Varies,Metrics: AUC/ACC. Improved results across KT families.,https://www.sciencedirect.com/science/article/abs/pii/S0950705125009451
LM-KT,Language Model Can Do KT,2024.0,Lee et al.,Language Model Can Do KT,arXiv 2024,PLM-enhanced KT,(Duplicate consolidated),—,—,—,—,—,—,https://arxiv.org/abs/2406.02893,,(Duplicate consolidated),Evaluated on public KT benchmarks (arXiv 2024 2024).,Metrics: —. —,
OKT,Open-Ended KT,2022.0,Liu et al.,Open-Ended KT,EMNLP 2022,LLM/code,(Duplicate consolidated),—,—,—,—,—,—,https://arxiv.org/abs/2203.03716,,(Duplicate consolidated),Evaluated on public KT benchmarks (EMNLP 2022 2022).,Metrics: —. —,
DTransformer,Diagnostic Transformer,2023.0,Yin et al.,Stable KT with Diagnostic Transformer,WWW 2023,Transformer + diagnostic training,Traces knowledge rather than spurious temporal patterns.,Public KT benchmarks,Mixed,Varies,AUC/LogLoss,More stable & accurate under shift.,Chronological preferred,Mitigates shortcuts.,https://dl.acm.org/doi/10.1145/3543507.3583255,Traces knowledge rather than spurious temporal patterns.,Public KT benchmarks; Mixed; Varies,Metrics: AUC/LogLoss. More stable & accurate under shift.,https://dl.acm.org/doi/10.1145/3543507.3583255
UKT,Uncertainty-aware KT,2025.0,Various,Uncertainty-aware Knowledge Tracing,arXiv 2025,Distribution embeddings + Wasserstein attention,Models predictive uncertainty with distributional representations.,Six datasets,Mixed,Varies,AUC/Calibration,Surpasses deep KT while modeling uncertainty.,Student-wise,Uncertainty-aware.,https://arxiv.org/abs/2501.05415,Models predictive uncertainty with distributional representations.,Six datasets; Mixed; Varies,Metrics: AUC/Calibration. Surpasses deep KT while modeling uncertainty.,https://arxiv.org/abs/2501.05415
SAICL,Interaction-level Contrastive KT,2022.0,Park et al.,SAICL: Student Modelling with Interaction-Level Auxiliary Contrastive Tasks,arXiv 2022,Contrastive + auxiliary tasks,Adds self-/supervised contrastive objectives for KT and dropout.,Online learning datasets,K–12,Varies,AUC/ACC,Comparable/better KT with added dropout prediction.,Per paper,Multi-task.,https://arxiv.org/pdf/2210.09012,Adds self-/supervised contrastive objectives for KT and dropout.,Online learning datasets; K–12; Varies,Metrics: AUC/ACC. Comparable/better KT with added dropout prediction.,https://arxiv.org/pdf/2210.09012
Bi-CLKT,Bi-Graph Contrastive Learning KT,2022.0,Song et al.,Bi-CLKT: Bi-Graph Contrastive Learning based KT,arXiv 2022,Graph + contrastive,Node/graph-level contrastive pretraining before KT.,Four real-world datasets,K–12,Standard,AUC/ACC,Outperforms baseline KT models.,Per paper,E2E subgraph contrast.,https://arxiv.org/abs/2201.09020,Node/graph-level contrastive pretraining before KT.,Four real-world datasets; K–12; Standard,Metrics: AUC/ACC. Outperforms baseline KT models.,https://arxiv.org/abs/2201.09020
BG-CC,Bi-graph Co-Contrastive Pretraining,2023.0,Zhang et al.,Pre-training Question Embeddings with Bi-graph Co-contrastive Learning,ACM 2023,Self-supervised pretraining,Pretrains question reps via co-contrastive learning.,Public KT datasets,Mixed,Varies,AUC,Plug-in gains across KT.,Per paper,Pretraining helps.,https://dl.acm.org/doi/10.1145/3638055,Pretrains question reps via co-contrastive learning.,Public KT datasets; Mixed; Varies,Metrics: AUC. Plug-in gains across KT.,https://dl.acm.org/doi/10.1145/3638055
AT-DKT,Auxiliary-Task DKT,2023.0,Liu et al.,Enhancing DKT with Auxiliary Tasks,WWW 2023,RNN + multi-task,Adds question tagging and prior-knowledge prediction tasks.,Three datasets,Mixed,Varies,AUC,>0.9% AUC gains over DKT.,Student-wise,Auxiliary benefits.,https://arxiv.org/abs/2302.07942,Adds question tagging and prior-knowledge prediction tasks.,Three datasets; Mixed; Varies,Metrics: AUC. >0.9% AUC gains over DKT.,https://arxiv.org/abs/2302.07942
SP-CL-KT,Self-Paced Contrastive KT,2024.0,Dai et al.,Self-paced contrastive learning for KT,Neurocomputing 2024,Contrastive + curriculum,Self-paced curriculum for contrastive training.,Benchmarks,Mixed,Varies,AUC,Improved AUC vs non-contrastive.,Per paper,Curriculum contrastive.,https://www.sciencedirect.com/science/article/abs/pii/S0925231224011378,Self-paced curriculum for contrastive training.,Benchmarks; Mixed; Varies,Metrics: AUC. Improved AUC vs non-contrastive.,https://www.sciencedirect.com/science/article/abs/pii/S0925231224011378
DECKT,Dual-Encoder Contrastive KT,2025.0,Bai et al.,A Dual-Encoder Contrastive Learning Model for KT,Applied Sciences 2025,Dual encoders + contrastive,Separate student/exercise encoders with contrastive objectives.,Public datasets,Mixed,Varies,AUC,Performance gains vs baselines.,Per paper,Robust representations.,https://pmc.ncbi.nlm.nih.gov/articles/PMC12294018/,Separate student/exercise encoders with contrastive objectives.,Public datasets; Mixed; Varies,Metrics: AUC. Performance gains vs baselines.,https://pmc.ncbi.nlm.nih.gov/articles/PMC12294018/
DMT-KT,Deep Multi-Type KT,2021.0,Wang & Sahebi,Learning from Non-Assessed Resources: Deep Multi-Type KT,LAK 2021,Multi-type sequence,Incorporates videos/readings alongside assessments.,Course logs,Higher ed,Varies,AUC,Improved prediction with multi-type signals.,Per paper,Non-assessed interactions.,https://files.eric.ed.gov/fulltext/ED615584.pdf,Incorporates videos/readings alongside assessments.,Course logs; Higher ed; Varies,Metrics: AUC. Improved prediction with multi-type signals.,https://files.eric.ed.gov/fulltext/ED615584.pdf
GPPKT,Programming KT with KG & Personalized Sequences,2024.0,Various,Knowledge Graph and Personalized Answer Sequences for Programming KT,Applied Sciences 2024,KG + VAE,Uses knowledge graph and personalized sequences (VAE) for programming KT.,Programming datasets,Higher ed,Varies,AUC,Improved accuracy in programming KT.,Per paper,KG + personalization.,https://www.mdpi.com/2076-3417/14/14/7952,Uses knowledge graph and personalized sequences (VAE) for programming KT.,Programming datasets; Higher ed; Varies,Metrics: AUC. Improved accuracy in programming KT.,https://www.mdpi.com/2076-3417/14/14/7952
DPKT,Difficulty-Aware Programming KT via LLMs,2025.0,Various,Difficulty aware programming KT via large language models,Scientific Reports 2025,LLM + GAT,Combines LLM-derived difficulty with GAT for programming KT.,Programming datasets,Higher ed,Varies,AUC,Reported gains over baselines.,Per paper,Programming focus.,https://www.nature.com/articles/s41598-025-96540-3,Combines LLM-derived difficulty with GAT for programming KT.,Programming datasets; Higher ed; Varies,Metrics: AUC. Reported gains over baselines.,https://www.nature.com/articles/s41598-025-96540-3
LGS-KT,Logical & Grammatical Skills Programming KT,2025.0,Various,Integrating logical and grammatical skills for programming KT,Neural Networks 2025,Skill graphs (logic/grammar),Program-specific skill graphs for improved tracing.,Programming datasets,Higher ed,Varies,AUC,Improved programming KT accuracy.,Per paper,Skill-graph focus.,https://www.sciencedirect.com/science/article/pii/S0893608025000437,Program-specific skill graphs for improved tracing.,Programming datasets; Higher ed; Varies,Metrics: AUC. Improved programming KT accuracy.,https://www.sciencedirect.com/science/article/pii/S0893608025000437
SQKT,Students' Questions for Programming KT,2025.0,Various,Knowledge Tracing in Programming Education Integrating Students' Questions,ACL 2025,NLP/LLM-enhanced,Uses students' text questions to aid programming KT.,Programming datasets,Higher ed,Varies,AUC,Better performance using student questions.,Per paper,NLP signals.,https://aclanthology.org/2025.acl-long.1343/,Uses students' text questions to aid programming KT.,Programming datasets; Higher ed; Varies,Metrics: AUC. Better performance using student questions.,https://aclanthology.org/2025.acl-long.1343/
IEKT,Interpretable Exercise-aware KT,2021.0,Long et al.,Tracing Knowledge State with Individual Cognition and Exercise Memory,SIGIR 2021,Exercise-aware + cognition,Combines individual cognition with exercise memory for interpretability.,Public KT datasets,K–12,Varies,AUC,SOTA vs 11 baselines on several datasets.,Per paper,Interpretable components.,https://wnzhang.net/papers/2021-sigir-iekt.pdf,Combines individual cognition with exercise memory for interpretability.,Public KT datasets; K–12; Varies,Metrics: AUC. SOTA vs 11 baselines on several datasets.,https://wnzhang.net/papers/2021-sigir-iekt.pdf
KQN,Knowledge Query Network,2019.0,Lee et al.,Knowledge Query Network for KT,LAK 2019,Neural interaction (query),Models student–skill interactions via query/dot-product.,LAK datasets,Mixed,Varies,AUC,Matched/exceeded RNN baselines.,Per paper,Interpretable skill similarity.,https://dl.acm.org/doi/10.1145/3303772.3303786,Models student–skill interactions via query/dot-product.,LAK datasets; Mixed; Varies,Metrics: AUC. Matched/exceeded RNN baselines.,https://dl.acm.org/doi/10.1145/3303772.3303786
ERAKT,Exercise Representation & Association KT,2021.0,Liang et al.,Context-aware KT with Exercise Representation and Association,EDM 2021,Content-aware attention,Learns exercise representations and associations.,Math datasets,K–12,Varies,AUC,Outperforms common baselines on EDM’21 datasets.,Per paper,Exercise associations.,https://educationaldatamining.org/EDM2021/virtual/static/pdf/EDM21_paper_159.pdf,Learns exercise representations and associations.,Math datasets; K–12; Varies,Metrics: AUC. Outperforms common baselines on EDM’21 datasets.,https://educationaldatamining.org/EDM2021/virtual/static/pdf/EDM21_paper_159.pdf
QIKT,Question-centric Interpretable KT,2023.0,Various,Question-centric Interpretable KT,pyKT 2023,Interpretable KT,Item-level cognition and problem-solving modules.,Benchmarks,Mixed,Varies,AUC/ACC,SOTA/competitive with interpretability.,Per paper,pyKT model catalog.,https://pykt.org/tag/model/,Item-level cognition and problem-solving modules.,Benchmarks; Mixed; Varies,Metrics: AUC/ACC. SOTA/competitive with interpretability.,https://pykt.org/tag/model/
PSI-KT,"Predictive, Scalable & Interpretable KT",2025.0,Various,"PSI-KT: Predictive, Scalable and Interpretable KT",OpenReview 2025,Hierarchical generative Bayesian,Scalable Bayesian KT with interpretability.,Public datasets,Mixed,Varies,AUC/LogLoss,"Competitive, interpretable results.",Per paper,OpenReview preprint.,https://openreview.net/forum?id=rU9ee14aD2,Scalable Bayesian KT with interpretability.,Public datasets; Mixed; Varies,"Metrics: AUC/LogLoss. Competitive, interpretable results.",https://openreview.net/forum?id=rU9ee14aD2
RouterKT,RouterKT (Mixture-of-Experts),2025.0,Liao et al.,RouterKT: Mixture-of-Experts for KT,arXiv 2025,MoE personalization,Person-wise routing to specialized experts.,Ten benchmarks,Mixed,Large,AUC,Up to +3.29% average AUC over backbones.,Standardized,Heterogeneity handling.,https://arxiv.org/abs/2504.08989,Person-wise routing to specialized experts.,Ten benchmarks; Mixed; Large,Metrics: AUC. Up to +3.29% average AUC over backbones.,https://arxiv.org/abs/2504.08989
WEKT,Option-weighting MoE KT,2024.0,Various,WEKT: Option-weighting-enhanced MoE KT,OpenReview 2024,MoE with option-weighting,Weights options within experts for better modeling.,KT datasets,Mixed,Varies,AUC,Gains vs baselines.,Per paper,MoE variant.,https://openreview.net/pdf?id=vB-YqAvFRu,Weights options within experts for better modeling.,KT datasets; Mixed; Varies,Metrics: AUC. Gains vs baselines.,https://openreview.net/pdf?id=vB-YqAvFRu
KT^2,Hierarchical Probabilistic KT,2025.0,Gao et al.,A Hierarchical Probabilistic Framework for Incremental KT (KT^2),arXiv 2025,Hidden Markov Tree,Uses concept hierarchies for low-resource incremental KT.,Online course datasets,Mixed,Varies,AUC/ACC,Outperforms baselines in online low-data regimes.,Incremental,HMT over hierarchies.,https://arxiv.org/abs/2506.09393,Uses concept hierarchies for low-resource incremental KT.,Online course datasets; Mixed; Varies,Metrics: AUC/ACC. Outperforms baselines in online low-data regimes.,https://arxiv.org/abs/2506.09393
HKT,Hierarchical structure-based KT,2025.0,Li et al.,HKT: Hierarchical structure-based KT,IPM 2025,Hierarchical structures + attention/graph,Multiple knowledge levels refine mastery.,Hierarchical KC datasets,Mixed,Varies,AUC/LogLoss,Improved accuracy vs flat-KC baselines.,Per paper,Complements KT^2.,https://dl.acm.org/doi/10.1016/j.ipm.2025.104206,Multiple knowledge levels refine mastery.,Hierarchical KC datasets; Mixed; Varies,Metrics: AUC/LogLoss. Improved accuracy vs flat-KC baselines.,https://dl.acm.org/doi/10.1016/j.ipm.2025.104206
IKT-MKD,Incremental KT (Meta-knowledge Dictionary),2024.0,Dai et al.,Adaptive meta-knowledge dictionary learning for incremental KT,EAAI 2024,Incremental KT,Learns dynamic meta-knowledge dictionary for evolving knowledge.,KT-style logs,K–12/online,Varies,AUC/ACC,Outperforms baselines under incremental scenarios.,Incremental,Non-stationary progression.,https://www.sciencedirect.com/science/article/abs/pii/S0952197624001271,Learns dynamic meta-knowledge dictionary for evolving knowledge.,KT-style logs; K–12/online; Varies,Metrics: AUC/ACC. Outperforms baselines under incremental scenarios.,https://www.sciencedirect.com/science/article/abs/pii/S0952197624001271
OptimNN,OptimNN (Neural BKT parameterization),2025.0,Various,Parameter optimization for BKT using neural networks,JEDM 2025,Neural BKT parameters,Neural nets generate/optimize BKT parameters.,Tutor datasets,K–12,Varies,AUC/LogLoss,Improved fit/performance vs fixed-parameter BKT.,Per paper,Neural parameterization.,https://jedm.educationaldatamining.org/index.php/JEDM/article/view/693,Neural nets generate/optimize BKT parameters.,Tutor datasets; K–12; Varies,Metrics: AUC/LogLoss. Improved fit/performance vs fixed-parameter BKT.,https://jedm.educationaldatamining.org/index.php/JEDM/article/view/693
BKTransformer,BKTransformer (time-varying BKT),2025.0,Various,BKTransformer – dynamically changing parameters for BKT,JEDM 2025,Transformer-generated BKT params,Transformer modulates BKT parameters over time.,Tutor datasets,K–12,Varies,AUC/LogLoss,Dynamic parameters improve flexibility.,Per paper,Companion to OptimNN.,https://jedm.educationaldatamining.org/index.php/JEDM/article/view/693,Transformer modulates BKT parameters over time.,Tutor datasets; K–12; Varies,Metrics: AUC/LogLoss. Dynamic parameters improve flexibility.,https://jedm.educationaldatamining.org/index.php/JEDM/article/view/693
DTransformer,Diagnostic Transformer,2023.0,Yin et al.,Stable KT with Diagnostic Transformer,WWW 2023,Transformer + diagnostic training,(Duplicate consolidated),—,—,—,—,—,—,https://dl.acm.org/doi/10.1145/3543507.3583255,,(Duplicate consolidated),Evaluated on public KT benchmarks (WWW 2023 2023).,Metrics: —. —,
LG-Attn-KT,Length-Generalization Enhanced Attention KT,2024.0,IJCAI authors,Enhancing Length Generalization for Attention-Based KT,IJCAI 2024,Attention training,Improved training for longer test sequences.,Public datasets,Mixed,Varies,AUC,Helps with long sequences.,Per paper,Closely related to stableKT.,https://www.ijcai.org/proceedings/2024/654,Improved training for longer test sequences.,Public datasets; Mixed; Varies,Metrics: AUC. Helps with long sequences.,https://www.ijcai.org/proceedings/2024/654
SSA-KT,Student State-aware Attention KT,2024.0,Qian et al.,Student State-aware KT based on attention mechanism,PRL 2024,Attention + state modeling,Incorporates student state into attention.,Public datasets,Mixed,Varies,AUC,Improved AUC via state features.,Per paper,Pattern Recognition Letters.,https://www.sciencedirect.com/science/article/abs/pii/S016786552400179X,Incorporates student state into attention.,Public datasets; Mixed; Varies,Metrics: AUC. Improved AUC via state features.,https://www.sciencedirect.com/science/article/abs/pii/S016786552400179X
LoReKT,Low-Resource KT via Transfer,2024.0,Zhang et al.,Improving Low-Resource KT by Transfer from Rich Datasets,arXiv 2024,Pretrain→Finetune Transformer,Pretrains on rich datasets; transfers to low-resource KT.,Multiple KT benchmarks,Mixed,Varies,AUC/ACC/LogLoss,Improves low-resource performance.,Transfer,Stacked decoders.,https://arxiv.org/abs/2403.06725,Pretrains on rich datasets; transfers to low-resource KT.,Multiple KT benchmarks; Mixed; Varies,Metrics: AUC/ACC/LogLoss. Improves low-resource performance.,https://arxiv.org/abs/2403.06725
MGTT-KT,Multi-granularity Time-based Transformer,2023.0,Various,Multi-granularity Time-based Transformer for KT,arXiv 2023,Transformer + multi-scale time,Multi-granularity time embeddings for elapsed-time effects.,Public KT datasets,K–12/online,Varies,AUC/ACC,Improved accuracy via explicit time.,Per paper,Time-scale modeling.,https://arxiv.org/pdf/2304.05257,Multi-granularity time embeddings for elapsed-time effects.,Public KT datasets; K–12/online; Varies,Metrics: AUC/ACC. Improved accuracy via explicit time.,https://arxiv.org/pdf/2304.05257
ELAKT,Enhancing Locality for AKT,2024.0,Zhang et al.,Enhancing Locality for Attentive KT,ACM 2024,Attention (locality),Improves AKT with locality mechanisms.,Public KT datasets,Mixed,Varies,AUC/LogLoss,AUC/logloss gains over AKT/SAKT.,Per paper,Locality-aware.,https://dl.acm.org/doi/10.1145/3652601,Improves AKT with locality mechanisms.,Public KT datasets; Mixed; Varies,Metrics: AUC/LogLoss. AUC/logloss gains over AKT/SAKT.,https://dl.acm.org/doi/10.1145/3652601
CRKT,Concept-map + Response Disentanglement KT,2024.0,Park et al.,Enhancing KT with concept map and response disentanglement,KBS 2024,Concept map + disentanglement,Uses concept maps; disentangles response signals.,Public KT datasets,Mixed,Varies,AUC/LogLoss,Improved AUC when concept maps exist.,Per paper,Requires good maps.,https://www.sciencedirect.com/science/article/abs/pii/S0950705124009808,Uses concept maps; disentangles response signals.,Public KT datasets; Mixed; Varies,Metrics: AUC/LogLoss. Improved AUC when concept maps exist.,https://www.sciencedirect.com/science/article/abs/pii/S0950705124009808
DGCN-KT,Dual-GCN KT,2025.0,Wang et al.,Dual GCNs with positive/negative feature enhancement for KT,PLOS ONE 2025,Dual GCN (student/skill),Parallel student and skill graphs; feature enhancement.,Multiple datasets,K–12/online,Varies,AUC/ACC,Improved accuracy/robustness.,Per paper,Scalability to many KCs.,https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0317992,Parallel student and skill graphs; feature enhancement.,Multiple datasets; K–12/online; Varies,Metrics: AUC/ACC. Improved accuracy/robustness.,https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0317992
HCGKT,Hierarchical Contrastive Graph KT,2025.0,pyKT authors,HCGKT: Hierarchical Contrastive Graph KT with Multi-level Feature Learning,pyKT 2025,Graph + contrastive,Multi-level contrastive learning over graphs.,Public KT datasets,Mixed,Varies,AUC,Reported gains vs baselines.,Per paper,pyKT model page.,https://pykt.org/hcgkt,Multi-level contrastive learning over graphs.,Public KT datasets; Mixed; Varies,Metrics: AUC. Reported gains vs baselines.,https://pykt.org/hcgkt
RobustKT,Robust KT (decoupling cognitive pattern),2025.0,pyKT authors,Enhancing KT by Decoupling Cognitive Pattern from Error-Prone Data,pyKT 2025,Robust KT / denoising,Decouples cognitive signal from noise.,Public KT datasets,Mixed,Varies,AUC,Improved robustness.,Per paper,pyKT page.,https://pykt.org/roubstkt,Decouples cognitive signal from noise.,Public KT datasets; Mixed; Varies,Metrics: AUC. Improved robustness.,https://pykt.org/roubstkt
LefoKT,Learning & Forgetting for Attention KT,2025.0,pyKT authors,Rethinking Learning and Forgetting for Attention KT,pyKT 2025,Forgetting-aware attention,Improves attention KT with learning/forgetting processes.,Public KT datasets,Mixed,Varies,AUC,Performance gains reported.,Per paper,pyKT news.,https://pykt.org/news/index.html/,Improves attention KT with learning/forgetting processes.,Public KT datasets; Mixed; Varies,Metrics: AUC. Performance gains reported.,https://pykt.org/news/index.html/
csKT,Cold-start KT,2025.0,pyKT authors,csKT: Addressing Cold-start in KT via Kernel Bias and Cone Attention,pyKT 2025,Cold-start (kernel bias + cone attention),Targets cold-start explicitly.,Public KT datasets,Mixed,Varies,AUC,Improved cold-start performance.,Cold-start,pyKT news.,https://pykt.org/news/index.html/,Targets cold-start explicitly.,Public KT datasets; Mixed; Varies,Metrics: AUC. Improved cold-start performance.,https://pykt.org/news/index.html/
FlucKT,Cognitive Fluctuations Attention KT,2025.0,pyKT authors,Cognitive Fluctuations Enhanced Attention Network for KT,pyKT 2025,Attention + cognitive fluctuations,Models fluctuations explicitly.,Public KT datasets,Mixed,Varies,AUC,Reported improvements.,Per paper,pyKT model.,https://pykt.org/fluckt,Models fluctuations explicitly.,Public KT datasets; Mixed; Varies,Metrics: AUC. Reported improvements.,https://pykt.org/fluckt
DIMKT,Difficulty Matching KT,2023.0,pyKT authors,DIMKT: DIfficulty Matching KT,pyKT 2023,Difficulty-aware,Aligns difficulty matching across items.,Public KT datasets,Mixed,Varies,AUC/ACC,Gains via difficulty alignment.,Per paper,pyKT tag.,https://pykt.org/tag/model/,Aligns difficulty matching across items.,Public KT datasets; Mixed; Varies,Metrics: AUC/ACC. Gains via difficulty alignment.,https://pykt.org/tag/model/
QDKT,Question-centric DKT,2020.0,Sonkar et al.,qDKT: Question-centric Deep KT,arXiv 2020,RNN + question-centric,Explicit per-question modeling within DKT.,Benchmarks,Mixed,Varies,AUC,Improves over vanilla DKT.,Per paper,Question-centric.,https://arxiv.org/abs/2005.12442,Explicit per-question modeling within DKT.,Benchmarks; Mixed; Varies,Metrics: AUC. Improves over vanilla DKT.,https://arxiv.org/abs/2005.12442
AEED-KT,Attention-Enhanced Encoder–Decoder KT,2022.0,Zhang et al.,Knowledge Tracing via Attention‑Enhanced Encoder‑Decoder,Scientific Reports 2022,Encoder–decoder attention,Seq2seq attention for behavior sequences.,Public datasets,Mixed,Varies,AUC,Improved over baselines.,Per paper,Seq2seq KT.,https://pmc.ncbi.nlm.nih.gov/articles/PMC9789901/,Seq2seq attention for behavior sequences.,Public datasets; Mixed; Varies,Metrics: AUC. Improved over baselines.,https://pmc.ncbi.nlm.nih.gov/articles/PMC9789901/
DAKTN,Deep Attentive KT,2023.0,Chen et al.,A Deep Attentive Model for Knowledge Tracing,AAAI 2023,Attention + history assimilation,Assimilates historical behaviors into attention.,AAAI datasets,Mixed,Varies,AUC,Competitive improvements.,Per paper,Attentive assimilation.,https://ojs.aaai.org/index.php/AAAI/article/view/26214/25986,Assimilates historical behaviors into attention.,AAAI datasets; Mixed; Varies,Metrics: AUC. Competitive improvements.,https://ojs.aaai.org/index.php/AAAI/article/view/26214/25986
ReKT,Revisiting KT: Simple & Powerful,2024.0,Shen et al.,Revisiting Knowledge Tracing: A Simple and Powerful Model,OpenReview 2024,Lightweight FRU,Question/concept/domain views with FRU core.,pyKT benchmarks,Mixed,Varies,AUC,Efficient SOTA/near-SOTA.,Protocol-controlled,Efficiency focus.,https://openreview.net/pdf?id=vZEgj0clDp,Question/concept/domain views with FRU core.,pyKT benchmarks; Mixed; Varies,Metrics: AUC. Efficient SOTA/near-SOTA.,https://openreview.net/pdf?id=vZEgj0clDp
SparseKT,k-Sparse Attention KT,2023.0,SIGIR authors,Towards Robust KT via k-Sparse Attention,SIGIR 2023,Sparse attention,k-sparse attention to improve robustness.,Public datasets,Mixed,Varies,AUC,Robust gains reported.,Per paper,SIGIR 2023.,https://dl.acm.org/doi/10.1145/3539618.3592073,k-sparse attention to improve robustness.,Public datasets; Mixed; Varies,Metrics: AUC. Robust gains reported.,https://dl.acm.org/doi/10.1145/3539618.3592073
LoReKT,Low-Resource KT,2024.0,Zhang et al.,Improving Low-Resource KT,arXiv 2024,Transfer learning,(Duplicate consolidated),—,—,—,—,—,—,https://arxiv.org/abs/2403.06725,,(Duplicate consolidated),Evaluated on public KT benchmarks (arXiv 2024 2024).,Metrics: —. —,
TransKT,Contrastive Cross-Course KT,2025.0,Zhang et al.,Contrastive Cross‑Course KT via Concept Graph Guided Transfer,arXiv 2025,Contrastive transfer,Transfers across courses via concept graph and contrastive learning.,Cross-course datasets,Mixed,Varies,AUC,Improved cross-course generalization.,OOD splits,Transfer & contrastive.,https://arxiv.org/abs/2505.13489,Transfers across courses via concept graph and contrastive learning.,Cross-course datasets; Mixed; Varies,Metrics: AUC. Improved cross-course generalization.,https://arxiv.org/abs/2505.13489
LLM-KT,Aligning LLMs to KT,2025.0,Wang et al.,LLM-KT: Aligning LLMs with KT using Plug-and-Play Instruction,arXiv 2025,LLM alignment for KT,Aligns general LLMs to act as tracers via instruction.,Multiple KT benchmarks,Mixed,Varies,AUC/LogLoss,Improvements vs DLKT baselines under alignment.,Per paper,Plug-and-play.,https://arxiv.org/abs/2502.02945,Aligns general LLMs to act as tracers via instruction.,Multiple KT benchmarks; Mixed; Varies,Metrics: AUC/LogLoss. Improvements vs DLKT baselines under alignment.,https://arxiv.org/abs/2502.02945
QKT,Quiz-based KT,2023.0,Shen et al.,Quiz-based Knowledge Tracing,arXiv 2023,RNN/Attention + intra-quiz gate,Captures intra-quiz influence via gating and pooling.,Quiz-structured datasets,Mixed,Varies,AUC,Improved on quiz-format data.,Per paper,Intra-quiz modeling.,https://arxiv.org/abs/2304.02413,Captures intra-quiz influence via gating and pooling.,Quiz-structured datasets; Mixed; Varies,Metrics: AUC. Improved on quiz-format data.,https://arxiv.org/abs/2304.02413
CRKT,Concept-map KT,2024.0,Park et al.,Enhancing KT with concept map,KBS 2024,Concept map + disentanglement,(Duplicate consolidated),—,—,—,—,—,—,https://www.sciencedirect.com/science/article/abs/pii/S0950705124009808,,(Duplicate consolidated),Evaluated on public KT benchmarks (KBS 2024 2024).,Metrics: —. —,
HKT,Hierarchical KT,2025.0,Li et al.,HKT: Hierarchical KT,IPM 2025,Hierarchical structures,(Duplicate consolidated),—,—,—,—,—,—,https://dl.acm.org/doi/10.1016/j.ipm.2025.104206,,(Duplicate consolidated),Evaluated on public KT benchmarks (IPM 2025 2025).,Metrics: —. —,
KT Machines,Knowledge Tracing Machines,2019.0,Vie & Kashima,KTM,AAAI 2019,Factorization Machines,(Duplicate consolidated),—,—,—,—,—,—,https://arxiv.org/abs/1904.10603,,(Duplicate consolidated),Evaluated on public KT benchmarks (AAAI 2019 2019).,Metrics: —. —,
UniKT,Unified DLKT (UniKT),2025.0,Various,Improving KT through Multi-Source Unified DLKT,Preprint 2025,Decoder-only Transformer (unified),Unified DLKT formulation with multi-source modeling.,Public benchmarks,Mixed,Varies,AUC,Promising unified results.,Per paper,ResearchGate preprint.,https://www.researchgate.net/publication/386227027_Improving_Knowledge_Tracing_through_Multi-Source_and_Specific_Modeling_Unified_DLKT_UniKT,Unified DLKT formulation with multi-source modeling.,Public benchmarks; Mixed; Varies,Metrics: AUC. Promising unified results.,https://www.researchgate.net/publication/386227027_Improving_Knowledge_Tracing_through_Multi-Source_and_Specific_Modeling_Unified_DLKT_UniKT
AAKT,Alternate Autoregressive KT,2025.0,Various,Alternate Autoregressive Knowledge Tracing,Preprint 2025,Generative/autoregressive,Alternating autoregressive decoding for KT.,Public benchmarks,Mixed,Varies,AUC,Reported gains.,Per paper,Academia preprint.,https://www.academia.edu/123401571/Alternate_Autoregressive_Knowledge_Tracing,Alternating autoregressive decoding for KT.,Public benchmarks; Mixed; Varies,Metrics: AUC. Reported gains.,https://www.academia.edu/123401571/Alternate_Autoregressive_Knowledge_Tracing
DSMKT,Dual Sequence Modeling KT,2025.0,Yin et al.,Dual Sequence Modeling for KT,Information Systems Frontiers 2025,Hybrid attention + GRU,Dual sequence encoders with online distillation.,Public KT datasets,Mixed,Varies,AUC,Competitive accuracy and stability.,Per paper,Hybrid distillation.,https://link.springer.com/article/10.1007/s41019-025-00294-x,Dual sequence encoders with online distillation.,Public KT datasets; Mixed; Varies,Metrics: AUC. Competitive accuracy and stability.,https://link.springer.com/article/10.1007/s41019-025-00294-x
DASKT,Dynamic Affect Simulation KT,2025.0,Various,DASKT: A Dynamic Affect Simulation Method for KT,IEEE TKDE 2025,Affect-aware KT,Simulates affect dynamics to aid KT.,KT datasets,Mixed,Varies,AUC,Improved KT with affect simulation.,Per paper,Affect dynamics.,https://ieeexplore.ieee.org/document/10531694,Simulates affect dynamics to aid KT.,KT datasets; Mixed; Varies,Metrics: AUC. Improved KT with affect simulation.,https://ieeexplore.ieee.org/document/10531694
iBKT,Individualized Bayesian Knowledge Tracing,2013.0,"Yudelson, Koedinger, Gordon",Individualized Bayesian Knowledge Tracing Models,AIED 2013 (LNCS),BKT (hierarchical/personalized),Extends BKT with student-specific random effects to capture heterogeneity in prior knowledge and learning rates.,"Cognitive Tutor datasets (K–12 math), multiple skills",US K–12 mathematics in intelligent tutoring systems,Multiple courses; thousands of students; hundreds of thousands of interactions,"Accuracy, cross-validated likelihood (sometimes AUC)",Consistently outperforms standard BKT on held-out students across courses (improved fit and predictive accuracy).,Student-level cross-validation,Hierarchical Bayes; student-level parameters shrink toward population.,https://citeseerx.ist.psu.edu/document?doi=9b7727f3d23b7cc4d376efd42eb2da2a66dc7e37,Student-personalized BKT via hierarchical modeling of prior and learning parameters.,Cognitive Tutor logs across multiple math courses with per-skill opportunities.,Improved generalization to unseen students vs. vanilla BKT.,https://citeseerx.ist.psu.edu/document?doi=9b7727f3d23b7cc4d376efd42eb2da2a66dc7e37
CGS-BKT,Contextual Guess-and-Slip BKT,2008.0,"Baker, Corbett, Aleven",More Accurate Student Modeling through Contextual Estimation of Slip and Guess Probabilities in BKT,ITS 2008 (LNCS),BKT (contextualized),"Replaces fixed slip/guess with machine-learned, context-dependent estimates per interaction.","Cognitive Tutor (Algebra, Geometry, Middle School Math)",K–12 mathematics tutor logs with timing and help features,Several courses; thousands of students,"Model fit, prediction error (e.g., RMSE), post-test prediction",Better in-tutor prediction than standard BKT and lower degeneracy; mixed for post-test prediction.,Cross-validation on tutor logs; external post-test validation,"Uses contextual features (latency, history) to predict slip/guess.",https://learninganalytics.upenn.edu/ryanbaker/BCA2008W.pdf,Context-sensitive slip/guess estimation plugged into BKT.,Multiple Cognitive Tutor datasets with rich interaction features.,Improved in-system predictive fit vs. BKT; post-test gains varied.,https://learninganalytics.upenn.edu/ryanbaker/BCA2008W.pdf
KT-IDEM,Item Difficulty Effect Model for KT,2011.0,"Pardos, Heffernan",KT-IDEM: Introducing Item Difficulty to the Knowledge Tracing Model,UMAP 2011,BKT (item-difficulty),Augments BKT with item-specific slip/guess to model per-item difficulty within skills.,Two Cognitive Tutor systems (math),ITS math problems with item templates/skills,Large-scale tutor logs; thousands of items,Prediction accuracy / log-likelihood improvements,Substantial performance gains over standard BKT across datasets.,Train/test splits on tutor logs,Bridges IRT difficulty with KT’s per-skill state model.,https://people.csail.mit.edu/zp/papers/UMAP2011_IDEM.pdf,Per-item difficulty via slip/guess nodes extends BKT.,Cognitive Tutor math datasets with per-item labels.,Improved predictive performance vs. BKT baselines.,https://people.csail.mit.edu/zp/papers/UMAP2011_IDEM.pdf
DKT-Forget,Deep Knowledge Tracing with Forgetting,2019.0,"Nagatani, Zhang, Sato, Chen, Chen, Ohkuma",Augmenting Knowledge Tracing by Considering Forgetting Behavior,WWW 2019,Deep sequential (RNN),Extends DKT by incorporating time lag and practice count features to model forgetting.,"ASSISTments, EdNet/other large KT logs",K–12 math (online practice),Hundreds of thousands to millions of interactions,"AUC-ROC, ACC",Outperforms DKT and other baselines on multiple datasets (AUC improvements reported).,Standard random/student-wise splits by dataset,Implements multiple forgetting signals as inputs to RNN.,https://dl.acm.org/doi/10.1145/3308558.3313565,"RNN-based KT with explicit forgetting features (time gaps, prior attempts).",Large-scale clickstream/tutor logs in math.,Consistent AUC gains over vanilla DKT.,https://dl.acm.org/doi/10.1145/3308558.3313565
EERNNM,Exercise-Enhanced RNN (Markov),2019.0,"Liu, Huang, Yin, Chen, Xiong, Su, Hu",EKT/EERNN: Exercise-Enhanced Sequential Modeling for Student Performance Prediction (Markov variant),TKDE 2021 (extended from 2019 preprint),Deep sequential (RNN + content),Encodes exercise content with BiLSTM and updates a student state with a Markov assumption.,"Large-scale online education datasets (e.g., K–12 math, language)",Exercises with text/content and concept tags,Millions of interactions across thousands of students,"AUC-ROC, ACC, NLL","Beats BKT, PFA, and DKT baselines; Markov variant competitive.",Temporal splits with held-out students/items,Part of EERNN/EKT framework using content and concepts.,https://bigdata.ustc.edu.cn/paper_pdf/2019/Qi-Liu-TKDE.pdf,Content-aware RNN with Markov prediction head.,Content-rich logs with question text and concept labels.,Improved predictive metrics vs. classic KT.,https://bigdata.ustc.edu.cn/paper_pdf/2019/Qi-Liu-TKDE.pdf
EERNNA,Exercise-Enhanced RNN (Attention),2019.0,"Liu, Huang, Yin, Chen, Xiong, Su, Hu",EKT/EERNN: Exercise-Enhanced Sequential Modeling for Student Performance Prediction (Attention variant),TKDE 2021 (extended from 2019 preprint),Deep sequential (RNN + attention + content),Uses attention over exercise content to form predictions from a student state vector.,Same as EERNNM,Content-informed KT with text embeddings,Millions of interactions,"AUC-ROC, ACC",Slightly improves over EERNNM and DKT on several datasets.,Temporal splits; student-wise CV,Part of EERNN/EKT suite; more interpretable via content attention.,https://bigdata.ustc.edu.cn/paper_pdf/2019/Qi-Liu-TKDE.pdf,Attention head on content-aware RNN for KT.,Textual exercises plus concept tags.,Edge over Markov variant and DKT in AUC.,https://bigdata.ustc.edu.cn/paper_pdf/2019/Qi-Liu-TKDE.pdf
EKT,Exercise-aware Knowledge Tracing,2019.0,"Liu, Huang, Yin, Chen, Xiong, Su, Hu",EKT: Exercise-Aware Knowledge Tracing for Student Performance Prediction,TKDE 2021 (journal),Deep sequential (Memory/Matrix w/ content),Extends EERNN by tracking a concept-by-student knowledge matrix updated via a memory network.,Large-scale K–12 logs with question text & concepts,Math & other subjects with content features,Millions of interactions across thousands of students,"AUC-ROC, ACC","Improves over DKT, DKVMN, and EERNN variants on multiple datasets.",Temporal splits; student-wise in some settings,Provides interpretable per-concept mastery estimates.,https://bigdata.ustc.edu.cn/paper_pdf/2019/Qi-Liu-TKDE.pdf,Memory-style concept matrix updated by content-aware signals.,Content-rich exercises with concept mapping.,State-of-the-art among content-aware KT at time of publication.,https://bigdata.ustc.edu.cn/paper_pdf/2019/Qi-Liu-TKDE.pdf
GKT,Graph-based Knowledge Tracing,2019.0,"Nakagawa, Imai, Nishimura, Yano",Graph-based Knowledge Tracing: Modeling Student Proficiency using Graph Neural Networks,ICLR 2019 (workshop)/WebConf 2021,Graph-based KT,Models dependencies among concepts/exercises via GNNs to update student knowledge states.,"Assistments, EdNet/others",K–12 online practice with concept relations,Large logs (hundreds of thousands interactions),AUC-ROC,Outperforms DKT variants on datasets with strong concept structure.,Standard KT splits,Leverages exercise/skill graphs for propagation.,https://openreview.net/forum?id=H1gqODRcKX,GNN-based propagation over concept/exercise graphs for KT.,Datasets with explicit concept links benefit most.,AUC gains vs. sequence-only baselines.,https://openreview.net/forum?id=H1gqODRcKX
RKT,Relation-aware Self-Attention KT,2020.0,"Pandey, Karypis",RKT: Relation-Aware Self-Attention for Knowledge Tracing,arXiv 2020,Transformer (relation-aware),Enhances self-attention with relation embeddings capturing skill-skill and item-skill relations.,"ASSISTments 2009/2015/2017, Statics2011",K–12 math and engineering statics,Tens to hundreds of thousands of interactions,"AUC-ROC, ACC",Improves upon SAKT/DKT baselines on multiple datasets.,Random/student-wise splits per dataset,Explicitly models relations as biases in attention scores.,https://arxiv.org/abs/2008.09757,Relation-augmented transformer for student performance prediction.,Standard KT benchmarks with concept mappings.,Consistent AUC improvements over prior attention-based KT.,https://arxiv.org/abs/2008.09757
CAKT,Convolution Attentive Knowledge Tracing,2024.0,"Dai, Yin, Li, et al.",Convolution Attentive Knowledge Tracing with Comprehensive Behavioral Features,WWW 2024 (companion/short),CNN + Attention KT,"Combines temporal convolutions with attention and rich behavioral features (time, attempts, hints).",Public KT datasets (ASSISTments etc.),K–12 online practice with behavioral logs,Hundreds of thousands of interactions,AUC-ROC,Outperforms SAKT/AKT baselines when behavior signals are informative.,Standard KT evaluation,Adds engineered features to deep KT.,https://arxiv.org/abs/2405.04647,Hybrid conv-attention KT leveraging behavioral covariates.,Benchmarks with time/attempt metadata.,SOTA or near-SOTA AUC on several benchmarks.,https://arxiv.org/abs/2405.04647
Deep-IRT,Deep Item Response Theory for KT,2019.0,"Yeung, Yeung",Deep-IRT: Make Deep Learning based Knowledge Tracing Explainable using Item Response Theory,ArXiv 2019,Deep + IRT hybrid,Integrates IRT parameters with deep sequence encoders to produce interpretable ability and difficulty estimates.,"ASSISTments, Junyi/others",K–12 practice datasets with item/skill metadata,Large student-item interaction logs,"AUC-ROC, NLL",Competitive with DKT/DKVMN while yielding explainable parameters.,Standard splits per dataset,Bridges psychometrics and DLKT.,https://arxiv.org/abs/1904.11738,Explainable KT via deep encoder + IRT latent variables.,Multiple KT benchmarks with item metadata.,Similar or better AUC than DKT with interpretability.,https://arxiv.org/abs/1904.11738
SSAKT,Sequential Self-Attentive Knowledge Tracing,2021.0,"Zhang, Zhang, Lin, Yang",Sequential Self-Attentive Model for Knowledge Tracing,ICANN 2021,Transformer (self-attention),Applies multi-head self-attention over interaction sequences to model long-range dependencies.,"ASSISTments, STATICS, etc.",K–12 math and engineering,Standard KT benchmarks,AUC-ROC,Outperforms DKT/SAKT on several datasets.,Random/student-wise splits,Refines attention patterns and positional encoding.,https://link.springer.com/chapter/10.1007/978-3-030-86383-8_26,Enhanced transformer-style KT with improved attention.,Public KT datasets with concept tags.,AUC gains vs. earlier attention KT.,https://link.springer.com/chapter/10.1007/978-3-030-86383-8_26
SFBKT,Synthesized Forgetting Behavior KT,2023.0,"Hshemi, Al Sadhan, et al.",Knowledge Tracing Model with Learning and Forgetting Behavior,Information (MDPI) 2023,BKT/DL hybrid,Incorporates explicit learning and forgetting dynamics to better reflect memory processes in KT.,Standard KT datasets,Online learning logs with time gaps,Not specified; typical benchmarks,"AUC-ROC, ACC",Reported improvements over DKT on benchmarks.,Standard splits,Explicit decay mechanisms.,https://www.semanticscholar.org/paper/Knowledge-Tracing-Model-with-Learning-and-Behavior-Chen-Guan/7630e83f071eaf13c5b08f26dca5e971404f9db4,Adds forgetting dynamics to KT predictions.,Benchmarks with temporal signals.,Better AUC/ACC than DKT in experiments.,https://www.semanticscholar.org/paper/Knowledge-Tracing-Model-with-Learning-and-Behavior-Chen-Guan/7630e83f071eaf13c5b08f26dca5e971404f9db4
TBKT,Time-aware Bayesian Knowledge Tracing,2011.0,"Qiu, Pardos, Heffernan",Does Time Matter? Modeling the Effect of Time with Bayesian Knowledge Tracing,EDM 2011,BKT (time-aware),Augments BKT with time-dependent forgetting/leakage between practice opportunities.,ASSISTments 2009-2010,K–12 math with timestamps,Tens of thousands of interactions,AUC-ROC / prediction error,Time-aware variants (KT-Forget/KT-Slip) improve predictive accuracy over standard BKT.,Cross-validation on student streams,Introduces decay as function of time gaps.,https://files.eric.ed.gov/fulltext/ED537187.pdf,Adds temporal decay to classical BKT.,ASSISTments math logs with timestamps.,Higher predictive accuracy than vanilla BKT.,https://files.eric.ed.gov/fulltext/ED537187.pdf
PC-BKT,Personalized Clustered BKT (with Forgetting),2015.0,"Nedungadi, Remya","Incorporating forgetting in the personalized, clustered, Bayesian knowledge tracing (PC-BKT) model",CCIP 2015,BKT (personalized+forgetting),Clusters students and personalizes BKT parameters while modeling forgetting effects.,Educational datasets (varied),Adaptive practice with student clusters,Not specified,Prediction accuracy,Improves over base BKT on prediction tasks.,Held-out evaluation,"Combines clustering, personalization, and decay.",https://arxiv.org/pdf/2105.15106.pdf,"Clustered, personalized BKT with decay component.",KT datasets with heterogeneous learners.,Better fit/accuracy than BKT in reported studies.,https://arxiv.org/pdf/2105.15106.pdf
FDKT,Federated Deep Knowledge Tracing,2021.0,"Wu, et al.",Federated Deep Knowledge Tracing,WWW 2021,Federated DLKT,Trains DKT collaboratively across silos using federated learning to improve privacy and generalization.,Multiple institutional datasets (simulated federation),Cross-silo KT with privacy constraints,Varies; large interaction logs partitioned by client,"AUC-ROC, ACC",Comparable or better AUC vs. centralized baselines under realistic federation.,Client-wise splits; federated rounds,Addresses data silos and privacy.,https://dl.acm.org/doi/10.1145/3437963.3441747,Federated optimization of DKT across institutions.,Partitioned KT datasets spanning multiple clients.,Maintains/boosts AUC without centralizing data.,https://dl.acm.org/doi/10.1145/3437963.3441747
DHKTr,Deep Hierarchical Knowledge Tracing,2019.0,"Wang, Ma, Gao",Deep Hierarchical Knowledge Tracing,EDM 2019,Hierarchical deep KT,Learns hierarchical abstractions over interactions to capture multi-level learning patterns.,"KT benchmarks (e.g., ASSISTments)",K–12 and higher-ed datasets,Standard benchmark scales,AUC-ROC,Improves over DKT on several datasets.,Standard KT splits,Introduces hierarchy to model different temporal scales.,https://par.nsf.gov/servlets/purl/10157350,Hierarchical deep architecture for KT.,Public KT datasets with concept labels.,AUC gains vs. DKT.,https://par.nsf.gov/servlets/purl/10157350
Code-DKT,Code-based Deep Knowledge Tracing,2022.0,"Shi, Abdi, Iqbal, Price",A Code-based Knowledge Tracing Model for Programming Education,EDM 2022,Domain-specific DLKT,"In programming courses, attends over code features to model knowledge state and predict correctness.",Intro programming class (code submissions),"University-level CS; ~50 students, 5 assignments",~50 students; multiple code attempts,"AUC-ROC, ACC",Outperforms BKT and DKT on the code dataset.,Student-wise evaluation,Domain-specific feature encoder for source code.,https://educationaldatamining.org/edm2022/proceedings/2022.EDM-long-papers.5/index.html,Attention over code-derived features for KT.,Small-scale CS education dataset with rich code artifacts.,Beats general KT baselines on programming tasks.,https://educationaldatamining.org/edm2022/proceedings/2022.EDM-long-papers.5/index.html
CoKT,Collaborative Knowledge Tracing,2022.0,"Long, et al.",Improving Knowledge Tracing with Collaborative Information,WSDM 2022,Graph/Collaborative KT,Incorporates inter-student collaborative signals to improve knowledge state estimation.,Online education datasets with social signals,K–12/online learning with collaboration,Large-scale logs,AUC-ROC,Outperforms sequence-only KT baselines.,Standard KT splits,Leverages collaborative filtering signals.,https://dl.acm.org/doi/10.1145/3488560.3498374,Collaborative information augments KT sequence modeling.,Datasets including co-learning/collaboration traces.,AUC improvements over DKT/SAKT.,https://dl.acm.org/doi/10.1145/3488560.3498374
LBKT,Learning Behavior‑oriented Knowledge Tracing,2023.0,"Xu, Huang, Liu, Shen, Liu, Wu, Wang",Learning Behavior‑oriented Knowledge Tracing,KDD 2023,Behavior‑aware DLKT,"Models the effects of behaviors (speed, attempts, hints) on learning and forgetting to update knowledge state.",Multiple KT benchmarks (ASSISTments etc.),K–12 online practice with behavior logs,Large-scale logs,AUC-ROC,Outperforms strong baselines including AKT/SAKT on several datasets.,Standard KT evaluation,Explicit behavior channels integrated into KT.,https://staff.ustc.edu.cn/~huangzhy/files/papers/BihanXu-KDD2023.pdf,Behavior channels drive learning/forgetting updates.,Benchmarks with speed/attempt/hint features.,SOTA/near-SOTA AUC on multiple datasets.,https://staff.ustc.edu.cn/~huangzhy/files/papers/BihanXu-KDD2023.pdf
MSCAKT,Multi‑Scale Convolutional Attention KT,2025.0,"Dai, Bai, Guo, et al.",MSCAKT: Knowledge Tracing with Multiscale State Representation,The Web Conference 2024/2025 (preprint),CNN + Attention KT,Builds multi‑scale temporal representations with convolutions to enrich attention-based KT.,ASSISTments/EdNet benchmarks,K–12 practice logs,Large-scale interactions,AUC-ROC,Reported SOTA/near-SOTA on several datasets in paper.,Standard KT splits,Addresses multi‑granularity temporal patterns.,https://arxiv.org/abs/2501.14256,Multi‑scale conv features feed attention KT.,Standard KT datasets with long sequences.,Improved AUC over prior conv/attention KT.,https://arxiv.org/abs/2501.14256
MBKT,Markov Blanket Knowledge Tracing,2024.0,"Jiang, Bahreini, Chagnon-Lessard, Fakse, et al.",Markov Blankets for Interpretable Deep Knowledge Tracing,ArXiv 2024,Interpretable DLKT,Wraps deep KT with Markov blanket analysis to identify the minimal sufficient factors influencing predictions.,"Public KT datasets (ASSISTments, Junyi, etc.)",K–12/online learning,Standard KT benchmarks,AUC-ROC with interpretability metrics,Maintains competitive AUC while improving interpretability.,Standard KT evaluation,Focus on interpretability of DLKT via graphical analysis.,https://arxiv.org/abs/2401.14555,Post-hoc graphical analysis for DLKT explanations.,Multiple KT benchmarks; interpretability case studies.,Comparable AUC to black-box baselines with added explanations.,https://arxiv.org/abs/2401.14555
