Dependency, Structure & Memory: A Reading Time Benchmark for Sentence Processing Model Evaluation
Keywords: sentence processing, working memory, benchmark, dataset, model evaluation
TL;DR: A new reading-time dataset systematically varies syntactic complexity and dependency length, includes working memory scores for 540 participants, and offers a controlled, cognitively grounded benchmark for syntax-sensitive language model evaluation.
Submission Number: 44