HkAmsters at CMCL 2022 Shared Task: Predicting Eye-Tracking Data from a Gradient Boosting Framework with Linguistic FeaturesDownload PDF

Published: 28 Mar 2022, Last Modified: 23 May 2023CMCL Shared TaskReaders: Everyone
Keywords: eye-tracking, computational model, surprisal, psycholinguistics
TL;DR: Regression model with linguistic features and surprisal
Abstract: Eye movement data are used in psycholinguistic studies to infer information regarding cognitive processes during reading. In this paper, we describe our proposed method for the Shared Task of Cognitive Modeling and Computational Linguistics (CMCL) 2022 - Subtask 1, which involves data from multiple datasets on 6 languages. We compared different regression models using features of the target word and its previous word, and target word surprisal as regression features. Our final system, using a gradient boosting regressor, achieved the lowest mean absolute error (MAE), resulting in the best system of the competition.
3 Replies

Loading