Improving Automatic Grammatical Error Annotation for Chinese Through Linguistically-Informed Error Typology

Published: 01 Jan 2025, Last Modified: 06 Feb 2025COLING 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Comprehensive error annotation is essential for developing effective Grammatical Error Correction (GEC) systems and delivering meaningful feedback to learners. This paper introduces improvements to automatic grammatical error annotation for Chinese. Our refined framework addresses language-specific challenges that cause common spelling errors in Chinese, including pronunciation similarity, visual shape similarity, specialized participles, and word ordering. In a case study, we demonstrated our system’s ability to provide detailed feedback on 12-16% of all errors by identifying them under our new error typology, specific enough to uncover subtle differences in error patterns between L1 and L2 writings. In addition to improving automated feedback for writers, this work also highlights the value of incorporating language-specific features in NLP systems.
Loading