Correcting Sense Annotations Using Wordnets and Translations

Published: 01 Jan 2023, Last Modified: 09 Jul 2024GWC 2023EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Acquiring large amounts of high-quality annotated data is an open issue in word sense disambiguation. This problem has become more critical recently with the advent of supervised models based on neural networks, which require large amounts of annotated data. We propose two algorithms for making selective corrections on a sense-annotated parallel corpus, based on cross-lingual synset mappings. We show that, when applied to bilingual parallel corpora, these algorithms can rectify noisy sense annotations, and thereby produce multilingual sense-annotated data of improved quality.
Loading