Word Sense Disambiguation for Ancient Greek: Sourcing a training corpus through translation alignment

Published: 01 Jan 2023, Last Modified: 29 May 2024, Venue: ALP@RANLP 2023, License: CC BY-SA 4.0
Abstract: This paper seeks to leverage translations of Ancient Greek texts to improve automatic word sense disambiguation (WSD). Satisfactory WSD in Ancient Greek is achievable, provided that the system can rely on annotated data. Acknowledging how difficult it is to assign meanings manually to every Greek lemma, this study explores strategies for deriving WSD data from parallel texts using sentence and word alignment. Our results suggest that, for words of sufficiently high frequency, this technique can automatically produce a substantial volume of annotated data, although significant obstacles to automating the process remain.
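The general idea of deriving sense annotations from word-aligned translations can be illustrated with a minimal sketch. It is not the authors' pipeline: the toy alignment pairs, the function name `project_sense_labels`, and the threshold `min_lemma_freq` are all hypothetical, and the sketch simply assumes that the English word aligned to a Greek lemma can serve as a rough proxy for its sense.

```python
from collections import Counter, defaultdict

# Hypothetical toy input: (Greek lemma, aligned English word) pairs, such as a
# word aligner might produce from sentence-aligned parallel text.
# These pairs are invented for illustration and are not data from the paper.
aligned_pairs = [
    ("λόγος", "word"),
    ("λόγος", "speech"),
    ("λόγος", "word"),
    ("λόγος", "reason"),
    ("λόγος", "word"),
    ("ἀρετή", "virtue"),
]


def project_sense_labels(pairs, min_lemma_freq=3):
    """Group aligned translations by lemma, keeping only frequent lemmas.

    The translation word stands in as a sense label (an assumption of this
    sketch). Lemmas seen fewer than `min_lemma_freq` times are dropped.
    """
    by_lemma = defaultdict(Counter)
    for lemma, translation in pairs:
        by_lemma[lemma][translation] += 1
    return {
        lemma: dict(counts)
        for lemma, counts in by_lemma.items()
        if sum(counts.values()) >= min_lemma_freq
    }


sense_inventory = project_sense_labels(aligned_pairs)
print(sense_inventory)
```

In this toy run the frequent lemma keeps its distribution of candidate sense labels, while the lemma seen only once is filtered out, which mirrors the abstract's condition that word frequency must be high for the technique to yield usable data.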