Findings of the WMT 2023 Shared Task on Low-Resource Indic Language Translation

Published: 01 Jan 2023, Last Modified: 15 Jun 2024, Venue: WMT 2023, License: CC BY-SA 4.0
Abstract: This paper presents the results of the low-resource Indic language translation task organized alongside the Eighth Conference on Machine Translation (WMT) 2023. Participants were asked to build machine translation systems for any of four language pairs: English-Assamese, English-Mizo, English-Khasi, and English-Manipuri. For this task, the IndicNE-Corp1.0 dataset was released; it consists of parallel and monolingual corpora for northeastern Indic languages such as Assamese, Mizo, Khasi, and Manipuri. Submitted systems are evaluated using automatic metrics (BLEU, TER, RIBES, COMET, ChrF) and human evaluation.