GrammaMT: Improving Machine Translation with Grammar-Informed In-Context Learning

ACL ARR 2025 February Submission332 Authors

06 Feb 2025 (modified: 09 May 2025)ACL ARR 2025 February SubmissionEveryoneRevisionsBibTeXCC BY 4.0
Abstract: We introduce GrammaMT, a grammatically-aware prompting approach for machine translation that uses Interlinear Glossed Text (IGT), a common form of linguistic description providing morphological and lexical annotations for source sentences.GrammaMT proposes three prompting strategies: gloss-shot, chain-gloss and model-gloss. All are training-free, requiring only a few examples that involve minimal effort to collect, and making them well-suited for low-resource setups. Experiments show that GrammaMT enhances translation performance on open-source instruction-tuned LLMs for various low- to high-resource languages across three benchmarks: (1) the largest IGT corpus, (2) the challenging 2023 SIGMORPHON Shared Task data over endangered languages, and (3) even in an out-of-domain setting with FLORES. Moreover, ablation studies reveal that leveraging gloss resources could substantially boost MT performance (by over 17 BLEU points) if LLMs accurately generate or access input sentence glosses.
Paper Type: Long
Research Area: Machine Translation
Research Area Keywords: Machine Translation, Large Language Models, In-context Learning, Prompting
Contribution Types: Approaches to low-resource settings
Languages Studied: Gitksan, Lezgi, Natugu, Tsez, Swahili, Yoruba, Icelandic, Marathi, Kannada, Urdu, Thai, Greek, Portuguese, Japanese, Russian, Arabic, English
Submission Number: 332
Loading