Evaluating Gender Bias in Machine Translation for Low-Resource Languages

Evaluating Gender Bias in Machine Translation for Low-Resource Languages

ICLR 2024 Workshop AfricaNLP Submission38 Authors

Published: 03 Mar 2024, Last Modified: 29 Sept 2024AfricaNLP 2024EveryoneRevisionsBibTeXCC BY 4.0

Keywords: Machine Translation, Low- Resource Languages, Gender Bias, Amharic, Afan Oromo, Tigrigna

Abstract: While Machine Translation (MT) research has progressed over the years, translation systems still suffer from exhibiting biases, including gender bias. While an active line of research studies the existence and mitigation strategies of gender bias in machine translation systems, there is limited research exploring this phenomenon for low-resource languages. The limited availability of linguistic and computational resources confounded with the lack of benchmark datasets makes studying bias for low-resourced languages that much more difficult. In this paper, we construct benchmark datasets for evaluating gender bias in machine translation for three low-resourced languages: Afan Oromo (orm), Amharic (amh), and Tigrinya (tig). Building on prior work, we collected 2400 gender-balanced sentences parallelly translated into the three languages. From our human evaluations on the dataset we collected, we found that about 93% of Afan Oromo, 80% of Tigrigna, and 72\% of Amharic sentences exhibited gender bias. In addition to providing benchmarks for improving gender bias mitigation research in the three languages, we hope the careful documentation of our work will help other low-resourced language researchers extend our approach to their languages.

Submission Number: 38

Loading