You reap what you sow: On the Challenges of Bias Evaluation Under Multilingual Settings

Zeerak Talat; Aurélie Névéol; Stella Biderman; Miruna Clinciu; Manan Dey; Shayne Longpre; Sasha Luccioni; Maraim Masoud; Margaret Mitchell; Dragomir Radev; Shanya Sharma; Arjun Subramonian; Jaesung Tae; Samson Tan; Deepak Tunuguntla; Oskar van der Wal

Published: 09 Apr 2022, Last Modified: 05 May 2023, BigScience#5, Readers: Everyone

Keywords: Evaluation, bias in machine learning, fairness, large language models, multilingual NLP

Abstract: Evaluating bias, fairness, and social impact in monolingual language models is a difficult task. This challenge is further compounded when language modeling occurs in a multilingual context. Considering the implication of evaluation biases for large multilingual language models, we situate the discussion of bias evaluation within a wider context of social scientific research with computational work. We highlight three dimensions of developing multilingual bias evaluation frameworks: (1) increasing transparency through documentation, (2) expanding targets of bias beyond gender, and (3) addressing cultural differences that exist between languages. We further discuss the power dynamics and consequences of training large language models and recommend that researchers remain cognizant of the ramifications of developing such technologies.
