FairFace: A Novel Face Attribute Dataset for Bias Measurement and Mitigation

Kimmo Kärkkäinen; Jungseock Joo

25 Sept 2019 (modified: 05 May 2023); ICLR 2020 Conference Withdrawn Submission; Readers: Everyone

Keywords: dataset bias, face attribute recognition, bias measurement

TL;DR: A new face image dataset balanced across race, gender, and age, which can be used for bias measurement and mitigation

Abstract: Existing public face image datasets are strongly biased toward Caucasian faces, and other races (e.g., Latino) are significantly underrepresented. Models trained on such datasets suffer from inconsistent classification accuracy, which limits the applicability of face analytic systems to non-White race groups. To mitigate the race bias problem in these datasets, we constructed a novel face image dataset containing 108,501 images that is balanced on race. We define 7 race groups: White, Black, Indian, East Asian, Southeast Asian, Middle Eastern, and Latino. Images were collected from the YFCC-100M Flickr dataset and labeled with race, gender, and age groups. Evaluations were performed on existing face attribute datasets as well as novel image datasets to measure generalization performance. We find that a model trained on our dataset is substantially more accurate on novel datasets and that its accuracy is consistent across race and gender groups. We also compare several commercial computer vision APIs and report their balanced accuracy across gender, race, and age groups.
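
To make the idea of accuracy that is "consistent across race and gender groups" concrete, the following is a minimal sketch of one way to measure it: compute classification accuracy separately for each group and report the spread between the best and worst group. The function names (`per_group_accuracy`, `accuracy_gap`) and the toy labels are hypothetical illustrations, not the authors' evaluation code or data, and the abstract does not specify the exact metric definitions used in the paper.

```python
from collections import defaultdict


def per_group_accuracy(y_true, y_pred, groups):
    """Return {group: accuracy} for predictions split by a group label."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for truth, pred, group in zip(y_true, y_pred, groups):
        total[group] += 1
        correct[group] += int(truth == pred)
    return {group: correct[group] / total[group] for group in total}


def accuracy_gap(acc_by_group):
    """Difference between the best and worst per-group accuracy."""
    values = list(acc_by_group.values())
    return max(values) - min(values)


# Toy, made-up gender predictions grouped by race label (illustration only).
y_true = ["F", "M", "F", "M", "F", "M"]
y_pred = ["F", "M", "M", "M", "F", "F"]
groups = ["White", "White", "Black", "Black", "Latino", "Latino"]

acc = per_group_accuracy(y_true, y_pred, groups)
gap = accuracy_gap(acc)
print(acc, gap)
```

A smaller gap indicates more consistent accuracy across groups; a model can have high overall accuracy and still show a large gap if one group is underrepresented in training.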
