Gender and Ethnicity Bias Mitigation in Automatic Recruitment

Angela Petkova; Farnaz Faramarzi Lighvan; Lynn Houthuys

Gender and Ethnicity Bias Mitigation in Automatic Recruitment

Angela Petkova, Farnaz Faramarzi Lighvan, Lynn Houthuys

Published: 15 Oct 2025, Last Modified: 31 Oct 2025BNAIC/BeNeLearn 2025 PosterEveryoneRevisionsBibTeXCC BY 4.0

Track: Type D (Master/Bachelor Thesis Abstracts)

Keywords: Machine Learning, Fairness in AI, Automatic Recruitment, Natural Language Processing, Deep Learning, Human-centered AI, AI Applications in Industry

Abstract: Recent advances in machine learning have led to decision-making algorithms being used in recruitment to help managers identify candidates. An example is Amazon’s Recruitment tool, an AI-driven hiring system that was trained on resumes submitted to the company over 10 years. Without explicit gender labels, it learned to associate words like “women’s chess club captain” or women’s colleges with female candidates. Because the majority of the training data came from men who previously applied for jobs at Amazon, the algorithm learned to prefer male candidates, leading to discriminatory hiring decisions. This shows that machine learning models can detect and exploit sensitive attributes, such as gender or ethnicity, by identifying patterns in unstructured resume data, even when those attributes are not explicitly provided. This bachelor thesis focuses on creating fairer hiring outcomes by debiasing the unstructured textual component of a resume: the biography. This component has not been previously addressed in isolation and represents a complementary modality to the visual debiasing approach introduced by Peña et al. (2020). The biography was debiased for the sensitive gender attribute. The approach combined two techniques: scrubbing and round-trip translation. Scrubbing removes or replaces explicit gender- or ethnicity-related indicators. Explicit gender indicators included first names, gendered titles, -pronouns, -occupational terms, and familial terms. Gendered occupational terms and familial terms were replaced with gender-neutral alternatives rather than being removed, as they carry relational or professional information. Round-trip translation was applied to remove implicit gender indicators by translating the text into the pivot language, French and then back to the original using the Google Translate API, thereby stripping away stylistic features while preserving the original sentence’s meaning. To evaluate this debiasing approach, we used Peña et al. (2020)'s framework on the FairCVtest dataset, which contains 24,000 resumes with 12 structured features and a biography section. Each candidate has a Target Score from 0 to 1, indicating predicted job suitability. Two types of scores are provided: Unbiased and Gender-biased. The biased scores simulate real-world scenarios where certain demographic groups are disadvantaged. We create a biased regression model, using original biographies, and an agnostic regression model, using debiased biographies (scrubbing + round-trip translation). Both models will be trained on biased scores, along with the structured features (excluding the sensitive feature of gender). In all cases, the model is used to rank candidates by predicting a score for each candidate. The top 100 are selected, and fairness of this ranking is analysed. The goal is not to accurately identify the top 100 most qualified candidates, because the target scores the model is trained on are intentionally biased. The first version, using raw biography embeddings, evaluates whether the model can uncover sensitive attributes purely from unstructured data, without access to explicit gender or ethnicity attributes. In contrast, the agnostic version is specifically made to assess whether applying the bias mitigation techniques can prevent the model from reproducing biased outcomes, even when trained on biased scores. Models used early fusion with feedforward MLPs and BERT-encoded biographies. The removal of sensitive information using the debiasing techniques improved model fairness. The agnostic model achieved a perfectly balanced 50–50 male-to-female ratio in the top-10 selection, better representation in the top-100 selection, and a top-10 ranking more closely aligned with the true Unbiased top-10 (higher NDCG score). MAE calculated against the Unbiased scores and KL-divergence also improved (see Table 1). Based on these findings, it can be concluded that scrubbing and using round-trip translation effectively helps correct the gender bias in the FairCVtest dataset. **Table 1: Performance and fairness results between the biased and agnostic model** | Metric | Biased model | Agnostic model | |---------------------------------------|--------------|----------------| | MAE (Gender-biased) | 0.018 | 0.043 | | MAE (Unbiased) | 0.050 | 0.049 | | KL (female score distribution to male)| 0.237 | 0.015 | | % Female in top 100 | 23% | 44% | | % Female in top 10 | 10% | 50% | | NDCG (against Unbiased top 10) | 0.977 | 0.995 | Finally, the same debiasing pipeline was also applied with ethnicity as the sensitive attribute. However, since the biographies contain limited ethnicity-related information, scrubbing and back-translation did not significantly reduce ethnic bias in the FairCVtest dataset.

Serve As Reviewer: ~Angela_Petkova2

Submission Number: 11

Loading