FaAlGrad: Fairness through Alignment of Gradients across Different Subpopulations

Nikita Malik; Konda Reddy Mopuri

Published: 26 Feb 2025, Last Modified: 26 Feb 2025. Accepted by TMLR. License: CC BY 4.0

Abstract: The growing deployment of machine learning systems has increased interest in systems optimized for other important criteria in addition to expected task performance. For instance, machine learning models often exhibit biases that lead to unfair outcomes for certain protected subpopulations. This work aims to mitigate bias in machine learning models and enhance their fairness by aligning loss gradients. Specifically, leveraging meta-learning, we propose a novel training framework that aligns the gradients computed across different subpopulations to learn fair classifiers. Aligning the gradients regularizes the training process, thereby prioritizing fairness over predictive accuracy. Our experiments on multiple benchmark datasets demonstrate significant improvements in fairness metrics without any dedicated fairness regularizer. Thus, our work contributes to developing fairer machine learning models with broader societal benefits.
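
As a rough illustration of what "aligning the gradients computed across different subpopulations" can mean, the sketch below computes the loss gradient of a tiny logistic-regression model separately on two groups and measures how well the two gradients agree using cosine similarity. This is not the authors' algorithm (the linked code repository is the reference for that); the logistic model, the toy data, and the helper names `group_gradient` and `cosine_similarity` are all assumptions made for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def group_gradient(weights, examples):
    """Gradient of the mean logistic loss over one subpopulation.

    `examples` is a list of (features, label) pairs, label in {0, 1}.
    """
    grad = [0.0] * len(weights)
    for features, label in examples:
        pred = sigmoid(sum(w * x for w, x in zip(weights, features)))
        for i, x in enumerate(features):
            grad[i] += (pred - label) * x
    return [g / len(examples) for g in grad]


def cosine_similarity(u, v):
    """Cosine of the angle between two gradient vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical toy data: the first feature is a constant bias term.
group_a = [([1.0, 2.0], 1), ([1.0, -1.0], 0)]
group_b = [([1.0, 0.5], 0), ([1.0, 1.0], 0)]

weights = [0.0, 0.0]
grad_a = group_gradient(weights, group_a)
grad_b = group_gradient(weights, group_b)

# A value near +1 means both groups pull the parameters the same way;
# a negative value means an update that helps one group hurts the other.
alignment = cosine_similarity(grad_a, grad_b)
print(grad_a, grad_b, alignment)
```

On this toy data the two gradients conflict (the similarity is negative), which is the kind of disagreement a gradient-alignment training scheme would try to reduce.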

Submission Length: Long submission (more than 12 pages of main content)

Video: https://www.youtube.com/watch?v=YFx0svBR4Lw

Code: https://github.com/NikitaMalik2303/FaAlGrad

Assigned Action Editor: ~Cedric_Archambeau1

Submission Number: 2957
