How does overparametrization affect performance on minority groups?

TMLR Paper4555 Authors

25 Mar 2025 (modified: 28 Mar 2025)Under review for TMLREveryoneRevisionsBibTeXCC BY 4.0
Abstract: The benefits of overparameterization for the overall performance of modern machine learning (ML) models are well known. However, the effect of overparameterization at a more granular level of data subgroups is less understood. Recent empirical studies demonstrate encouraging results: (i) when groups are not known, overparameterized models trained with empirical risk minimization (ERM) perform better on minority groups; (ii) when groups are known, ERM on data subsampled to equalize group sizes yields state-of-the-art worst-group accuracy in the overparameterized regime. In this paper, we complement these empirical studies with a theoretical investigation of the risk of overparameterized random feature models on minority groups. In a setting in which the regression functions for the majority and minority groups are different, we show that overparameterization always improves minority group performance.
Submission Length: Regular submission (no more than 12 pages of main content)
Assigned Action Editor: ~Ahmad_Beirami1
Submission Number: 4555
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview