Towards Equitable Coreset Selection: Addressing Challenges Under Class Imbalance

Liyana Sahir Kallooriyakath, Anugu Namratha Reddy, B Srinath Achary, Ashutosh Sharma, Krisha Shah, Sonia Gupta, Siddhartha Asthana

Published: 10 Nov 2025, Last Modified: 19 Nov 2025CrossrefEveryoneRevisionsCC BY-SA 4.0
Abstract: Coreset selection reduces training cost by constructing compact, representative subsets, but existing methods largely assume balanced class distributions. Under imbalance, this assumption yields biased subsets that discard critical minority samples and degrade accuracy. We propose Equitable Coreset Selection (ECS), a framework tailored for imbalanced data. ECS mitigates these issues through adaptive pruning that preserves minority examples, class-sensitive partitioning aligned with skewed class distributions, and stratified graph-cut selection for diverse sampling. Experiments across multiple imbalanced datasets show that ECS improves generalization and substantially boosts minority-class accuracy compared to standard coreset methods.
Loading