Abstract: As concerns around data privacy in machine learning grow, the ability to unlearn, or remove, specific data points from trained models becomes increasingly important. While state-of-the-art unlearning methods have emerged in response, they typically treat all points in the forget set equally. In this work, we challenge this approach by asking: do points that have a negligible impact on the model's learning need to be removed at all? Through a comparative analysis of influence functions across language and vision tasks, we identify subsets of training data with negligible impact on model outputs. Leveraging this insight, we propose an efficient unlearning framework that reduces dataset size before unlearning, leading to significant computational savings (up to ~50%) on real-world empirical examples.
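The abstract describes the framework only at a high level; the sketch below illustrates the general idea of scoring forget-set points and skipping the negligible ones before running any unlearning routine. Everything here is an assumption for illustration: the helper names (`score_forget_points`, `prune_forget_set`), the per-example gradient-norm scoring rule (a crude stand-in for true influence functions, which also require curvature information), and the 50% keep fraction are not taken from the paper.

```python
# Illustrative sketch only: prune a forget set by an influence proxy before unlearning.
# Assumes a PyTorch classifier and a DataLoader over the forget set with batch_size=1.
import torch
import torch.nn.functional as F


def score_forget_points(model, forget_loader, device="cpu"):
    """Assign each forget-set example a crude influence proxy:
    the norm of its per-example loss gradient w.r.t. the model parameters."""
    model.eval()
    params = [p for p in model.parameters() if p.requires_grad]
    scores = []
    for x, y in forget_loader:
        x, y = x.to(device), y.to(device)
        loss = F.cross_entropy(model(x), y)
        grads = torch.autograd.grad(loss, params)
        scores.append(torch.sqrt(sum(g.pow(2).sum() for g in grads)).item())
    return scores


def prune_forget_set(forget_set, scores, keep_fraction=0.5):
    """Keep only the examples with the largest proxy scores; the rest are
    treated as having negligible impact and are skipped during unlearning."""
    k = max(1, int(len(forget_set) * keep_fraction))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [forget_set[i] for i in ranked[:k]]
```

Under these assumptions, the pruned set would simply be handed to whatever unlearning method is in use, so the computational saving scales with the fraction of points judged negligible.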
Submission Type: Regular submission (no more than 12 pages of main content)
Assigned Action Editor: ~Antti_Honkela1
Submission Number: 6758