Sequential Subspace Noise Injection Prevents Accuracy Collapse in Certified Unlearning

Dolgova Polina; Sebastian U Stich

Sequential Subspace Noise Injection Prevents Accuracy Collapse in Certified Unlearning

Dolgova Polina, Sebastian U Stich

19 Sept 2025 (modified: 11 Feb 2026)Submitted to ICLR 2026EveryoneRevisionsBibTeXCC BY 4.0

Keywords: machine unlearning, certified unlearning, privacy amplification by iteration

Abstract: Certified unlearning based on differential privacy offers strong guarantees but remains largely impractical: the noisy fine-tuning approaches proposed so far achieve these guarantees but severely reduce model accuracy. We propose sequential noise scheduling, which distributes the noise budget across orthogonal subspaces of the parameter space instead of injecting it all at once. This simple modification mitigates the destructive effect of noise while preserving the original certification guarantees. We extend the analysis of noisy fine-tuning to the subspace setting, proving that the same $(\varepsilon,\delta)$ privacy budget is retained. Empirical results on image classification benchmarks show that our approach substantially improves accuracy after unlearning while remaining robust to membership inference attacks. These results show that certified unlearning can achieve both rigorous guarantees and practical utility.

Primary Area: alignment, fairness, safety, privacy, and societal considerations

Submission Number: 19798

Loading