Directional Influence Function: Estimating Training Data Influence in Constrained Learning

15 Sept 2025 (modified: 11 Feb 2026) · Submitted to ICLR 2026 · CC BY 4.0
Keywords: Directional Influence Function, Constrained Learning, Deep Learning, Sensitivity analysis, Variational inequality
TL;DR: A directional influence function (DIF) for constraint-aware estimation of training data influence.
Abstract: Constrained learning has been increasingly applied across domains to enforce explicit feasibility requirements arising from fairness, safety, robustness, regularization, and physics- or logic-based constraints. Understanding how training samples influence the solution (e.g., the learned parameters) of constrained learning is crucial for interpretability and robustness. The classical influence function (IF) can become unreliable in constrained settings: data perturbations can reshape both the objective and the feasible region, leading to estimates that violate feasibility. In response, we propose the Directional Influence Function (DIF), a new estimator that explicitly incorporates the constraints into influence estimation. DIF formulates the optimality conditions of constrained learning as a variational inequality (VI) and analyzes how perturbing training data affects this VI. We validate DIF in constrained linear regression and demonstrate that it recovers leave-one-out retraining results, whereas IF and penalty-based IF exhibit significant bias. We further apply DIF to fairness-constrained CNNs, where DIF accurately predicts test loss changes under data removal and aligns closely with actual retraining. Our results establish DIF as an efficient and reliable tool for data attribution in constrained learning.
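The constrained-linear-regression comparison in the abstract can be illustrated with a small toy sketch (our own construction, not the paper's code; `solve_constrained` and the equality constraint `a^T w = b` are illustrative assumptions). The classical IF update uses only the objective Hessian and so generally leaves the feasible set, whereas a constraint-aware update takes one Newton-style step on the full KKT system, in the spirit of DIF's VI formulation, and stays feasible:

```python
import numpy as np

# Toy data for min_w ||Xw - y||^2 subject to a^T w = b (single equality constraint).
rng = np.random.default_rng(0)
n, d = 40, 5
X = rng.normal(size=(n, d))
y = X @ rng.normal(size=d) + 0.1 * rng.normal(size=n)
a, b = rng.normal(size=d), 1.0

def solve_constrained(X, y):
    """Solve the equality-constrained least squares via its KKT system."""
    H = 2 * X.T @ X
    K = np.block([[H, a[:, None]], [a[None, :], np.zeros((1, 1))]])
    rhs = np.concatenate([2 * X.T @ y, [b]])
    sol = np.linalg.solve(K, rhs)
    return sol[:d], sol[d]          # primal solution w, multiplier lambda

w, lam = solve_constrained(X, y)
i = 0
xi, ri = X[i], X[i] @ w - y[i]      # residual of the sample to be removed

# Classical IF for removing sample i: Hessian-inverse times the removed
# sample's loss gradient; the constraint is ignored entirely.
H = 2 * X.T @ X
dw_if = np.linalg.solve(H, 2 * ri * xi)

# Constraint-aware (DIF-style) estimate: the same perturbation, but pushed
# through the full KKT system, so the step satisfies a^T dw = 0 and the
# estimate remains feasible.
K = np.block([[H, a[:, None]], [a[None, :], np.zeros((1, 1))]])
step = np.linalg.solve(K, np.concatenate([2 * ri * xi, [0.0]]))
dw_dif = step[:d]

# Ground truth: actual leave-one-out retraining on the remaining samples.
w_loo, _ = solve_constrained(np.delete(X, i, 0), np.delete(y, i))
```

In this quadratic setting the only approximation in the constraint-aware step is reusing the full-data Hessian, so it tracks leave-one-out retraining closely, while the plain IF estimate drifts off the constraint surface; this mirrors the feasibility-violation bias the abstract attributes to IF.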
Primary Area: learning theory
Submission Number: 6295