Evaluating Robustness to Dataset Shift via Parametric Robustness Sets

Nikolaj Thams; Michael Oberst; David Sontag

Evaluating Robustness to Dataset Shift via Parametric Robustness Sets

Nikolaj Thams, Michael Oberst, David Sontag

Published: 31 Oct 2022, Last Modified: 06 Apr 2025NeurIPS 2022 AcceptReaders: Everyone

Keywords: Causality, Robustness, Distributional Robustness, Distribution Shift, Dataset Shift

Abstract: We give a method for proactively identifying small, plausible shifts in distribution which lead to large differences in model performance. These shifts are defined via parametric changes in the causal mechanisms of observed variables, where constraints on parameters yield a "robustness set" of plausible distributions and a corresponding worst-case loss over the set. While the loss under an individual parametric shift can be estimated via reweighting techniques such as importance sampling, the resulting worst-case optimization problem is non-convex, and the estimate may suffer from large variance. For small shifts, however, we can construct a local second-order approximation to the loss under shift and cast the problem of finding a worst-case shift as a particular non-convex quadratic optimization problem, for which efficient algorithms are available. We demonstrate that this second-order approximation can be estimated directly for shifts in conditional exponential family models, and we bound the approximation error. We apply our approach to a computer vision task (classifying gender from images), revealing sensitivity to shifts in non-causal attributes.

TL;DR: We give a method for evaluating worst-case predictive performance under parameterized changes in distribution.

Supplementary Material: zip

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/evaluating-robustness-to-dataset-shift-via/code)

22 Replies

Loading