Optimal Representations for Covariate Shift

Yangjun Ruan; Yann Dubois; Chris J. Maddison

Optimal Representations for Covariate Shift

Yangjun Ruan, Yann Dubois, Chris J. Maddison

Published: 28 Jan 2022, Last Modified: 04 May 2025ICLR 2022 PosterReaders: Everyone

Keywords: distribution shift, domain generalization, representation learning, self-supervised learning, invariance, robustness

Abstract: Machine learning systems often experience a distribution shift between training and testing. In this paper, we introduce a simple variational objective whose optima are exactly the set of all representations on which risk minimizers are guaranteed to be robust to any distribution shift that preserves the Bayes predictor, e.g., covariate shifts. Our objective has two components. First, a representation must remain discriminative for the task, i.e., some predictor must be able to simultaneously minimize the source and target risk. Second, the representation's marginal support needs to be the same across source and target. We make this practical by designing self-supervised objectives that only use unlabelled data and augmentations to train robust representations. Our objectives give insights into the robustness of CLIP, and further improve CLIP's representations to achieve SOTA results on DomainBed.

One-sentence Summary: We give a simple variational objective whose optima are exactly the set of representations that are robust under covariate shift

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 2 code implementations](https://www.catalyzex.com/paper/optimal-representations-for-covariate-shift/code)

18 Replies

Loading