On reducing sampling variance in covariate shift using control variates

Published: 2017, Last Modified: 15 May 2025CoRR 2017EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Cross-validation under sample selection bias can, in principle, be done by importance-weighting the empirical risk. However, the importance-weighted risk estimator produces sub-optimal hyperparameter estimates in problem settings where large weights arise with high probability. We study its sampling variance as a function of the training data distribution and introduce a control variate to increase its robustness to problematically large weights.
Loading