Pessimistic Data Integration for Policy Evaluation

Xiangkun Wu; Ting Li; Gholamali Aminian; Armin Behnamnia; Hamid R. Rabiee; Chengchun Shi

Published: 18 Sept 2025, Last Modified: 29 Oct 2025

Venue: NeurIPS 2025 poster

License: CC BY 4.0

Keywords: Data Integration, Policy Evaluation, Causal Inference, A/B testing

Abstract: This paper studies how to integrate historical control data with experimental data to enhance A/B testing, while addressing the distributional shift between the historical and experimental datasets. We propose a pessimistic data integration method that combines two causal effect estimators, one built from the experimental dataset and one from the historical dataset. The key idea is to treat the weight function for this combination as a policy, so that existing pessimistic policy learning algorithms can be used to learn the weight that minimizes the mean squared error of the resulting weighted estimator. We also conduct comprehensive theoretical and empirical analyses that compare our method against various baseline estimators across five scenarios. Both the theoretical and the numerical findings demonstrate that the proposed estimator achieves near-optimal performance in all five scenarios.
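To make the weighting idea concrete, the following is a minimal sketch of how two estimators of the same quantity might be combined with an MSE-minimizing weight. It is an illustration under simplifying assumptions, not the paper's algorithm: the weight is a scalar rather than a learned function, the two estimators are assumed independent, the experimental estimator is assumed unbiased, and pessimism is approximated by inflating the estimated bias of the historical estimator by a margin. All function and parameter names (`mse_optimal_weight`, `pessimistic_weight`, `combine`, `bias_margin`) are hypothetical.

```python
def mse_optimal_weight(var_exp, var_hist, bias_hist):
    """Weight on the historical estimator that minimizes the MSE of
    w * est_hist + (1 - w) * est_exp.

    Assumes (for illustration only) that the two estimators are independent
    and that the experimental estimator is unbiased, so that
    MSE(w) = w^2 * (var_hist + bias_hist^2) + (1 - w)^2 * var_exp.
    """
    return var_exp / (var_exp + var_hist + bias_hist ** 2)


def pessimistic_weight(var_exp, var_hist, bias_estimate, bias_margin):
    """Same weight, evaluated at a worst-case bias.

    The true bias is unknown, so this sketch replaces it with an upper
    bound |bias_estimate| + bias_margin. A larger margin shifts weight
    away from the historical estimator.
    """
    worst_case_bias = abs(bias_estimate) + bias_margin
    return mse_optimal_weight(var_exp, var_hist, worst_case_bias)


def combine(est_exp, est_hist, weight):
    """Weighted combination of the experimental and historical estimates."""
    return weight * est_hist + (1.0 - weight) * est_exp


if __name__ == "__main__":
    # Hypothetical numbers, chosen only to exercise the functions.
    w = pessimistic_weight(var_exp=1.0, var_hist=0.5,
                           bias_estimate=0.3, bias_margin=0.2)
    print(w, combine(est_exp=2.0, est_hist=2.4, weight=w))
```

With no distributional shift (zero bias and zero margin), the weight reduces to inverse-variance weighting. As the worst-case bias grows, the weight on the historical estimator shrinks toward zero and the combination falls back to the experimental estimator alone.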

Supplementary Material: zip

Primary Area: General machine learning (supervised, unsupervised, online, active, etc.)

Submission Number: 13113
