A Unifying Framework for Causal Imitation Learning with Hidden Confounders

A Unifying Framework for Causal Imitation Learning with Hidden Confounders

ICLR 2026 Conference Submission13961 Authors

18 Sept 2025 (modified: 08 Oct 2025)ICLR 2026 Conference SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Keywords: Imitation Learning, Hidden Confounders, Causal Inference, Reinforcement Learning

Abstract: We propose a general and unifying framework for causal Imitation Learning (IL) with hidden confounders, which subsumes several existing settings. Our framework accounts for two types of hidden confounders: (a) variables observed by the expert but not by the imitator, and (b) confounding noise hidden from both. By leveraging trajectory histories as instruments, we reformulate causal IL in our framework into a Conditional Moment Restriction (CMR) problem. We propose DML-IL, an algorithm that solves this CMR problem via instrumental variable regression, and upper bound its imitation gap. Empirical evaluation on continuous state-action environments, including Mujoco tasks, demonstrates that DML-IL outperforms existing causal IL baselines.

Supplementary Material: zip

Primary Area: reinforcement learning

Submission Number: 13961

Loading