Semi-Supervised Cross-Domain Imitation Learning

TMLR Paper6472 Authors

11 Nov 2025 (modified: 13 Nov 2025) · Under review for TMLR · CC BY 4.0
Abstract: Cross-domain imitation learning (CDIL) accelerates policy learning by transferring expert knowledge across domains, which is valuable in applications where collecting expert data is costly. Existing methods are either supervised, relying on proxy tasks and explicit alignment, or unsupervised, aligning distributions without paired data but often unstable in practice. We introduce the Semi-Supervised CDIL (SS-CDIL) setting and propose the first algorithm for SS-CDIL with theoretical justification. Our method uses only offline data, consisting of a small number of target-domain expert demonstrations and a set of unlabeled imperfect trajectories. To handle domain discrepancy, we propose a novel cross-domain loss function for learning inter-domain state-action mappings and design an adaptive weight function to balance source and target knowledge. Experiments on MuJoCo and Robosuite show consistent gains over the baselines, demonstrating that our approach achieves stable and data-efficient policy learning with minimal supervision.
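The abstract gives no implementation details; purely as an illustrative sketch (not the authors' method), the overall objective might combine a source-derived term, obtained through a learned inter-domain state-action mapping, with behavior cloning on the few target expert demonstrations, balanced by an adaptive weight. All names below (StateActionMapper, adaptive_weight, combined_loss) and the weighting schedule are hypothetical.

# Illustrative sketch only: one plausible way to combine source knowledge
# (via a learned inter-domain state-action mapping) with a small set of
# target expert demonstrations under an adaptive weight. All names and the
# weighting schedule are assumptions, not taken from the paper.
import torch
import torch.nn as nn


class StateActionMapper(nn.Module):
    """Maps concatenated source-domain (state, action) vectors into
    concatenated target-domain (state, action) vectors."""

    def __init__(self, src_dim: int, tgt_dim: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(src_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, tgt_dim),
        )

    def forward(self, src_sa: torch.Tensor) -> torch.Tensor:
        return self.net(src_sa)


def adaptive_weight(step: int, total_steps: int) -> float:
    """Hypothetical schedule: lean on source knowledge early in training,
    then shift toward the target expert demonstrations."""
    return max(0.0, 1.0 - step / total_steps)


def combined_loss(policy: nn.Module,
                  mapper: StateActionMapper,
                  src_expert_sa: torch.Tensor,
                  tgt_states: torch.Tensor,
                  tgt_actions: torch.Tensor,
                  tgt_state_dim: int,
                  w_src: float) -> torch.Tensor:
    # Source term: map source expert (state, action) pairs into the target
    # domain and treat them as pseudo target-domain supervision.
    mapped = mapper(src_expert_sa)
    pseudo_states = mapped[:, :tgt_state_dim]
    pseudo_actions = mapped[:, tgt_state_dim:]
    src_term = ((policy(pseudo_states) - pseudo_actions) ** 2).mean()

    # Target term: behavior cloning on the small target expert dataset.
    tgt_term = ((policy(tgt_states) - tgt_actions) ** 2).mean()

    # Adaptive weight balances source-derived and target knowledge.
    return w_src * src_term + (1.0 - w_src) * tgt_term

As a usage note, w_src would be recomputed each training step (e.g., w_src = adaptive_weight(step, total_steps)), so the objective gradually down-weights the mapped source data as the target expert demonstrations take over.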
Submission Type: Regular submission (no more than 12 pages of main content)
Assigned Action Editor: ~Arnob_Ghosh3
Submission Number: 6472