Keywords: Diffusion model, imitation learning, policy merge
Abstract: Diffusion-based policies have shown great potential in multi-task settings, as they can solve new tasks through inference-time steering without additional training. In this paper, we explore the inference-time composition of diffusion-based policies using various interpolation methods. Our results show that, while existing methods merely switch between predefined action modes, our proposed approach can generate entirely new action patterns by leveraging existing policies, all without the need for further training or tuning.
Submission Number: 38