Keywords: parallel tempering, diffusion model, inference-time control, replica exchange
Abstract: Inference-time control of diffusion models aims to steer model outputs to satisfy new constraints without retraining.
Previous approaches have mostly relied on heuristic guidance or have been coupled with Sequential Monte Carlo (SMC) for bias correction.
In this paper, we propose a flexible alternative based on replica exchange, an algorithm designed initially for sampling problems.
We refer to this method as the CREPE (Controlling with REPlica Exchange). Unlike SMC, CREPE:
(i) generates particles sequentially, (ii) maintains high diversity in the generated samples after a burn-in period,
and
(iii) enables online refinement or early termination.
We demonstrate its versatility across various tasks, including temperature annealing, reward tilting, model composition and classifier-free guidance debiasing, with competitive performance compared to prior SMC methods.
Primary Area: probabilistic methods (Bayesian methods, variational inference, sampling, UQ, etc.)
Submission Number: 13569
Loading