Keywords: Generative models, Parallel diffusion sampling
Abstract: Recent years have witnessed significant progress in developing effective diffusion models. Parallel sampling is a promising recent approach that reformulates the sequential denoising process as solving a system of nonlinear equations, and it can be combined with other acceleration techniques. However, current progress is limited by the trade-off between high fidelity and computational efficiency.
This paper addresses the challenge of scaling to high-dimensional, multi-modal generation. Specifically, we present ROPA (Robust Parallel Diffusion Sampling), which takes into account the properties of the denoising process and solves the linear system using adaptive local sparsity to achieve stable parallel sampling.
Extensive experiments demonstrate ROPA’s effectiveness: it significantly accelerates sampling across diverse image and video diffusion models, achieving up to $2.9\times$ speedup with eight core, an improvement of 52\% over baselines without sacrificing sample quality. ROPA enables parallel sampling methods to provide a solid foundation for real-time, high-fidelity diffusion generation.
Primary Area: generative models
Submission Number: 4016
Loading