An Efficient Test-Time Scaling Approach for Image Generation

Published: 03 Mar 2026, Last Modified: 07 Apr 2026ICLR 2026 DeLTa Workshop PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: efficient test-time scaling, verifier-threshold, image generation, diffusion models, flow models
TL;DR: An efficient test-time compute method for flow models that moves to the next denoising step only when the reward model's score is above a threshold.
Abstract: Image generation has emerged as a mainstream application of large generative models. Just as test-time compute and reasoning have improved language model capabilities, similar benefits have been observed for image generation models. In particular, searching over noise samples for diffusion and flow models has been shown to scale well with test-time compute. While recent works explore allocating non-uniform inference-compute budgets across denoising steps, existing approaches rely on greedy heuristics and often allocate the compute budget ineffectively. In this work, we study this problem and propose a simple fix. We propose Verifier-Threshold, which automatically reallocates test-time compute and delivers substantial efficiency improvements. For the same performance on the GenEval benchmark, we achieve a 2x- 4x reduction in computational time over the state-of-the-art method.
Submission Number: 114
Loading