TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward

Yihong Luo, Tianyang Hu, Weijian Luo, Jing Tang

Published: 2026, Last Modified: 05 May 2026CoRR 2026EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading