MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation

Mingzhen Sun, Weining Wang, Yanyuan Qiao, Jiahui Sun, Zihan Qin, Longteng Guo, Xinxin Zhu, Jing Liu

Published: 28 Oct 2024, Last Modified: 18 Mar 2026CrossrefEveryoneRevisionsCC BY-SA 4.0
Loading