ICCV 2025 Workshop Gen4AVC Submissions
LD-LAudio-V1: Video-to-Long-Form-Audio Generation Extension with Dual Lightweight Adapters
Haomin Zhang, Kristin Qi, Shuxin Yang, Zihao Chen, CHAOFAN DING, XINHAN DI
Published: 07 Aug 2025, Last Modified: 14 Aug 2025
Gen4AVC Poster
Readers: Everyone
Do State-of-the-art Audio-visual VLMs Understand Audio-video Temporal Misalignment
Motonobu Kimura, Ren Ohkubo, Yue Qiu, Yutaka Satoh
Published: 07 Aug 2025, Last Modified: 18 Aug 2025
Gen4AVC Poster
Readers: Everyone
Seeing What You Say: Expressive Image Generation from Speech
Jiyoung Lee, Song Park, Sanghyuk Chun, Soo-Whan Chung
Published: 07 Aug 2025, Last Modified: 23 Aug 2025
Gen4AVC Poster
Readers: Everyone
KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation
Xingrui Wang, Jiang Liu, Ze Wang, Xiaodong Yu, Jialian Wu, Ximeng Sun, Yusheng Su, Alan Yuille, Zicheng Liu, Emad Barsoum
Published: 07 Aug 2025, Last Modified: 22 Aug 2025
Gen4AVC Poster
Readers: Everyone
Not Like Transformers: Drop the Beat Representation for Dance Generation with Mamba-Based Diffusion Model
Sangjune Park, Inhyeok Choi, Donghyeon Soon, Youngwoo Jeon, Kyungdon Joo
Published: 07 Aug 2025, Last Modified: 17 Aug 2025
Gen4AVC Poster
Readers: Everyone
High-Fidelity Talking Portrait Synthesis with Personalized 3D Generative Prior
Jaehoon Ko, Kyusun Cho, JoungBin Lee, Heeji Yoon, Seungryong Kim
Published: 07 Aug 2025, Last Modified: 22 Aug 2025
Gen4AVC Poster
Readers: Everyone
Dance Video Generation using Music-to-Pose Encoder Trained on Synthetic Dataset Generation Pipeline leveraging Latent Diffusion Framework
Nokap Tony Park
Published: 07 Aug 2025, Last Modified: 20 Aug 2025
Gen4AVC Poster
Readers: Everyone
Differentiable Room Acoustic Rendering with Multi-View Vision Priors
Derong Jin, Ruohan Gao
Published: 07 Aug 2025, Last Modified: 07 Aug 2025
Gen4AVC Poster
Readers: Everyone
SpecMaskFoley: Efficient Yet Effective Synchronized Video-to-audio Synthesis via Pretraining and ControlNet
Zhi Zhong, Akira Takahashi, Shuyang Cui, Keisuke Toyama, Shusuke Takahashi, Yuki Mitsufuji
Published: 07 Aug 2025, Last Modified: 23 Aug 2025
Gen4AVC Poster
Readers: Everyone
JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version I
XINHAN DI, Kristin Qi
Published: 07 Aug 2025, Last Modified: 14 Aug 2025
Gen4AVC Poster
Readers: Everyone