IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model.

Yatai Ji, Shilong Zhang, Jie Wu, Peize Sun, Weifeng Chen, Xuefeng Xiao, Sidi Yang, Yujiu Yang, Ping Luo

07 Nov 2025 | ICLR 2025 | Everyone | CC BY-SA 4.0