Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding

Jialuo Li, Bin Li, Jiahao Li, Yan Lu

Published: 2025, Last Modified: 05 May 2026CoRR 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading