Spatial–temporal video grounding with cross-modal understanding and enhancement | OpenReview

Spatial–temporal video grounding with cross-modal understanding and enhancement

Open Webpage

Shu Luo, Jingyu Pan, Da Cao, Jiawei Wang, Yuquan Le, Meng Liu

Published: 01 May 2025, Last Modified: 21 Jan 2026Expert Systems with ApplicationsEveryoneRevisionsCC BY-SA 4.0

External IDs:doi:10.1016/j.eswa.2025.126650

Loading