QSMT-net: A query-sensitive proposal and multi-temporal-span matching network for video grounding

Published: 01 Jan 2024, Last Modified: 03 Mar 2025Image Vis. Comput. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•We designed a query-sensitive proposal generation strategy and dynamically generates candidate proposals through a constructed learnable pooling module.•We developed a multi-temporal-span matching network that simulated the matching between candidate proposals and queries across various temporal perspectives.•Our approach outperformed state-of-the-art methods on three challenging video localization benchmarks.
Loading