Addressing Ambiguous Queries in Video Retrieval with Advanced Temporal Search

Bao Tran Gia, Tuong Bui Cong Khanh, Tam Le Thi Thanh, Khoa Tran, Hien Ho Trong, Thuyen Tran Doan, Khiem Le, Tien Do, Duy-Dinh Le, Thanh Duc Ngo

Published: 01 Jan 2025, Last Modified: 25 Jan 2026, Crossref, CC BY-SA 4.0
Abstract: The increasing volume of multimedia content has intensified the demand for video retrieval systems that can efficiently and accurately extract relevant information from large-scale archives. However, existing methods frequently encounter challenges when dealing with ambiguous queries, particularly those involving complex temporal relationships, often leading to incomplete or suboptimal retrieval results. To address these limitations, we propose a novel multimodal video retrieval system designed to handle a wide range of query types by integrating outputs from multiple search models. A central feature of the system is its advanced temporal search mechanism, which improves ambiguity resolution by conducting additional searches within adjacent video shots, rather than relying solely on chronological order. The effectiveness of the proposed system is demonstrated through its performance in the 2024 Ho Chi Minh AI Challenge.
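The temporal search idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name temporal_rerank, the (video_id, shot_index) identifier scheme, the window parameter, and the additive score fusion are all assumptions made for illustration. The sketch takes scored shots for one part of a query and, for each, looks for the best match to a second part of the query among adjacent shots in either direction, rather than only among later shots.

```python
from typing import Dict, List, Tuple

# Hypothetical identifier scheme: (video_id, shot_index).
ShotId = Tuple[str, int]


def temporal_rerank(
    primary: Dict[ShotId, float],
    secondary: Dict[ShotId, float],
    window: int = 2,
) -> List[Tuple[ShotId, float]]:
    """Re-rank primary hits using the best secondary hit in adjacent shots.

    primary and secondary map shots to similarity scores from any search
    model. Additive fusion is an assumption, not the paper's stated method.
    """
    ranked = []
    for (video, idx), score in primary.items():
        # Look at neighbours on both sides of the shot, within the same video.
        neighbor_scores = [
            secondary.get((video, idx + offset), 0.0)
            for offset in range(-window, window + 1)
            if offset != 0
        ]
        best = max(neighbor_scores, default=0.0)
        ranked.append(((video, idx), score + best))
    # Highest combined score first.
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked


if __name__ == "__main__":
    first_event = {("v1", 3): 0.6, ("v2", 7): 0.7}
    second_event = {("v1", 4): 0.9, ("v2", 20): 0.95}
    # ("v1", 3) ranks first: its neighbour ("v1", 4) matches the second event,
    # while the strong second-event hit in v2 is too far from shot 7.
    for shot, combined in temporal_rerank(first_event, second_event):
        print(shot, combined)
```

Searching both sides of a candidate shot is what lets such a scheme tolerate queries whose event order is unclear; a strictly chronological variant would restrict the offsets to positive values.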