VCR: Video representation for Contextual Retrieval

Published: 01 Jan 2024, Last Modified: 13 Nov 2024CMLDS 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Streamlining content discovery in media archives requires advanced data representations and effective visualization techniques for clear communication of video topics to users. The proposed system addresses the challenge of efficiently navigating large video collections by exploiting a fusion of visual, audio, and textual features to accurately index and categorize video content through a text-based method. Additionally, semantic embeddings are employed to provide contextually relevant information and recommendations to users, resulting in an intuitive and engaging exploratory experience over our topics ontology map using LLMs (GitHub).
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview