CLDTracker: A Comprehensive Language Description for visual Tracking

Published: 01 Jan 2025, Last Modified: 15 Sept 2025Inf. Fusion 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•Introduce comprehensive bag of textual descriptions for VOT tracking.•Provide a comprehensive bag of textual descriptions for six VOT datasets.•Propose TTFUM to update target text features over time.•Fuse visual and textual features using attention-based correlation.•Evaluate CLDTrack on six benchmarks against 38 SOTA trackers.
Loading