Multi-modal visual tracking based on textual generation

Published: 01 Jan 2024, Last Modified: 06 Mar 2025Inf. Fusion 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•Visual-Language Interaction Prompt Manager is proposed.•Multi-modal Image Description Co-Generation Module is introduced.•Multi-modal Visual Tracking Based on Textual Generation method is designed.
Loading