Toggle navigation
OpenReview
.net
Login
×
Go to
DBLP
homepage
Multi-modal visual tracking based on textual generation
Jiahao Wang
,
Fang Liu
,
Licheng Jiao
,
Hao Wang
,
Shuo Li
,
Lingling Li
,
Puhua Chen
,
Xu Liu
Published: 01 Jan 2024, Last Modified: 06 Mar 2025
Inf. Fusion 2024
Everyone
Revisions
BibTeX
CC BY-SA 4.0
Abstract:
Highlights•Visual-Language Interaction Prompt Manager is proposed.•Multi-modal Image Description Co-Generation Module is introduced.•Multi-modal Visual Tracking Based on Textual Generation method is designed.
Loading