Abstract: Highlights•ScopeViT: a general vision transformer for multi-scale visual interactions.•Scale-aware attention module enriches understanding across spatial scales.•Extensive experiments show the superiority of ScopeViT on various visual tasks.
External IDs:dblp:journals/pr/NieJYCZQ24
Loading