Published: 01 Jan 2023, Last Modified: 11 Mar 2024, ICML 2023, Readers: Everyone
Abstract: Attention-based vision models, such as Vision Transformer (ViT) and its variants, have shown promising performance in various computer vision tasks. However, these emerging architectures suffer fro...