2022 (modified: 23 Nov 2022)ICLR 2022Readers: Everyone
Abstract:The success of multi-head self-attentions (MSAs) for computer vision is now indisputable. However, little is known about how MSAs work. We present fundamental explanations to help better understand...