An Image is Worth 16x16 Words: Transformers for Image Recognition at ScaleDownload PDFOpen Website

2021 (modified: 22 Nov 2022)ICLR 2021Readers: Everyone
Abstract: While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied...
0 Replies

Loading