2021 (modified: 22 Nov 2022), ICLR 2021, Readers: Everyone
Abstract: While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied...