Which transformer architecture fits my data? A vocabulary bottleneck in self-attention

2021 (modified: 28 Nov 2021), ICML 2021, Readers: Everyone
Abstract: After their successful debut in natural language processing, Transformer architectures are now becoming the de-facto standard in many domains. An obstacle for their deployment over new modalities i...