Thinking Like TransformersDownload PDFOpen Website

2021 (modified: 28 Sept 2021)ICML 2021Readers: Everyone
Abstract: What is the computational model behind a Transformer? Where recurrent neural networks have direct parallels in finite state machines, allowing clear discussion and thought around architecture varia...
0 Replies

Loading