Infinite attention: NNGP and NTK for deep attention networksDownload PDFOpen Website

2020 (modified: 24 Apr 2023)ICML 2020Readers: Everyone
Abstract: There is a growing amount of literature on the relationship between wide neural networks (NNs) and Gaussian processes (GPs), identifying an equivalence between the two for a variety of NN architect...
0 Replies

Loading