Improving Language Models by Retrieving from Trillions of TokensDownload PDFOpen Website

2022 (modified: 24 Apr 2023)ICML 2022Readers: Everyone
Abstract: We enhance auto-regressive language models by conditioning on document chunks retrieved from a large corpus, based on local similarity with preceding tokens. With a 2 trillion token database, our R...
0 Replies

Loading