Simple linear attention language models balance the recall-throughput tradeoff | OpenReview

Simple linear attention language models balance the recall-throughput tradeoff

Open Webpage

Simran Arora, Sabri Eyuboglu, Michael Zhang, Aman Timalsina, Silas Alberti, James Zou, Atri Rudra, Christopher Ré

Published: 01 Jan 2024, Last Modified: 03 Oct 2024ICML 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Loading