Toggle navigation
OpenReview
.net
Login
×
Go to
EMNLP 2023
homepage
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
Joshua Ainslie
,
James Lee-Thorp
,
Michiel de Jong
,
Yury Zemlyanskiy
,
Federico Lebrón
,
Sumit Sanghai
Published: 01 Jan 2023, Last Modified: 17 Dec 2023
EMNLP 2023
Readers:
Everyone
0 Replies
Loading