Mixhead: Breaking the low-rank bottleneck in multi-head attention language models
Zhong Zhang, Nian Shao, Chongming Gao, Rui Miao, Qinli Yang, Junming Shao
Published: 01 Jan 2022, Last Modified: 15 Nov 2023
Knowl. Based Syst. 2022
Readers: Everyone