Hymba: A Hybrid-head Architecture for Small Language Models
Xin Dong, Yonggan Fu, Shizhe Diao, Wonmin Byeon, Zijia Chen, Ameya Sunil Mahabaleshwarkar, Shih-Yang Liu, Matthijs Van Keirsbilck, Min-Hung Chen, Yoshi Suhara, Yingyan Celine Lin, Jan Kautz, Pavlo Molchanov
Published: 01 Jan 2025, Last Modified: 13 May 2025
ICLR 2025
CC BY-SA 4.0