Toggle navigation
OpenReview
.net
Login
×
Go to
DBLP
homepage
Language models scale reliably with over-training and on downstream tasks
Samir Yitzhak Gadre
,
Georgios Smyrnis
,
Vaishaal Shankar
,
Suchin Gururangan
,
Mitchell Wortsman
,
Rulin Shao
,
Jean Mercat
,
Alex Fang
,
Jeffrey Li
,
Sedrick Keh
,
Rui Xin
,
Marianna Nezhurina
,
Igor Vasiljevic
,
Luca Soldaini
,
Jenia Jitsev
,
Alex Dimakis
,
Gabriel Ilharco
,
Pang Wei Koh
,
Shuran Song
,
Thomas Kollar
et al. (1 additional authors not shown)
Published: 01 Jan 2025, Last Modified: 09 Oct 2025
ICLR 2025
Everyone
Revisions
BibTeX
CC BY-SA 4.0
Loading