OpenReview.net
  • Login
back arrowGo to DBLP homepage

Language models scale reliably with over-training and on downstream tasks

Open Webpage

Samir Yitzhak Gadre, Georgios Smyrnis, Vaishaal Shankar, Suchin Gururangan, Mitchell Wortsman, Rulin Shao, Jean Mercat, Alex Fang, Jeffrey Li, Sedrick Keh, Rui Xin, Marianna Nezhurina, Igor Vasiljevic, Luca Soldaini, Jenia Jitsev, Alex Dimakis, Gabriel Ilharco, Pang Wei Koh, Shuran Song, Thomas Kollar et al. (1 additional authors not shown)

Published: 01 Jan 2025, Last Modified: 09 Oct 2025ICLR 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Contact
  • Sponsors
  • Donate
  • Frequently Asked Questions
  • Terms of Use
  • Privacy Policy
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Sponsors
  • Frequently Asked Questions
  • Contact
  • Donate
  • Terms of Use
  • Privacy Policy

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview