OpenReview.net
  • Login
back arrowGo to DBLP homepage

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Open Webpage

Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron T. Parisi, Abhishek Kumar, Alexander A. Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Fathy Elsayed, Hanie Sedghi, Igor Mordatch et al. (21 additional authors not shown)

Published: 01 Jan 2024, Last Modified: 19 May 2025Trans. Mach. Learn. Res. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Contact
  • Feedback
  • Sponsors
  • Join the Team
  • Frequently Asked Questions
  • Terms of Use
  • Privacy Policy
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Sponsors
  • Join the Team
  • Frequently Asked Questions
  • Contact
  • Feedback
  • Terms of Use
  • Privacy Policy

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview