OpenReview.net
  • Login
back arrowGo to DBLP homepage

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Open Webpage

Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Tong Wang, Samuel Marks, Charbel-Raphaël Ségerie, Micah Carroll, Andi Peng, Phillip J. K. Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan et al. (12 additional authors not shown)

Published: 01 Jan 2023, Last Modified: 14 May 2025Trans. Mach. Learn. Res. 2023EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Contact
  • Feedback
  • Sponsors
  • Join the Team
  • Frequently Asked Questions
  • Terms of Use
  • Privacy Policy
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Sponsors
  • Join the Team
  • Frequently Asked Questions
  • Contact
  • Feedback
  • Terms of Use
  • Privacy Policy

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview