Toggle navigation
OpenReview
.net
Login
×
Go to
DBLP
homepage
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Stephen Casper
,
Xander Davies
,
Claudia Shi
,
Thomas Krendl Gilbert
,
Jérémy Scheurer
,
Javier Rando
,
Rachel Freedman
,
Tomasz Korbak
,
David Lindner
,
Pedro Freire
,
Tony Tong Wang
,
Samuel Marks
,
Charbel-Raphaël Ségerie
,
Micah Carroll
,
Andi Peng
,
Phillip J. K. Christoffersen
,
Mehul Damani
,
Stewart Slocum
,
Usman Anwar
,
Anand Siththaranjan
et al. (12 additional authors not shown)
Published: 01 Jan 2023, Last Modified: 14 May 2025
Trans. Mach. Learn. Res. 2023
Everyone
Revisions
BibTeX
CC BY-SA 4.0
Loading