Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Jesse Farebrother, Jordi Orbay, Quan Vuong, Adrien Ali Taïga, Yevgen Chebotar, Ted Xiao, Alex Irpan, Sergey Levine, Pablo Samuel Castro, Aleksandra Faust, Aviral Kumar, Rishabh Agarwal
Published: 01 Jan 2024, Last Modified: 05 Oct 2024
ICML 2024
CC BY-SA 4.0