V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

H. Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W. Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin Riedmiller, Matthew M. Botvinick

23 Sept 2020 (modified: 05 May 2023), ICLR 2020, Readers: Everyone