Trust Region Policy OptimizationDownload PDFOpen Website

2015 (modified: 11 Nov 2022)ICML 2015Readers: Everyone
Abstract: In this article, we describe a method for optimizing control policies, with guaranteed monotonic improvement. By making several approximations to the theoretically-justified scheme, we develop a pr...
0 Replies

Loading