Nearly Optimal Policy Optimization with Stable at Any Time GuaranteeDownload PDFOpen Website

Published: 01 Jan 2022, Last Modified: 12 May 2023ICML 2022Readers: Everyone
Abstract: Policy optimization methods are one of the most widely used classes of Reinforcement Learning (RL) algorithms. However, theoretical understanding of these methods remains insufficient. Even in the ...
0 Replies

Loading