Truly Proximal Policy OptimizationDownload PDFOpen Website

2019 (modified: 24 Apr 2023)UAI 2019Readers: Everyone
Abstract: Proximal policy optimization (PPO) is one of the most successful deep reinforcement learning methods, achieving state-of-the-art performance across a wide range of challenging tasks. However, its o...
0 Replies

Loading