2020 (modified: 05 Nov 2022)ICML 2020Readers: Everyone
Abstract:Coagent policy gradient algorithms (CPGAs) are reinforcement learning algorithms for training a class of stochastic neural networks called coagent networks. In this work, we prove that CPGAs conver...