2021 (modified: 11 Nov 2021), ICML 2021, Readers: Everyone
Abstract: We introduce Phasic Policy Gradient (PPG), a reinforcement learning framework that modifies traditional on-policy actor-critic methods by separating policy and value function training into distinc...