Phasic Policy Gradient

ICML 2021 (modified: 11 Nov 2021)
Abstract: We introduce Phasic Policy Gradient (PPG), a reinforcement learning framework which modifies traditional on-policy actor-critic methods by separating policy and value function training into distinc...