Characterizing the Gap Between Actor-Critic and Policy Gradient

2021 (modified: 16 Sept 2021), ICML 2021, Readers: Everyone
Abstract: Actor-critic (AC) methods are ubiquitous in reinforcement learning. Although it is understood that AC methods are closely related to policy gradient (PG), their precise connection has not been full...
