Toggle navigation
OpenReview
.net
Login
×
Go to
ICML 2021
homepage
Characterizing the Gap Between Actor-Critic and Policy Gradient
Junfeng Wen
,
Saurabh Kumar
,
Ramki Gummadi
,
Dale Schuurmans
2021 (modified: 16 Sept 2021)
ICML 2021
Readers:
Everyone
Abstract:
Actor-critic (AC) methods are ubiquitous in reinforcement learning. Although it is understood that AC methods are closely related to policy gradient (PG), their precise connection has not been full...
0 Replies
Loading