Toggle navigation
OpenReview
.net
Login
×
Go to
ICML 2022
homepage
Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach
Shuang Wu
,
Ling Shi
,
Jun Wang
,
Guangjian Tian
2022 (modified: 17 Apr 2023)
ICML 2022
Readers:
Everyone
Abstract:
The REINFORCE algorithm \cite{williams1992simple} is popular in policy gradient (PG) for solving reinforcement learning (RL) problems. Meanwhile, the theoretical form of PG is from \cite{sutton1999...
0 Replies
Loading