Understanding Policy Gradient Algorithms: A Sensitivity-Based ApproachDownload PDFOpen Website

2022 (modified: 17 Apr 2023)ICML 2022Readers: Everyone
Abstract: The REINFORCE algorithm \cite{williams1992simple} is popular in policy gradient (PG) for solving reinforcement learning (RL) problems. Meanwhile, the theoretical form of PG is from \cite{sutton1999...
0 Replies

Loading