2019 (modified: 14 May 2023), ICML 2019, Readers: Everyone
Abstract: We devise a distributional variant of gradient temporal-difference (TD) learning. Distributional reinforcement learning has been shown to outperform its regular, non-distributional counterpart in the recent study \cite...