2018 (modified: 11 Nov 2022)ICML 2018Readers: Everyone
Abstract:The score function estimator is widely used for estimating gradients of stochastic objectives in stochastic computation graphs (SCG), eg., in reinforcement learning and meta-learning. While derivin...