2019 (modified: 11 Nov 2022) | ICML 2019 | Readers: Everyone
Abstract: Reinforcement learning algorithms struggle when the reward signal is very sparse. In these cases, naive random exploration methods essentially rely on a random walk to stumble onto a rewarding stat...