2018 (modified: 11 Nov 2022)ICML 2018Readers: Everyone
Abstract:Off-policy learning is key to scaling up reinforcement learning as it allows to learn about a target policy from the experience generated by a different behavior policy. Unfortunately, it has been ...