2019 (modified: 11 Nov 2022), ICML 2019, Readers: Everyone
Abstract: We consider a model-based approach to perform batch off-policy evaluation in reinforcement learning. Our method takes a mixture-of-experts approach to combine parametric and non-parametric models o...