2019 (modified: 11 Nov 2022)ICML 2019Readers: Everyone
Abstract:Policy gradient methods ignore the potential value of adjusting environment variables: unobservable state features that are randomly determined by the environment in a physical setting, but are con...