2017 (modified: 11 Nov 2022)ICML 2017Readers: Everyone
Abstract:Reinforcement learning tasks are typically specified as Markov decision processes. This formalism has been highly successful, though specifications often couple the dynamics of the environment and ...