2021 (modified: 31 Mar 2022) | ICML 2021 | Readers: Everyone
Abstract: Reinforcement learning (RL) is empirically successful in complex nonlinear Markov decision processes (MDPs) with continuous state spaces. By contrast, the majority of theoretical RL literature requ...