2015 (modified: 11 Nov 2022) | ICML 2015 | Readers: Everyone
Abstract: We consider the problem of undiscounted reinforcement learning in continuous state space. Regret bounds in this setting usually hold under various assumptions on the structure of the reward and tra...