Improved Regret Bounds for Undiscounted Continuous Reinforcement Learning

K. Lakshmanan, Ronald Ortner, Daniil Ryabko

2015 (modified: 11 Nov 2022), ICML 2015, Readers: Everyone

Abstract: We consider the problem of undiscounted reinforcement learning in continuous state space. Regret bounds in this setting usually hold under various assumptions on the structure of the reward and tra...
