2016 (modified: 11 Nov 2022), ICML 2016, Readers: Everyone
Abstract: We propose randomized least-squares value iteration (RLSVI), a new reinforcement learning algorithm designed to explore and generalize efficiently via linearly parameterized value functions. We ex...