2021 (modified: 18 May 2022)ICML 2021Readers: Everyone
Abstract:We propose a model-free reinforcement learning algorithm inspired by the popular randomized least squares value iteration (RLSVI) algorithm as well as the optimism principle. Unlike existing upper-...