2017 (modified: 11 Nov 2022) | ICML 2017 | Readers: Everyone
Abstract: This paper studies B-FQI, an Approximated Value Iteration (AVI) algorithm that uses a boosting procedure to estimate the action-value function in reinforcement learning problems. ...
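
To make the idea of boosting inside value iteration concrete, here is a minimal sketch. It rests on assumptions the abstract does not state: that each iteration fits a weak regressor to the Bellman residual of the current estimate, and that the action-value function is the sum of all fitted regressors. The names `fit_mean_regressor` and `boosted_fqi`, the per-pair mean regressor, and the toy two-state problem are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def fit_mean_regressor(inputs, targets):
    """Hypothetical weak learner: predicts the mean target seen for each input."""
    sums, counts = defaultdict(float), defaultdict(int)
    for x, y in zip(inputs, targets):
        sums[x] += y
        counts[x] += 1
    means = {x: sums[x] / counts[x] for x in sums}
    # Inputs never seen during fitting get a prediction of 0.0.
    return lambda x: means.get(x, 0.0)


def boosted_fqi(transitions, actions, gamma, n_iterations):
    """Sketch of a boosted fitted Q-iteration loop (assumed residual-fitting form)."""
    ensemble = []  # one weak regressor per iteration

    def q(s, a):
        # Current estimate: sum of every regressor fitted so far (0.0 if none).
        return sum(model((s, a)) for model in ensemble)

    for _ in range(n_iterations):
        inputs, residuals = [], []
        for s, a, r, s_next, done in transitions:
            # Bootstrapped value of the next state; zero after a terminal step.
            bootstrap = 0.0 if done else gamma * max(q(s_next, b) for b in actions)
            inputs.append((s, a))
            # Bellman residual of the current estimate on this sample.
            residuals.append(r + bootstrap - q(s, a))
        # Fit the weak learner to the residuals and add it to the ensemble.
        ensemble.append(fit_mean_regressor(inputs, residuals))
    return q


# Toy deterministic problem (illustrative): action 1 in state 0 moves to state 1,
# action 0 stays in state 0, and any action in state 1 ends the episode with reward 1.
transitions = [
    (0, 0, 0.0, 0, False),
    (0, 1, 0.0, 1, False),
    (1, 0, 1.0, 1, True),
    (1, 1, 1.0, 1, True),
]
q = boosted_fqi(transitions, actions=(0, 1), gamma=0.9, n_iterations=5)
```

In this sketch each regressor only has to model what the previous ones got wrong, which is the sense in which the procedure resembles boosting. With a more expressive weak learner (for example a shallow regression tree) the same loop applies unchanged, since only `fit_mean_regressor` would need to be swapped.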