2015 (modified: 11 Nov 2022) · ICML 2015 · Readers: Everyone
Abstract: We consider the infinite-horizon γ-discounted optimal control problem formalized by Markov Decision Processes. Running any instance of Modified Policy Iteration, a family of algorithms that can inte...
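
For readers unfamiliar with the algorithm family named in the abstract, the following is a minimal sketch of textbook Modified Policy Iteration on a tabular MDP, not the paper's own implementation or analysis. Each iteration takes a policy that is greedy with respect to the current value estimate and then applies that policy's Bellman operator m times; m = 1 corresponds to Value Iteration, and letting m grow without bound recovers Policy Iteration. The function names, the nested-list layout `P[s][a][t]` and `R[s][a]`, and the two-state toy MDP are all assumptions made for illustration.

```python
def greedy_policy(P, R, v, gamma):
    """Return a policy that is greedy with respect to the value estimate v."""
    n_states, n_actions = len(R), len(R[0])
    policy = []
    for s in range(n_states):
        # One-step lookahead value of each action in state s.
        q = [R[s][a] + gamma * sum(P[s][a][t] * v[t] for t in range(n_states))
             for a in range(n_actions)]
        policy.append(max(range(n_actions), key=lambda a: q[a]))
    return policy


def apply_policy_backup(P, R, v, policy, gamma):
    """Apply the Bellman operator of a fixed policy once: v <- T_pi v."""
    n_states = len(R)
    return [R[s][policy[s]]
            + gamma * sum(P[s][policy[s]][t] * v[t] for t in range(n_states))
            for s in range(n_states)]


def modified_policy_iteration(P, R, gamma, m, n_iterations):
    """Run n_iterations of: greedy step, then m backups under that policy."""
    v = [0.0] * len(R)
    policy = greedy_policy(P, R, v, gamma)
    for _ in range(n_iterations):
        policy = greedy_policy(P, R, v, gamma)      # greedy step
        for _ in range(m):                          # partial policy evaluation
            v = apply_policy_backup(P, R, v, policy, gamma)
    return policy, v


# Hypothetical two-state, two-action MDP (illustrative only).
# P[s][a][t] is the probability of moving from s to t under action a.
P = [
    [[1.0, 0.0], [0.0, 1.0]],   # state 0: action 0 stays, action 1 moves to state 1
    [[0.0, 1.0], [1.0, 0.0]],   # state 1: action 0 stays, action 1 moves to state 0
]
# R[s][a] is the immediate reward for taking action a in state s.
R = [
    [0.0, 1.0],
    [2.0, 0.0],
]

policy, v = modified_policy_iteration(P, R, gamma=0.9, m=5, n_iterations=200)
```

In this toy problem the optimal behavior is to move from state 0 to state 1 and then stay there, which gives values 19 and 20 for states 0 and 1 at γ = 0.9. Changing `m` alters how much evaluation work is done per greedy step but not the limit the iterates approach.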