2018 (modified: 11 Nov 2022)ICML 2018Readers: Everyone
Abstract:We describe two efficient, and exact, algorithms for computing Bellman updates in robust Markov decision processes (MDPs). The first algorithm uses a homotopy continuation method to compute updates...