AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Discounted Markov Decision Processes with Near-Optimal Sample ComplexityDownload PDFOpen Website

2020 (modified: 03 Nov 2022)AISTATS 2020Readers: Everyone
Abstract: In this paper, we propose AsyncQVI, an asynchronous-parallel Q-value iteration for discounted Markov decision processes whose transition and reward can only be sampled through a generative model. A...
0 Replies

Loading