Approximate Relative Value Learning for Average-reward Continuous State MDPsDownload PDFOpen Website

Published: 2019, Last Modified: 17 May 2023UAI 2019Readers: Everyone
Abstract: In this paper, we propose an approximate relative value learning (ARVL) algorithm for non- parametric MDPs with continuous state space and finite actions and average reward criterion. It is a sampl...
0 Replies

Loading