Finite-sample Analysis of Greedy-GQ with Linear Function Approximation under Markovian NoiseDownload PDFOpen Website

2020 (modified: 16 Apr 2023)UAI 2020Readers: Everyone
Abstract: Greedy-GQ is an off-policy two timescale algorithm for optimal control in reinforcement learning. This paper develops the first finite-sample analysis for the Greedy-GQ algorithm with linear functi...
0 Replies

Loading