On the Rate of Convergence and Error Bounds for LSTD(\(\lambda\))Download PDFOpen Website

2015 (modified: 11 Nov 2022)ICML 2015Readers: Everyone
Abstract: We consider LSTD(λ), the least-squares temporal-difference algorithm with eligibility traces algorithm proposed by Boyan (2002). It computes a linear approximation of the value function of a fixed ...
0 Replies

Loading