Quasi Newton Temporal Difference LearningDownload PDFOpen Website

2014 (modified: 03 Nov 2022)ACML 2014Readers: Everyone
Abstract: Fast convergent and computationally inexpensive policy evaluation is an essential part of reinforcement learning algorithms based on policy iteration. Algorithms such as LSTD, LSPE, FPKF and NTD, h...
0 Replies

Loading