An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning

Ronald Parr, Lihong Li, Gavin Taylor, Christopher Painter-Wakefield, Michael L. Littman

2008 (modified: 08 Nov 2022)ICML 2008Readers: Everyone

Abstract: We show that linear value-function approximation is equivalent to a form of linear model approximation. We then derive a relationship between the model-approximation error and the Bellman error, and show how this relationship can guide feature selection for model improvement and/or value-function improvement. We also show how these results give insight into the behavior of existing feature-selection algorithms.

0 Replies