2007 (modified: 04 Nov 2022)AISTATS 2007Readers: Everyone
Abstract:In uncertain and partially observable environments control policies must be a function of the complete history of actions and observations. Rather than present an ever growing history to a learner,...