Published: 01 Jan 2022, Last Modified: 15 May 2023ICML 2022Readers: Everyone
Abstract:Using a model of the environment and a value function, an agent can construct many estimates of a state’s value, by unrolling the model for different lengths and bootstrapping with its value functi...