Published: 01 Jan 2022, Last Modified: 10 May 2023ICML 2022Readers: Everyone
Abstract:In this work, we study the use of the Bellman equation as a surrogate objective for value prediction accuracy. While the Bellman equation is uniquely solved by the true value function over all stat...