First return, then explore

Published: 01 Jan 2021, Last Modified: 12 May 2023
Nat. 2021
Readers: Everyone
Abstract: A reinforcement learning algorithm that explicitly remembers promising states and returns to them as a basis for further exploration solves all as-yet-unsolved Atari games and out-performs previous algorithms on Montezuma’s Revenge and Pitfall.
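The "remember promising states, return to them, then explore" idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy 1-D chain environment, the names `step` and `return_then_explore`, the uniform cell selection, and every parameter value are assumptions chosen only to keep the example small. It also assumes a deterministic environment, so that replaying a stored action sequence reliably returns to the remembered state.

```python
import random


def step(position, action, goal):
    """Hypothetical deterministic toy environment: a chain of cells 0..goal."""
    return max(0, min(goal, position + action))


def return_then_explore(n_iterations=200, explore_steps=5, goal=20, seed=0):
    """Toy archive-based exploration loop (illustrative assumptions only)."""
    rng = random.Random(seed)
    # The archive remembers, for each visited cell, the shortest action
    # sequence found so far that reaches it from the start state.
    archive = {0: []}
    for _ in range(n_iterations):
        # 1. Select a remembered cell. Uniform choice is a simplification;
        #    a fuller system would favour more promising cells.
        cell = rng.choice(sorted(archive))
        trajectory = list(archive[cell])
        # 2. Return: replay the stored actions from the start state.
        position = 0
        for action in trajectory:
            position = step(position, action, goal)
        # 3. Explore from the restored state with random actions,
        #    recording new cells or shorter routes to known cells.
        for _ in range(explore_steps):
            action = rng.choice((-1, 1))
            position = step(position, action, goal)
            trajectory.append(action)
            known = archive.get(position)
            if known is None or len(trajectory) < len(known):
                archive[position] = list(trajectory)
    return archive
```

Calling `return_then_explore()` returns the archive, in which each key is a reached cell and each value is an action sequence that leads to it from the start. The point of the design is that exploration always begins from an already-discovered state instead of from the initial state.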