Learning Dynamic State Abstractions for Model-Based Reinforcement Learning

Lars Buesing; Theophane Weber; Sebastien Racaniere; S. M. Ali Eslami; Danilo Rezende; David Reichert; Fabio Viola; Frederic Besse; Karol Gregor; Demis Hassabis; Daan Wierstra

Learning Dynamic State Abstractions for Model-Based Reinforcement Learning

Lars Buesing, Theophane Weber, Sebastien Racaniere, S. M. Ali Eslami, Danilo Rezende, David Reichert, Fabio Viola, Frederic Besse, Karol Gregor, Demis Hassabis, Daan Wierstra

15 Feb 2018 (modified: 10 Feb 2022)ICLR 2018 Conference Blind SubmissionReaders: Everyone

Abstract: A key challenge in model-based reinforcement learning (RL) is to synthesize computationally efficient and accurate environment models. We show that carefully designed models that learn predictive and compact state representations, also called state-space models, substantially reduce the computational costs for predicting outcomes of sequences of actions. Extensive experiments establish that state-space models accurately capture the dynamics of Atari games from the Arcade Learning Environment (ALE) from raw pixels. Furthermore, RL agents that use Monte-Carlo rollouts of these models as features for decision making outperform strong model-free baselines on the game MS_PACMAN, demonstrating the benefits of planning using learned dynamic state abstractions.

Keywords: generative models, probabilistic modelling, reinforcement learning, state-space models, planning

Data: [Arcade Learning Environment](https://paperswithcode.com/dataset/arcade-learning-environment)

11 Replies

Loading