Recurrent Experience Replay in Distributed Reinforcement Learning

Steven Kapturowski; Georg Ostrovski; John Quan; Remi Munos; Will Dabney

Recurrent Experience Replay in Distributed Reinforcement Learning

Steven Kapturowski, Georg Ostrovski, John Quan, Remi Munos, Will Dabney

Published: 21 Dec 2018, Last Modified: 05 May 2023ICLR 2019 Conference Blind SubmissionReaders: Everyone

Abstract: Building on the recent successes of distributed training of RL agents, in this paper we investigate the training of RNN-based RL agents from distributed prioritized experience replay. We study the effects of parameter lag resulting in representational drift and recurrent state staleness and empirically derive an improved training strategy. Using a single network architecture and fixed set of hyper-parameters, the resulting agent, Recurrent Replay Distributed DQN, quadruples the previous state of the art on Atari-57, and matches the state of the art on DMLab-30. It is the first agent to exceed human-level performance in 52 of the 57 Atari games.

Keywords: RNN, LSTM, experience replay, distributed training, reinforcement learning

TL;DR: Investigation on combining recurrent neural networks and experience replay leading to state-of-the-art agent on both Atari-57 and DMLab-30 using single set of hyper-parameters.

Data: [Arcade Learning Environment](https://paperswithcode.com/dataset/arcade-learning-environment), [DQN Replay Dataset](https://paperswithcode.com/dataset/dqn-replay-dataset)

Code: [![Papers with Code](/images/pwc_icon.svg) 3 community implementations](https://paperswithcode.com/paper/?openreview=r1lyTjAqYX)

22 Replies

Loading