Regioned Episodic Reinforcement Learning

Jiarui Jin; Cong Chen; Ming Zhou; Weinan Zhang; Rasool Fakoor; David Wipf; Yong Yu; Jun Wang; Alex Smola

28 Sept 2020 (modified: 05 May 2023) | ICLR 2021 Conference Blind Submission | Readers: Everyone

Keywords: Deep Reinforcement Learning, Episodic Memory, Sample Efficiency

Abstract: Goal-oriented reinforcement learning algorithms are often good at exploration but not exploitation, while episodic algorithms excel at exploitation but not exploration. As a result, neither approach alone yields a sample-efficient algorithm in complex environments with high-dimensional state spaces and delayed rewards. Motivated by these observations and shortcomings, in this paper we introduce Regioned Episodic Reinforcement Learning (RERL), which combines the strengths of episodic and goal-oriented learning and leads to a more sample-efficient and effective algorithm. RERL achieves this by decomposing the space into several sub-space regions and constructing regions that lead to more effective exploration and high-value trajectories. Extensive experiments on various benchmark tasks show that RERL outperforms existing methods in terms of sample efficiency and final rewards.

One-sentence Summary: In this paper, we introduce Regioned Episodic Reinforcement Learning (RERL), which combines the strengths of episodic and goal-oriented learning to effectively solve tasks with delayed feedback and high-dimensional observations.

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics

Supplementary Material: zip

Reviewed Version (pdf): https://openreview.net/references/pdf?id=jpHcOJWyZx
