Deep Reinforcement Learning with Causality-based Intrinsic Reward

Peng Zhang; Furui Liu; Zhitang Chen; Jianye HAO; Jun Wang

Deep Reinforcement Learning with Causality-based Intrinsic Reward

Peng Zhang, Furui Liu, Zhitang Chen, Jianye HAO, Jun Wang

28 Sept 2020 (modified: 05 May 2023)ICLR 2021 Conference Blind SubmissionReaders: Everyone

Keywords: Reinforcement Learning, Causal Relation

Abstract: Reinforcement Learning (RL) has shown great potential to deal with sequential decision-making problems. However, most RL algorithms do not explicitly consider the relations between entities in the environment. This makes the policy learning suffer from the problems of efficiency, effectivity and interpretability. In this paper, we propose a novel deep reinforcement learning algorithm, which firstly learns the causal structure of the environment and then leverages the learned causal information to assist policy learning. The proposed algorithm learns a graph to encode the environmental structure by calculating Average Causal Effect (ACE) between different categories of entities, and an intrinsic reward is given to encourage the agent to interact more with entities belonging to top-ranked categories, which significantly boosts policy learning. Several experiments are conducted on a number of simulation environments to demonstrate the effectiveness and better interpretability of our proposed method.

One-sentence Summary: We propose a novel deep reinforcement learning algorithm, which firstly learns environmental causal relations between categories of entities and then leverages the learned relational information to assist policy learning.

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics

Reviewed Version (pdf): https://openreview.net/references/pdf?id=vbs-yXVGlK

10 Replies

Loading