Causal Inference Q-Network: Toward Resilient Reinforcement Learning

Chao-Han Huck Yang; Danny I-Te Hung; Yi Ouyang; Pin-Yu Chen

Causal Inference Q-Network: Toward Resilient Reinforcement Learning

Chao-Han Huck Yang, Danny I-Te Hung, Yi Ouyang, Pin-Yu Chen

Published: 15 Jun 2022, Last Modified: 22 Jun 2025SSL-RL 2021 SpotlightReaders: Everyone

Keywords: Deep Reinforcement Learning, Generative Reinforcement Learning, Causal Inference, Learning from Noisy Labels

TL;DR: We propose a causal inference based generative DRL algorithm called causal inference Q-network (CIQ) with auxiliary labels (interferences) toward resilient learning.

Abstract: Deep reinforcement learning (DRL) has demonstrated impressive performance in various gaming simulators and real-world applications. In practice, however, a DRL agent may receive faulty observation by abrupt interferences such as black-out, frozen-screen, and adversarial perturbation. How to design a resilient DRL algorithm against these rare but mission-critical and safety-crucial scenarios is an important yet challenging task. In this paper, we consider a generative DRL framework training with an auxiliary task of observational interferences such as artificial noises. Under this framework, we discuss the importance of the causal relation and propose a causal inference based DRL algorithm called causal inference Q-network (CIQ). We evaluate the performance of CIQ in several benchmark DRL environments with different types of interferences as auxiliary labels. Our experimental results show that the proposed CIQ method could achieve higher performance and more resilience against observational interferences.

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 4 code implementations](https://www.catalyzex.com/paper/causal-inference-q-network-toward-resilient/code)

0 Replies

Loading