Model and Method: Training-Time Attack for Cooperative Multi-Agent Reinforcement Learning

Siyang Wu; Tonghan Wang; Xiaoran Wu; Jingfeng Zhang; Yujing Hu; Changjie Fan; Chongjie Zhang

Model and Method: Training-Time Attack for Cooperative Multi-Agent Reinforcement Learning

Siyang Wu, Tonghan Wang, Xiaoran Wu, Jingfeng Zhang, Yujing Hu, Changjie Fan, Chongjie Zhang

08 Oct 2022 (modified: 05 May 2023)Deep RL Workshop 2022Readers: Everyone

Keywords: Cooperative multi-agent reinforcement learning, training-time attack

TL;DR: This paper proposed a training-time attack method for cooperative multi-agent reinforcement learning.

Abstract: The robustness of deep cooperative multi-agent reinforcement learning (MARL) is of great concern and limits the application to real-world risk-sensitive tasks. Adversarial attack is a promising direction to study and improve the robustness of MARL but is largely under-studied. Previous work focuses on deploy-time attacks which may exaggerate attack performance because the MARL learner even does not anticipate the attacker. In this paper, we propose training-time attacks where the learner is allowed to observe and adapt to poisoned experience. For the stealthiness of attacks, we contaminate action sampling and restrict the attack budget so that non-adversarial agents cannot distinguish attacks from exploration noise. We derive two specific attack methods by modeling the influence of action-sampling on experience replay and further on team performance. Experiments show that our methods significantly undermine MARL algorithms by subtly disturbing the exploration-exploitation balance during the learning process.

Supplementary Material: zip

0 Replies

Loading