Learning to Share in Multi-Agent Reinforcement Learning

28 Sept 2020 (modified: 22 Oct 2023) · ICLR 2021 Conference Blind Submission · Readers: Everyone
Abstract: In this paper, we study the problem of networked multi-agent reinforcement learning (MARL), where a number of agents are deployed as a partially connected network. Networked MARL requires all agents to make decisions in a decentralized manner to optimize a global objective, with communication restricted to neighbors over the network. We propose a hierarchically decentralized MARL method, \textit{LToS}, which enables agents to learn to dynamically share reward with neighbors so as to encourage cooperation on the global objective. For each agent, the high-level policy learns how to share reward with neighbors to decompose the global objective, while the low-level policy learns to optimize the local objective induced by the high-level policies in the neighborhood. The two policies form a bi-level optimization and learn alternately. We empirically demonstrate that LToS outperforms existing methods in both a social dilemma and two networked MARL scenarios.
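To make the reward-sharing idea in the abstract concrete, below is a minimal sketch (not the paper's implementation) of how shaped rewards could be computed once each agent's high-level policy has output sharing weights over its closed neighborhood. The function name `shaped_rewards`, the dictionary-based graph representation, and the per-agent normalization of weights are assumptions for illustration; the low-level policies would then be trained on the resulting shaped rewards.

```python
import numpy as np

def shaped_rewards(rewards, sharing_weights, neighbors):
    """Redistribute environment rewards according to per-agent sharing weights.

    rewards:         array of shape (n_agents,), environment reward of each agent
    sharing_weights: dict mapping agent i -> dict {j: w_ij}, with j in neighbors[i] + [i],
                     where w_ij is the fraction of agent i's reward passed to agent j
                     (assumed non-negative and summing to 1 per agent, so reward is
                     redistributed rather than created or destroyed)
    neighbors:       dict mapping agent i -> list of neighboring agent indices
    """
    shaped = np.zeros(len(rewards))
    for i in range(len(rewards)):
        for j, w in sharing_weights[i].items():
            shaped[j] += w * rewards[i]  # agent i gives fraction w of its reward to j
    return shaped

# Toy usage on a 3-agent line graph 0 - 1 - 2 (weights here are arbitrary placeholders;
# in LToS they would be produced by each agent's high-level policy)
neighbors = {0: [1], 1: [0, 2], 2: [1]}
sharing_weights = {
    0: {0: 0.7, 1: 0.3},
    1: {0: 0.2, 1: 0.6, 2: 0.2},
    2: {1: 0.4, 2: 0.6},
}
rewards = np.array([1.0, 0.0, 2.0])
print(shaped_rewards(rewards, sharing_weights, neighbors))  # -> [0.7, 1.1, 1.2]
```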
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics
Community Implementations: 1 code implementation (https://www.catalyzex.com/paper/arxiv:2112.08702/code)
Reviewed Version (pdf): https://openreview.net/references/pdf?id=fHzOZ477eH