Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks

Georgios Papoudakis; Filippos Christianos; Lukas Schäfer; Stefano V Albrecht

Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks

Georgios Papoudakis, Filippos Christianos, Lukas Schäfer, Stefano V Albrecht

Published: 29 Jul 2021, Last Modified: 26 May 2025NeurIPS 2021 Datasets and Benchmarks Track (Round 1)Readers: Everyone

Abstract: Multi-agent deep reinforcement learning (MARL) suffers from a lack of commonly-used evaluation tasks and criteria, making comparisons between approaches difficult. In this work, we provide a systematic evaluation and comparison of three different classes of MARL algorithms (independent learning, centralised multi-agent policy gradient, value decomposition) in a diverse range of cooperative multi-agent learning tasks. Our experiments serve as a reference for the expected performance of algorithms across different learning tasks, and we provide insights regarding the effectiveness of different learning approaches. We open-source EPyMARL, which extends the PyMARL codebase to include additional algorithms and allow for flexible configuration of algorithm implementation details such as parameter sharing. Finally, we open-source two environments for multi-agent research which focus on coordination under sparse rewards.

Supplementary Material: zip

URL: https://github.com/uoe-agents/epymarl https://github.com/uoe-agents/lb-foraging https://github.com/uoe-agents/robotic-warehouse

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/arxiv:2006.07869/code)

13 Replies

Loading