Multi-agent Learning Dynamics: A SurveyOpen Website

2007 (modified: 24 Sept 2025)CIA 2007Readers: Everyone
Abstract: In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide variety of games. We consider two types of algorithms: value iteration and policy iteration. Four characteristics are studied: initial conditions, parameter settings, convergence speed, and local versus global convergence. Global convergence is still difficult to achieve in practice, despite existing theoretical guarantees. Multiple visualizations are included to provide a comprehensive insight into the learning dynamics.
0 Replies

Loading