Regret Minimization in Population Network Games: Vanishing Heterogeneity and Convergence to Equilibria

Published: 2025, Last Modified: 08 Jan 2026IEEE Trans. Neural Networks Learn. Syst. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Understanding and predicting the behavior of large-scale multiagents in games remains a fundamental challenge in multiagent systems. This article examines the role of heterogeneity in equilibrium formation by analyzing how smooth regret matching drives a large number of heterogeneous agents with diverse initial policies toward unified behavior. By modeling the system state as a probability distribution of regrets and analyzing its evolution through the continuity equation, we uncover a key phenomenon in diverse multiagent settings: the variance of the regret distribution diminishes over time, leading to the disappearance of heterogeneity and the emergence of consensus among agents. This universal result enables us to prove convergence to quantal response equilibria in both competitive and cooperative multiagent settings. This work advances the theoretical understanding of multiagent learning and offers a novel perspective on equilibrium selection in diverse game-theoretic scenarios.
Loading