Auto-Aligning Multiagent Incentives with Global Objectives

Published: 16 Jun 2023, Last Modified: 17 Jul 2023
Venue: ICML LLW 2023
Keywords: Price of Anarchy, Multiagent Learning, Reward Sharing, Collective Intelligence
TL;DR: Our work automatically modifies agent rewards in a multiagent system so that the system as a whole optimizes an arbitrary global objective.
Abstract: The general ability to accomplish a single task with a set of decentralized, intelligent agents is an important goal in multiagent research. The complex interaction between individual agents' incentives makes designing their objectives such that the resulting multiagent system aligns with a desired global goal particularly challenging. In this work, instead of considering the problem of designing suitable incentives from scratch, we assume a multiagent system with given preset incentives and consider $\textit{automatically modifying}$ these incentives online to achieve a new goal. This reduces the search space over possible individual incentives and takes advantage of the effort invested by the previous system designer. We demonstrate the promise as well as the limitations of re-purposing multiagent systems in this way, both theoretically and empirically, across a variety of domains. Surprisingly, we show that training a diverse multiagent system to align with a modified global objective ($g \rightarrow g'$) can, in at least one case, lead to better generalization performance in unseen test scenarios when evaluated on the original objective ($g$).
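To make the idea of modifying preset incentives toward a new global objective concrete, here is a minimal, hypothetical sketch in Python. It assumes a simple linear reward-mixing scheme in which each agent's preset reward is blended with the value of the new global objective $g'$; the function name, the mixing rule, and the parameter `alpha` are illustrative assumptions, not the paper's actual method.

```python
import numpy as np

def mixed_rewards(local_rewards, global_reward, alpha):
    """Blend each agent's preset local reward with a shared global signal.

    local_rewards: array of shape (n_agents,) holding the preset per-agent rewards.
    global_reward: scalar value of the new global objective g' at this step.
    alpha: mixing weight in [0, 1]; alpha=0 keeps the original incentives,
           alpha=1 replaces them entirely with the global objective.
    """
    local_rewards = np.asarray(local_rewards, dtype=float)
    return (1.0 - alpha) * local_rewards + alpha * global_reward

# Example: three agents with preset rewards, re-purposed toward a new goal g'.
r_local = [0.2, -0.1, 0.5]   # rewards set by the original system designer
g_prime = 1.0                # value of the new global objective at this step
print(mixed_rewards(r_local, g_prime, alpha=0.3))
```

In this sketch, `alpha = 0` recovers the original system and `alpha = 1` discards the preset incentives entirely; an automatic approach in the spirit of the abstract would adjust such a modification online rather than fixing it by hand.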
Submission Number: 4