Theory of Mind as Intrinsic Motivation for Multi-Agent Reinforcement Learning

Published: 20 Jun 2023, Last Modified: 29 Jun 2023ToM 2023EveryoneRevisionsBibTeX
Keywords: Theory of Mind, ToM, RL, multi-agent, reinforcement learning, intrinsic motivation, concept learning, interpretability
TL;DR: Presents a method of grounding semantic beliefs in RL policies, and uses prediction of other agents' beliefs as intrinsic motivation in multi-agent RL.
Abstract: The ability to model the mental states of others is crucial to human social intelligence, and can offer similar benefits to artificial agents with respect to the social dynamics induced in multi-agent settings. We present a method of grounding semantically meaningful, human-interpretable beliefs within policies modeled by deep networks. We then consider the task of 2nd-order belief prediction. We propose that ability of each agent to predict the beliefs of the other agents can be used as an intrinsic reward signal for multi-agent reinforcement learning. Finally, we present preliminary empirical results in a mixed cooperative-competitive environment.
Submission Number: 37
Loading