Multi-Agent Model-Based Reinforcement Learning with Joint State-Action Learned Embeddings

Published: 19 Dec 2025, Last Modified: 05 Jan 2026AAMAS 2026 FullEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Multi-Agent Reinforcement Learning, Model-Based Reinforcement Learning, Representation Learning, Multi-Agent Cooperation, Centralized Training with Decentralized Execution
Abstract: Learning to coordinate many agents in partially observable and highly dynamic environments requires both informative representations and data-efficient training. To address this challenge, we present a novel model‑based multi‑agent reinforcement learning framework that unifies joint state-action representation learning with imaginative roll-outs. We design a world model trained with variational auto‑encoders and augment the model using the state‑action learned embedding (SALE). SALE is injected into both the imagination module that forecasts plausible future roll-outs and the joint agent network whose individual action values are combined through a mixing network to estimate the joint action-value function. By coupling imagined trajectories with SALE‑based action values, the agents acquire a richer understanding of how their choices influence collective outcomes, leading to improved long‑term planning and optimization under limited real‑environment interactions. Empirical studies on well-established multi‑agent benchmarks, including StarCraft II Micro-Management, Multi‑Agent MuJoCo, and Level‑Based Foraging challenges, demonstrate consistent gains of our method over baseline algorithms and highlight the effectiveness of joint state-action learned embeddings within a multi-agent model‑based paradigm.
Area: Learning and Adaptation (LEARN)
Generative A I: I acknowledge that I have read and will follow this policy.
Submission Number: 71
Loading