Off-Beat Multi-Agent Reinforcement Learning

Published: 01 Jan 2023, Last Modified: 03 Sept 2025AAMAS 2023EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: We investigate cooperative multi-agent reinforcement learning in environments with off-beat actions, i.e., all actions have execution durations. During execution durations, the environmental changes are not synchronised with action executions. To learn efficient multi-agent coordination in environments with off-beat actions, we propose a novel reward redistribution method built on our novel graph-based episodic memory. We name our solution method as LeGEM. Empirical results on stag-hunter game show that it significantly boosts multi-agent coordination.
Loading