Abstract: Highlights•Design a new modular policy in MARL to effectively handle heterogeneous teams.•The proposed policy modules adapt to unseen environments through indirect training.•Our architecture simultaneously learns multiple tasks and agents without forgetting.
External IDs:dblp:journals/eswa/KimP25
Loading