Abstract: Highlights
• Design a new modular policy in MARL to effectively handle heterogeneous teams.
• The proposed policy modules adapt to unseen environments through indirect training.
• Our architecture simultaneously learns multiple tasks and agents without forgetting.