Robust Multi-Agent Reinforcement Learning with Diverse Adversarial Agent Generation and Contrastive Policy Encoding

This project implements Robust Multi-Agent Reinforcement Learning (MARL) with:
- Diverse adversarial agent generation for adversarial training
- Contrastive policy encoding for adversarial behavior representation
- Co-evolutionary robust MARL training
- Support for both Predator-Prey (PettingZoo MPE) and SMAC environments
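
As a rough illustration of the contrastive policy encoding idea listed above, the sketch below computes an InfoNCE-style loss over a batch of policy embeddings. It is not taken from this repository: the function name `info_nce_loss`, the `temperature` value, and the choice of InfoNCE itself are assumptions made for illustration. It is written in pure Python for portability; an actual implementation would more likely use a tensor library.

```python
import math


def _normalize(vec):
    """Scale a non-zero vector to unit L2 norm."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def info_nce_loss(anchors, positives, temperature=0.1):
    """InfoNCE-style contrastive loss (hypothetical helper, not the repo's API).

    Row i of `positives` is treated as the positive for row i of `anchors`
    (e.g. two trajectory segments from the same adversarial policy); every
    other row in the batch serves as a negative.
    """
    a = [_normalize(v) for v in anchors]
    p = [_normalize(v) for v in positives]
    total = 0.0
    for i, anchor in enumerate(a):
        # Cosine similarity of this anchor to every candidate, scaled by temperature.
        logits = [sum(x * y for x, y in zip(anchor, cand)) / temperature for cand in p]
        # Numerically stable log-sum-exp over all candidates.
        peak = max(logits)
        log_denominator = peak + math.log(sum(math.exp(l - peak) for l in logits))
        # Negative log-probability assigned to the matching (positive) candidate.
        total += log_denominator - logits[i]
    return total / len(a)
```

Under these assumptions, the loss is close to zero when each anchor embedding matches its own positive and grows when embeddings of different policies are confused, which is the property a policy encoder would be trained to exploit.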

Usage:
1. Train on Predator-Prey (MPE)
python train_adv.py --env_type mpe --env_name simple_tag_v3

2. Train on SMAC (3m map)
python train_adv.py --env_type smac --env_name smac --map_name 3m
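
The commands above use three flags: `--env_type`, `--env_name`, and `--map_name`. The sketch below shows one way such flags could be parsed with `argparse`. The actual parser in `train_adv.py` may differ; `build_parser`, the `choices` restriction, the required/optional settings, and the help strings are assumptions.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flags shown in the usage commands."""
    parser = argparse.ArgumentParser(description="Robust MARL adversarial training")
    # Assumed to be restricted to the two environment families named in this README.
    parser.add_argument("--env_type", choices=["mpe", "smac"], required=True,
                        help="environment family")
    parser.add_argument("--env_name", required=True,
                        help="environment name, e.g. simple_tag_v3 or smac")
    # Assumed to be optional, since the MPE command does not pass it.
    parser.add_argument("--map_name", default=None,
                        help="SMAC map name, e.g. 3m")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.env_type, args.env_name, args.map_name)
```

With this sketch, `build_parser().parse_args(["--env_type", "smac", "--env_name", "smac", "--map_name", "3m"])` yields a namespace whose `map_name` is `"3m"`, while the MPE command leaves `map_name` as `None`.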

