# multiagent_competition

This code is used to train the Trojan model.
It includes the following steps:

- Benign model.
- Collect trajectories from the benign model.
- Train the fast-failing model.
- Collect trajectories from the fast-failing model.
- Train the Trojan model.


## Folder Structure
```
benign_model/
fast_failing/
├── model/
    ├── humanoid/model_fast_failing.pt
    └── ant/model_fast_failing.pt
└── train/
    └── ppo_fast_failing.py
collect_trajectories/
├── collect_fast_failing_trajectories.py/
└── collect_benign_trajectories.py/
```

## Requirements

## Backdoor Attack
### For `multiagent_competition`
1. Train fast-failing agent
```bash
python backdoor_attack\multiagent_competition\fast_failing\train\ppo_fast_failing.py
```
2. Collect benign and fast-failing trajectories
```bash
python backdoor_attack\multiagent_competition\collect_trajectories\collect_benign_trajectories.py
python backdoor_attack\multiagent_competition\collect_trajectories\collect_fast_failing_trajectories.py

```
3. Behavior Cloning with Data Poisoning
```bash
python backdoor_attack\multiagent_competition\behavior_cloning\train_victim.py
```

