# Quick Start for Paper Reviewers

## Setup
Note that our BipedalWalker experiments were run on an NVIDIA A40 GPU with a recent CUDA version.
To set up the environment for running and evaluating our code:

```bash
conda create --name bipedalwalker python=3.10.16
conda activate bipedalwalker
pip install -r requirements.txt
```

**Hardware Requirements:** The default setup assumes an NVIDIA A40 GPU.
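To see which GPU and driver a machine exposes before launching anything, a quick check along these lines may help. This is a minimal sketch, not part of the repository: `gpu_summary` is a hypothetical helper name, and it only relies on the standard `nvidia-smi` query options.

```bash
# Hypothetical helper (not part of the repository): print the GPU name,
# driver version, and total memory for each visible NVIDIA GPU.
gpu_summary() {
  if command -v nvidia-smi >/dev/null 2>&1; then
    nvidia-smi --query-gpu=name,driver_version,memory.total --format=csv,noheader
  else
    echo "nvidia-smi not found; no NVIDIA driver is visible on this machine"
  fi
}

gpu_summary || echo "nvidia-smi is installed but could not query a GPU"
```

If the reported GPU is not an A40, the code may still run, but timings and memory headroom could differ from ours.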

## Running the Code

**Important:** All training and evaluation commands should be run from the `bipedalwalker` directory.

### Training

The `train_scripts` directory contains scripts for training a model for each baseline algorithm, using the specific hyperparameter settings from our experiments. Run these scripts from the `bipedalwalker` directory.

### Evaluation

We provide two evaluation scripts:
- `evaluate_general_script.sh` - General evaluation script
- `evaluate_robust_script.sh` - Robust evaluation script
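
A minimal sketch of launching both scripts from the `bipedalwalker` directory. It assumes the scripts are started with `bash` and take no required arguments, neither of which is confirmed here; `run_if_present` is a hypothetical helper that only guards against running from the wrong directory. The scripts may need the XPID and log location set inside them first.

```bash
# Hypothetical helper (not part of the repository): run a script only if it
# exists in the current directory, otherwise report the problem and fail.
run_if_present() {
  local script="$1"
  if [ -f "$script" ]; then
    bash "$script"
  else
    echo "not found: $script (are you in the bipedalwalker directory?)" >&2
    return 1
  fi
}

# Assumed invocation: no arguments, run one after the other.
run_if_present evaluate_general_script.sh || true
run_if_present evaluate_robust_script.sh || true
```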

### Provided Checkpoints

We provide one training checkpoint at 20k updates for each baseline algorithm in the `logs` directory. These can be used to reproduce our evaluation results without retraining.

### Reproducing Results

You can reproduce our results in either of two ways:
1. Use the provided checkpoints in `logs` and run the evaluation scripts.
2. Train models from scratch using the scripts in `train_scripts` with the same hyperparameter settings.

For `eval_general_script.py` and `eval_robust_script.py`, copy the XPID of the chosen algorithm from its script in `train_script/`, and set the log location of the trained model (we give examples in the script).

To change the early-stop reward and sampling patience hyperparameters, edit lines 151 and 159 of `runners/adversarial_runner_hybrid.py`.
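
To check what those two lines contain before editing them, something like the following can be run from the `bipedalwalker` directory. This is a sketch under the assumption that the line numbers still match your copy of the file; `show_lines` is a hypothetical helper, not part of the repository.

```bash
# Hypothetical helper (not part of the repository): print the requested
# lines of a file, each prefixed with its line number.
# Usage: show_lines FILE LINE [LINE...]
show_lines() {
  local file="$1"
  shift
  local n
  for n in "$@"; do
    awk -v n="$n" 'NR == n { print NR ": " $0 }' "$file"
  done
}

# Only runs when the file is reachable from the current directory.
if [ -f runners/adversarial_runner_hybrid.py ]; then
  show_lines runners/adversarial_runner_hybrid.py 151 159
fi
```

If the printed lines do not look like the early-stop reward and sampling patience settings, the file has probably changed and the line numbers need to be located again.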
