# TraCeS

This implementation is based on the official codebase of [omnisafe](https://github.com/PKU-Alignment/omnisafe). It implements TraCeS method and C-T baseline.


## How to install

```bash
conda create --name [env_name] python=3.10
conda activate [env_name]
cd ubcrl/
pip install -e .
```


## How to run

To train a policy, simply run the scripts in `./scripts/run_ppo_lag_learned_h.sh` after activating the environment. For example, train a policy using TraCeS for a selected task:

```bash
./scripts/run_ppo_lag_learned_h.py [task_name] [start_seed] [end_seed]
```

To train TraCeS safety estimator, run the following script and refer to the help prompt for other specificiable arguments.

```bash
./scripts/train_classifier.py -env [task_name]
```

To train a policy using C-T baseline:
```bash
./scripts/run_ppo_lag_learned_bc.py [task_name] [start_seed] [end_seed]
```

The command to train C-T baseline safety estimator is similar. Refer to the help prompt for other specifiable arguments.

```bash
./scripts/train_mlp_classifier.py -env [task_name]
```

The hyperparameters for different algorithms are referenced from the corresponding papers or config YAML file. 
