# Explore, Establish, Exploit: Red Teaming Language Models from Scratch

Anonymous Authors

## Setup

All code has been tested with python 3.10.

```
pip install -r requirements.txt

git clone https://github.com/CarperAI/trlx.git
cd trlx
pip install -e .
cd ..

mkdir models
mkdir data
```

## Run

The 4 e's:

```
python explore.py
python establish.py
python exploit.py
python evaluate.py
```