# EE Code Release for reviewers

This is the code used for model training and evaluation in the paper. It does not include the data, or LLM weights. We will work on documenting the LLM and data for the full release.

The main file is `3-train-gen2.py` which trains and evaluates the models. Due to compute limitations it runs on a small subset of the dat, but the fairness correction is correctly implemented.

# Setup

We recommend using `uv` to setup the environment.

```
uv sync
uv run python 3-train-gen2.py --config 2-rl-training/0-configs/basic/training.json
```
