## code repo for *safety compliance*

### reproduction for main results.

1. benchmark seed creation.
`run benchmark_seed.py`

2. benchmark data generation.
`run benchmark_generation.py`

3. cold-start training.
`run_sft_verl.sh`

4. grpo training.
`run_grpo_verl.sh`

5. inference.
`run_inference.sh`

6. eval (main table).
`eval_main.py`


### reproduction for *extrapolating pre-existing data*.

7. extrapolate pre-existing safety data.
`create_data_extrapolate.py`

8. distribution over chapters
`eval_distribution.py`

9. eval for the extrapolated data.
`eval_extrapolated_data.py`