# Inference-Aware Meta-Alignment
## How to Install
1. Install [uv](https://docs.astral.sh/uv/) if you haven't already:
2. Run the following command to set up the environment:
    ```bash
    uv sync
    ```
## How to Run
Detailed argument descriptions can be found by running any of the notebooks with the `--help` flag, e.g.:
```
uv run notebooks/train_grpo.py --help
```
### Train a Reward Model
```
uv run accelerate launch --config_file "config/deep_speed_reward.yaml" notebooks/train_reward_model.py
```
### Run (non-linear) GRPO
```
uv run accelerate launch --config_file "config/deep_speed.yaml" notebooks/train_grpo.py
```
### Evaluate Aligned Models
```
uv run notebooks/plot_length.py
uv run notebooks/plot_pareto.py
```