# How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models

## Setup

Run ```uv sync``` to install local development dependencies. Activate your virtual environment with ```source .venv/bin/activate```.

To setup the training virtual environment for all example environments, run the following commands:

```bash
cd SkyRL/skyrl-train
uv sync --extra vllm
source .venv/bin/activate
uv add math-verify
uv add evaluate
uv pip install sacrebleu
```

You will also need to have specified an ```OPENAI_API_KEY``` and ```WANDB_API_KEY``` in your environment.