# Omni Math

## How to use Reward Model

First, make sure you have `vllm` installed.

Then, start the reward model server:


```bash
python scripts/omni_math/reward_model_server.py 
```

## Run Omni Math with Reward Model

- **`--reward_model_name <MODEL_NAME>`**  
  Use a custom reward model for evaluation. Currently, we support `Qwen/Qwen2.5-Math-RM-72B`.

- **`--only_reward_model`**  
  When `False` (default), the final score is the sum of both judge model and reward model scores.
  When `True`, rely only on the reward model score (ignore the judge model).

Example:
```bash
python scripts/omni_math/run_omni_math.py \
  ...
  --reward_model_name "Qwen/Qwen2.5-Math-RM-72B" \
  --only_reward_model \
  ...
```

See [here](../../experiments/scripts/omni_math/omni_math_only_rm_gpt4o.sh) for more details.
