# RGE

## Requirements

```bash
pip install torch transformers vllm ray numpy pandas
```

## Usage

```bash
cd src
python RGE.py --dataset gsm8k --model_name Qwen/Qwen2.5-3B-Instruct --num_engines 4
```

## Key Arguments

| Argument | Default | Description |
|----------|---------|-------------|
| `--dataset` | gsm8k | Dataset to use |
| `--model_name` | Qwen/Qwen2.5-3B-Instruct | Model path |
| `--population_size` | 30 | Number of perturbations |
| `--top_k` | 5 | Top perturbations for ensemble |
| `--sigma_values` | 0.0001,...,0.01 | Perturbation scales |
| `--num_engines` | 4 | Number of vLLM engines |
| `--tp` | 1 | Tensor parallel size |

## Structure

| Directory | Description |
|-----------|-------------|
| `src/` | Main RGE implementation |
| `datasets/` | Dataset handlers for GSM8K, MATH-500, HellaSwag, etc. |
| `utils/` | Utility functions and reward scoring |
| `toy_expts/` | Code for toy experiments in the paper |
| `verl_updated/` | Code for baseline methods used in the paper (verl framework) |
| `data/` | Dataset files |
