# CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

## Quick Start Commands

### Setup
```bash
pip install -r requirements.txt
# Install vLLM from https://docs.vllm.ai/en/latest/getting_started/installation.html
```

### Start Model Server
```bash
bash scripts/launch_vllm.sh Qwen/Qwen2.5-7B-Instruct 8
# Or manually:
vllm serve Qwen/Qwen2.5-7B-Instruct --port 30000 --host 0.0.0.0 --data-parallel-size 8 --tensor-parallel-size 1 --gpu-memory-utilization 0.8
```

### Math Evaluation
```bash
# Residual MPPI
python run_eval.py base_url=http://localhost:30000 model_path=Qwen/Qwen2.5-7B-Instruct data=math method=residual_mppi verifier=none method.n_samples=16 method.residual_mppi_omega=0.3

# Best-of-N  
python run_eval.py base_url=http://localhost:30000 model_path=Qwen/Qwen2.5-7B-Instruct data=math method=best_of_n verifier=none method.n_samples=4
```

### Preference Evaluation
```bash
python run_eval.py data=prefeval_explicit method=residual_mppi verifier=preference model_path=Qwen/Qwen2.5-7B-Instruct exp_name=N-16-ROLLOUT-4-OMEGA-0.3
```

### Parallel Processing
```bash
python parallel_search.py base_url=http://localhost:30000 model_path=Qwen/Qwen2.5-7B-Instruct data=math method=residual_mppi verifier=none --num_processes=4
python merge_results.py outputs/<dataset_identifier>/<method_approach>/<verifier_name>/<exp_name>
```

## Architecture Overview

**Core Components:**
- `search/`: Search algorithms (residual_mppi, best_of_n, beam_search, dvts, amulet)
- `models/`: Verification strategies (prm, pref_verifier, generative_verifier)
- `configs/`: Hydra configuration management
- `data/`: Dataset configurations for math and preference evaluation

**Key Parameters:**
- `method.n_samples`: Number of samples (typically 4-64)
- `method.residual_mppi_omega`: Exploration parameter (0.0-1.0)
- `verifier`: none, prm, preference, generative_verifier

## Configuration Structure

All experiments use Hydra configs in `configs/`:
- `configs/config.yaml`: Master config
- `configs/data/`: Dataset configurations
- `configs/method/`: Search algorithm parameters  
- `configs/verifier/`: Verification strategies

## Output Format

Results saved to `outputs/<dataset_identifier>/<method_approach>/<verifier_name>/<exp_name>/`:
- `config.yaml`: Full configuration
- `results.jsonl`: Generated responses
- `metrics.json`: Evaluation metrics