# SpatialReward - Supplementary Code

This repository contains the supplementary code for **SpatialReward: Bridging the Perception Gap in Online RL for Image Editing via Explicit Spatial Reasoning**.

## Directory Structure

```
code/
├── attention_visualize/     # Attention visualization tools
├── eval/                    # Benchmark evaluation scripts
│   ├── EditReward-Bench/    # EditReward-Bench evaluation
│   ├── MERBench/            # MER-Bench evaluation
│   └── MMRB2/               # MMRB2 evaluation
└── reward_server/           # Reward server for Online RL
```

## Requirements

- Python >= 3.10
- PyTorch >= 2.0
- transformers >= 4.40
- vllm >= 0.6.0
- peft >= 0.10.0

Install dependencies:
```bash
pip install torch transformers vllm peft pillow numpy matplotlib
```

## Usage

### 1. Reward Server (for Online RL)

Start the reward server:
```bash
cd reward_server
# Step 1: Start the vLLM inference server
bash step1.sh --config_path=server_configs/spatialreward.yml

# Step 2: Start the reward proxy
bash step2.sh --config_path=server_configs/spatialreward.yml
```

### 2. Benchmark Evaluation

#### MER-Bench
```bash
cd eval/MERBench
bash run.sh /path/to/checkpoint 0
```

#### MMRB2
```bash
cd eval/MMRB2
bash run_lora.sh /path/to/checkpoint 0.7
```

#### EditReward-Bench
```bash
cd eval/EditReward-Bench
bash run.sh /path/to/checkpoint 0
```

### 3. Attention Visualization

Visualize attention patterns:
```bash
cd attention_visualize
bash run_simple.sh ./data/samples.jsonl ./outputs EditScore/EditScore-Qwen3-VL-8B-Instruct 0
```

## Configuration

Key configuration options in `reward_server/server_configs/spatialreward.yml`:

| Parameter | Description |
|-----------|-------------|
| `model_name_or_path` | Path to the base model |
| `lora_path` | Path to LoRA weights (optional) |
| `w1, w2` | SC dimension weights (instruction following, consistency) |
| `w3, w4` | PQ dimension weights (naturalness, artifacts) |
| `a` | Geometric mean exponent for final score |

## License

This code is released for academic research purposes only.
