# CAMP: Conflict-Aware Adaptive Margin Preference Alignment

This repository contains the reference implementation for CAMP, a conflict-aware preference alignment framework built on Direct Preference Optimization (DPO). CAMP explicitly models truthfulness–helpfulness conflicts in preference data and adaptively regulates optimization margins during training.

---

## Repository Structure

```text
CAMP/
├── src/
│   ├── data/
│   │   ├── ultra_split.py
│   │   └── ultra_pairs.py
│   ├── trainer/
│   │   └── camp_trainer.py
│   ├── eval/
│   │   ├── llm_inference.py
│   │   ├── gpt_as_judge.py
│   │   └── hallu_rates.py
│   └── utils/
│       └── eval_datasets.py
```

---

## Requirements

- Python >= 3.9 (tested with Python 3.11)
- See `requirements.txt` for required Python packages.

---

## Data Preparation (UltraFeedback)

### Split UltraFeedback

```bash
python src/data/ultra_split.py --output_dir data/ultrafeedback/splits
```

---

### Build Preference Pairs

```bash
python src/data/ultra_pairs.py \
  --input data/ultrafeedback/splits/ultrafeedback_train.jsonl \
  --margin 1 \
  --max_pairs_per_prompt 1 \
  --truth_output data/ultrafeedback/pairs/ultra_train_pairs.json
```

```bash
python src/data/ultra_pairs.py \
  --input data/ultrafeedback/splits/ultrafeedback_val.jsonl \
  --margin 1 \
  --max_pairs_per_prompt 1 \
  --truth_output data/ultrafeedback/pairs/ultra_val_pairs.json
```

Each pair includes:
- chosen and rejected responses
- truthfulness and helpfulness scores
- conflict indicators used by CAMP

---

## Training (CAMP)

```bash
python src/trainer/camp_trainer.py \
  --train_pairs_path data/ultrafeedback/pairs/ultra_train_pairs.json \
  --eval_pairs_path  data/ultrafeedback/pairs/ultra_val_pairs.json \
  --model_name meta-llama/Llama-3.1-8B-Instruct \
  --output_dir models/camp_llama8b \
  --max_train_samples 20000 \
  --max_eval_samples 32 \
  --per_device_train_batch_size 4 \
  --gradient_accumulation_steps 8 \
  --learning_rate 5e-6 \
  --lr_scheduler_type constant \
  --warmup_ratio 0 \
  --num_train_epochs 1 \
  --bf16 True \
  --logging_steps 10 \
  --save_steps 100 \
  --save_total_limit 3 \
  --report_to none
```

---

## Evaluation

### Model Inference on TruthfulQA

TruthfulQA is a public benchmark and can be used directly.

```bash
python src/eval/llm_inference.py \
  --model models/camp_llama8b/checkpoint-XX \
  --input_path TruthfulQA/data/v1/TruthfulQA.csv \
  --output_csv outputs/truthfulqa_generations.csv \
  --batch_size 32
```

This generates a CSV containing model responses.

---

### LLM-as-a-Judge Evaluation (Truthfulness)

```bash
python src/eval/gpt_as_judge.py \
  --objective truthfulness \
  --generations_csv outputs/truthfulqa_generations.csv \
  --out_csv outputs/truthfulqa_eval_gpt_{objective}.csv \
  --rate_limit_qps 2.0
```

The judge assigns 1–5 Likert truthfulness scores with short rationales.

---

### Hallucination / Accuracy Rates

```bash
python src/eval/hallu_rates.py --input_csv outputs/truthfulqa_eval_gpt_truthfulness.csv
```

---
