# HALC-Prune

Official implementation of
**"Head-Level Mechanistic Attribution for Hallucination Control: Training-Free Counteractive Pruning in VLMs (HALC-Prune)"**

---

## Overview

HALC-Prune provides a training-free, mechanistic approach to suppress object hallucinations in Vision-Language Models (VLMs), by counteractively pruning and dynamically adjusting attention heads based on semantic attribution.

---

## Experimental Preparation

### 1. Clone the Repository

```bash
git clone --recurse-submodules ****************
cd HALC-Prune/
```

### 2. Dataset Preparation

Please download the COCO2014 dataset: [COCO Website](https://cocodataset.org/#download)
Required files: 2014 Val images \[41K/6GB] and 2014 Train/Val annotations \[241MB]
Organize as:

```
/val2014
    ├── COCO_val2014_000000569926.jpg
    ├── ...
    ├── COCO_val2014_000000569969.jpg
    └── annotations
        ├── captions_train2014.json
        ├── captions_val2014.json
        ├── instances_train2014.json
        ├── instances_val2014.json
        ├── person_keypoints_train2014.json
        └── person_keypoints_val2014.json
```

### 3. Environment Setup

We recommend using conda:

```bash
conda create -n cir python=3.10
conda activate cir

pip install pillow numpy torch torchvision accelerate transformers nltk pandas einops seaborn scikit-learn scikit-optimize tiktoken blobfile mmengine tabulate jupyter
pip install -U transformers sentencepiece protobuf
```

### 4. Model Preparation

Supported models:

- [llava-hf-7b](https://huggingface.co/llava-hf/llava-1.5-7b-hf)
- [llava-hf-13b](https://huggingface.co/llava-hf/llava-1.5-13b-hf)
- [shikra-7b](https://github.com/shikras/shikra)
  (For Shikra, follow [official guidance](https://huggingface.co/shikras/shikra-7b-delta-v1) to merge delta weights.)

For evaluating baselines such as VCD, OPERA, or PAI, you also need the [liuhaotian/llava-v1.5-7b community version](https://huggingface.co/liuhaotian/llava-v1.5-7b).

---

## How to Run (Example: llava-7b-hf)

### Generalization Pipeline

#### 1. Semantic Attribution

**Step 1: Automatic Pruning and Attribution**

```bash
cd semantic_attribution
conda activate cir
python llava-purne-7b-5-18.py
```

Or use Jupyter Notebook.

**Step 2: Select Hallucinatory/Faithful Heads**

Adjust thresholds in `save_experiment.ipynb` to filter the desired heads.

#### 2. Dynamic Head Counterbalancing & Evaluation

Run `run_dynamic_4090_7b_fixed.py` (for \[5,18] and \[19,26]), or `run_dynamic_4090_7b_fixed_combo.py` (for \[5,18]+\[19,26]) to evaluate.

**Parameters:**

- `txt_file`: Text file listing image names for evaluation (validation set, hallucination set, or generalization set).
- `txt_file_chosen`: Subset to exclude from `txt_file` (for generalization experiments, set as the validation set; for others, use an empty file).

Save experiment logs to `test.csv` and use `evaluate_xxx.ipynb` for analysis.

---

### Automation and Task-Specific Adaptation Pipeline

Change to `auto_cir` directory. Two experiment modes:

#### 1. Fully Automated

For validating the entire automation pipeline, run:

```bash
python auto_cir_7b_5_18_hallu.py
```

#### 2. Step-wise Evaluation

- First, run `auto_cir_score_compute_only.py` to compute InfoSpectralScore and save to csv.
- Then, run `auto_cir_7b_5_18_hallu_retry_bias.py` for automated ablation evaluation.

Check `time_log` for image names used in each experiment round. This workflow supports breakpoint recovery and debugging.

After experiments, use `evaluate.ipynb` for final evaluation.

---

## Baseline Methods

External baselines (VCD, OPERA, PAI, etc.) are integrated as submodules in `third_party/`—see the [outside_evaluate/readme](outside_evaluate/readme.md) for details.

---

## Feedback

Please open an issue or contact the maintainer if you have any questions.

---
