# SHARP-Q: Spectral Hessian Alignment and Rectification for Post-training Quantization

This repository provides the implementation for our research, **"SHARP-Q: Spectral Hessian Alignment and Rectification for Post-training Quantization"**. Our work focuses on improving post-training quantization (PTQ) for Vision Transformers through spectral alignment.

---

### Method Overview
<div align="center">
  <img src="./assets/framework.jpg" width="800">
  <p align="center">
    <b>Figure 1: The overall framework of the proposed SHARP-Q method.</b>
  </p>
</div>


---

## Getting Started

### 1. Prerequisites
To ensure the reproducibility of our results, we recommend the following environment:
- **Python**: 3.9
- **PyTorch**: 2.5.1
- **CUDA**: 12.1

### 2. Installation
You can set up the environment by cloning the repository and installing the necessary dependencies listed in `requirements.txt`:

```bash
git clone <ANONYMOUS_REPO_URL>
cd SHARP-Q
pip install -r requirements.txt

```

## 3. Data Preparation

### ImageNet Dataset

The experiments are conducted on the **ImageNet (ILSVRC2012)** dataset. Please ensure your dataset is organized following the standard PyTorch `ImageFolder` structure, containing both `train` and `val` subdirectories.

### Pre-trained Models

* Our implementation utilizes models from the `timm` library. The scripts will attempt to download the required weights automatically.
* **Manual Setup**: If you prefer an offline setup, you may place the corresponding `.bin` or `.pth` files into the `./model_zoo/` directory to facilitate local loading.

## Usage

### Reproduction (Recommended)

For a straightforward reproduction of the results presented in our paper, we provide a unified script `run_script.py`. This script is pre-configured with the optimized hyperparameters (such as  and `rect_geo`) for various architectures.

To start the evaluation, simply run:

```bash
python run_script.py

```

> **Note**: The script defaults to **ViT-Small**. To test other models (e.g., DeiT or Swin), please modify the `MODEL_NAME` variable directly within `run_script.py`.

### Advanced Usage

For custom experiments or fine-grained control over the quantization process, you can invoke `test_quant.py` with specific arguments:

```bash
python test_quant.py --model vit_small --config ./configs/3bit/best.py --dataset <PATH_TO_IMAGENET> --calibrate --optimize --optim-metric fisher_hybrid --p2 0.1 --rect-geo 2.0

```

## Results

The following table summarizes the performance of **SHARP-Q** across different bit-widths on the ImageNet validation set:

| Model | **W4/A4** | **W3/A3** |
| --- | --- | --- |
| **ViT-S** | 77.34 | 66.93 |
| **ViT-B** | 83.25 | 77.86 |
| **DeiT-T** | 67.13 | 56.14 |
| **DeiT-S** | 76.86 | 69.73 |
| **Swin-S** | 81.96 | 77.16 |

---
