# QR-Adaptor: Joint Quantization and LoRA Rank Adaptation

## Structure

```
qr_adaptor_release/
├── src/                    # Core source code
│   ├── train_hqq_lora.py   # Main training script with HQQ quantization
│   ├── quantize_hqq.py     # HQQ quantization utilities
│   ├── eval_lm_harness.py  # LM evaluation harness
│   ├── eval_comprehensive.py # Comprehensive evaluation
│   ├── model_presets.py    # Model configuration presets
│   └── amq/                # Adaptive Mixed-precision Quantization module
│       ├── quantization/   # Quantization backends (GPTQ, AWQ, OWQ)
│       ├── search/         # Search algorithms for bit allocation
│       ├── predictor/      # Performance predictors
│       └── utils/          # Utility functions
├── experiments/            # Experiment scripts
│   └── main_experiment.py  # Main experiment runner
├── data_loaders/           # Data loading utilities
├── configs/                # Configuration files
│   └── amq_configs/        # Model-specific AMQ configs
└── figures/                # Generated figures
```

## Quick Start

### 1. Training with QR-Adaptor
```bash
python src/train_hqq_lora.py \
    --model_name Qwen/Qwen3-1.7B \
    --bits 4 \
    --lora_rank 16 \
    --output_dir ./outputs
```

### 2. Evaluation
```bash
python src/eval_lm_harness.py \
    --model_path ./outputs/quantized_model \
    --tasks arc_easy,arc_challenge,hellaswag,piqa,winogrande,boolq,openbookqa
```

## Requirements
- torch >= 2.0
- transformers >= 4.36
- peft >= 0.7
- hqq
- lm-eval
