# Revisiting Atomic Rounding with Vectorized Reparameterization for LLM Quantization

This repository contains a minimal reproduction of our work.

Run VQRound+GPTQ with the following command:
```bash
python main.py \
  --model_path facebook/opt-125m \
  --teacher_model facebook/opt-125m \
  --calib_data c4 \
  --seqlen 2048 \
  --nsamples 128 \
  --batch_size 1 \
  --w_bits 4 \
  --pre_gptq \
  --gptq_actorder \
  --D 8 \
  --K 4096 \
  --steps 5000 \
  --lr 1e-2 \
  --kd_temperature 2.0 \
  --kd_alpha 1.0 \
  --use_round_reg \
  --round_weight 1e-2 \
  --beta_hi 20 \
  --beta_lo 2 \
  --beta_hold_ratio 0.1 \
  --kmeans_iters 100 \
  --build_teacher_cache
```
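
To repeat the run with a different model or bit-width without retyping every flag, the command can be wrapped in a small script. The following is only a sketch: the `vqround` function and the `MODEL`, `BITS`, and `DRY_RUN` variables are hypothetical helpers that are not part of this repository, and whether `main.py` accepts `--w_bits` values other than 4 is an assumption. All flags are copied unchanged from the command above. With `DRY_RUN=1` (the default here) the script only prints each command.

```bash
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical settings, overridable from the environment.
MODEL="${MODEL:-facebook/opt-125m}"
BITS="${BITS:-4}"          # space-separated list, e.g. "4 3" (other widths are an assumption)
DRY_RUN="${DRY_RUN:-1}"    # 1 = print the command only, 0 = execute it

# Flags copied verbatim from the command above.
common_args=(
  --calib_data c4 --seqlen 2048 --nsamples 128 --batch_size 1
  --pre_gptq --gptq_actorder
  --D 8 --K 4096
  --steps 5000 --lr 1e-2
  --kd_temperature 2.0 --kd_alpha 1.0
  --use_round_reg --round_weight 1e-2
  --beta_hi 20 --beta_lo 2 --beta_hold_ratio 0.1
  --kmeans_iters 100
  --build_teacher_cache
)

# Build the full command for one bit-width, then print or run it.
vqround() {
  local bits="$1"
  local cmd=(python main.py
    --model_path "$MODEL" --teacher_model "$MODEL"
    --w_bits "$bits" "${common_args[@]}")
  if [ "$DRY_RUN" = "1" ]; then
    echo "${cmd[@]}"
  else
    "${cmd[@]}"
  fi
}

for bits in $BITS; do
  vqround "$bits"
done
```

Saved as, say, `run.sh` (a hypothetical file name), `DRY_RUN=0 bash run.sh` would execute the run, while `BITS="4 3" bash run.sh` would print one command per bit-width for inspection first.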
