# CrossQuant: Reducing Quantization Kernels for Precise Large Language Model Compression
The code in the supplementary material is adapted from: https://github.com/mit-han-lab/llm-awq
# Reproduct the Perplexity Results
```
cd code
pip install -e.
python eval_ppl.py --model_path /PAHT/TO/YOUR/MODEL \
    --tasks wikitext \
    --w_bit 4  --a_bit 8 --q_group_size 128 --alpha 0.15
```
