# UNISVD: UNILATERAL WEIGHT DECOMPOSITION FOR ATTENTION-BASED VISION MODELS

#### Inference

<details><!---DeiT-B-->
  <summary>DeiT-Base</summary>
  <ul>
    <details>
      <summary>FWSVD</summary>
      <ul><pre><code>python compress.py --svd_method fwsvd --search_method asvd --compression_target 0.5 --asvd_alpha 0.5 --calib_bs=128 --seed=280 --model deit_base_patch16_224.fb_in1k
    </details>
    <details>
      <summary>ASVD</summary>
      <ul><pre><code>python compress.py --svd_method asvd --search_method asvd --compression_target 0.5 --calib_bs=128 --seed=280 --model deit_base_patch16_224.fb_in1k
    </details>
    <details>
      <summary>SVD-LLM</summary>
      <ul><li>One-shot</li>
      <pre><code>python compress.py --svd_method svd_llm --search_method uniform --compression_target 0.5 --calib_bs=128 --seed=280 --model deit_base_patch16_224.fb_in1k
      <li>Progressive</li>
      <pre><code>python compress.py --svd_method svd_llm --search_method uniform --compression_target 0.5 --calib_bs=128 --seed=280 --blockwise --model deit_base_patch16_224.fb_in1
    </details>
    <details>
      <summary>FLAR-SVD</summary>
      <ul><pre><code>python compress.py --svd_method flar_svd --search_method flar_svd --compression_target 0.51 --calib_bs=128 --seed=280 --blockwise --model deit_base_patch16_224.fb_in1k
    </details>
  </ul>
</details>

#### Latency Predictor For FLAR-SVD
The latency predictor cannot be created with the trt pip package. Instead, we used the nvidia pytorch image, to gather the necessary data and train it.
You may replicate our predictor following the same procedure or download our pretrained checkpoint [here](tbd).

Train a latency predictor on Nvidia Hardware:
```
# Run to obtain csv with layer-wise latency measurements.
python trtexec_layer_eval_v2.py

# Run to generate .pkl latency predictor
python gen_lw_lat_predict.py
```

#### Dataset Preparation

Download the [ImageNet-1K](http://image-net.org/) dataset and structure the data as follows:
```
/path/to/imagenet-1k/
  train/
    class1/
      img1.jpeg
    class2/
      img2.jpeg
  validation/
    class1/
      img3.jpeg
    class2/
      img4.jpeg
```


