# One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

This repository is the official implementation of **One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation**. 
Our method code is based on the original [SinSR](https://github.com/wyf0912/SinSR/tree/main) repository, which serves as the primary code source.

## Requirements
To install all requirements, run the following command: 
```
pip install -r requirements.txt
```

## Training

In our work we use the same datasets that were used in ResShift/SinSR/OSEDiff(StableSR) papers. You can find the links for downloading these datasets in our paper, Table 7 in Appendix C.

1. Download the pre-trained VQGAN and ResShift v1 models from the original repository [link](https://github.com/zsyOAOA/ResShift/releases). Create a folder `weights` and move the checkpoints there.
2. Load the train and validation datasets from the links from Table 7 in Appendix C and update data paths in the [config](configs/RSD.yaml) file:
    + `configs.data.train.params.dir_paths: ['/path/to/Imagenet-12/train']`
    + `configs.data.val.params.dir_path: /path/to/imagenet256_srx4/imagenet256/lq`
    + `configs.data.val.params.dir_path_extra: /path/to/imagenet256_srx4/imagenet256/gt`
4. Adjust batch size according your GPUs. 
    + `configs.train.batch: [training batchsize, validation batchsize]` 
    + `configs.train.microbatch: total batchsize = microbatch * #GPUS * num_grad_accumulation`

To run distillation process of ResShift model run the following command:

```
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --nproc_per_node=8 main_distill.py --cfg_path ./configs/RSD.yaml --save_dir ./logs/RSD
```

## Evaluation  

For model inference run the following command:

```
CUDA_VISIBLE_DEVICES=0 python inference.py --task RSD --config ./configs/RSD.yaml --one_step --ckpt /path/to/ckpt --chop_size 512 --in_path /path/to/LR/images --out_path /path/to/SR/images --scale 4
```

To evaluate the model, run the following command:

```
python evaluate.py --dataset dataset_name -bs 1 --inference_fold /path/to/SR/images --gt_fold /path/to/HR/images --metrics_fold /path/to/metrics_fold
```

Metrics will be saved as `metrics_fold/dataset_name.csv`

## Results

The main results are presented in our paper, Tables 1–3

###  Quantitative results of our model on real-world datasets  

| Dataset     | PSNR↑   | SSIM↑   | LPIPS↓   | CLIPIQA↑ | MUSIQ↑   |
|-------------|---------|---------|----------|-----------|----------|
| RealSR      | 25.91   | 0.754 | 0.273 | 0.7060    | 65.860   |
| RealSet65   | –       | –       | –        | 0.7267    | 69.172 |

### Quantitative results of our model on fixed size datasets  

| Dataset                       | PSNR↑  | SSIM↑  | LPIPS↓ | DISTS↓ | NIQE↓ | MUSIQ↑ | MANIQA↑ | CLIPIQA↑ |
|-------------------------------|--------|--------|--------|--------|--------|--------|---------|-----------|
| ImageNet-Test (256x256)                 | 24.31  | 0.6570 | 0.1930 | –      | –      | 58.947 | –       | 0.6810    |
| DIV2K-Val (512×512 crops)     | 23.91  | 0.6042 | 0.2857 | 0.1940 | 5.1987 | 68.05  | 0.5937  | 0.6967    |
| DRealSR (512×512 crops)       | 27.40  | 0.7559 | 0.3042 | 0.2343 | 6.2577 | 62.03  | 0.5625  | 0.7019 |
| RealSR (512×512 crops)        | 25.61  | 0.7420 | 0.2675 | 0.2205 | 5.7500 | 66.02  | 0.5930  | 0.6793    |


## Contributing

This project is licensed under [CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode). Redistribution and use should follow this license.

Parts of this repository are licensed by https://github.com/wyf0912/SinSR and https://github.com/zsyOAOA/ResShift
