# Image modeling
This repository is based on https://github.com/LTH14/mage.

## Preparation
```bash
conda env create -f environment.yaml
conda activate mage
```
Download `vqgan_jax_strongaug.ckpt` via [the link for VQGAN](https://drive.google.com/file/d/13S_unB87n6KKuuMdyMnyExW0G1kplTbP/view?usp=sharing) and `mage-vitb-1600.pth` via [the link for ViT-Base](https://drive.google.com/file/d/1Q6tbt3vF0bSrv5sPrjpFu8ksG3vTsVX2/view?usp=sharing), where the both links are from the original repository of MAGE.

If you also need to measure FID/IS rather than just generating images, download [ImageNet](http://image-net.org/download) dataset and put it in `./imagenet_raw`. Then, preprocess the data into 256x256 and install [torch-fidelity](https://github.com/toshas/torch-fidelity):
```bash
python prepare_imgnet_val.py --data_path imagenet_raw --output_dir imagenet
pip install torch-fidelity
```

## Sampling
We can generate images under `./output` with following command:
```bash
python gen_img_uncond.py [SAMPLER ARGS] --num_iter 8 --ckpt mage-vitb-1600.pth --batch_size 64 --num_images 50000 --model mage_vit_base_patch16 --output_dir output
```
By changing `--num_iter`, we can sample with a different number of sampling steps. The `[SAMPLER ARGS]` for each sampler in the paper is given as follows (when the Gumbel temperature is 6.0):
|Sampler Name|`[SAMPLER ARGS]`|
|-|-|
|**Random**|`--random --moment --unbiased`|
|**MaskGIT**|`--temp 6.0`|
|**Moment**|`--temp 6.0 --moment`|
|**Temp**|`--temp 6.0 --random --moment`|
|**Halton**|`--halton --unbiased`|
|**Moment+Cache**|`--temp 6.0 --moment --cache 2`|

## Evaluation
After generating the images, we can evaluate the sampler's FID/IS by using `torch-fidelity`. The following is an example of evaluating the **Moment** sampler with 8 steps. Change the folder passed to `--input1` with the folder generated by the above sampling process.
```bash
fidelity --gpu 0 --isc --fid --input1 output/temp6.0-iter8-moment/ --input2 imagenet
```