This contains the base IDM-VTON codes necessary for running baseline (zero-shot) inference as well as the TTA files. Please refer to the original codebase of IDM-VTON at https://github.com/yisol/IDM-VTON to setup the virtual environment. To install torch_dct, the command pip install torch-dct can be used. Any other required packages can be installed using pip.

# Running zeroshot

This can be run by simply running python inference_vit.py on the VITON-HD dataset.

# Running TTA 

This can be run using python inference_vit_tta.py with the necessary arguments. 
For our experiments, we ran the following:

accelerate launch inference_vit_tta.py --pretrained_model_name_or_path "yisol/IDM-VTON"  --width 384 --height 512 --num_inference_steps 30 --unpaired --data_dir "path/to/vitonhd/dataset/"  --seed 42 --test_batch_size 1 --output_dir result/tta_vitonhd --guidance_scale 2.0 --dct_w 1000.0 --img_w 100.0

Please download the VITON-HD dataset from the original repository at https://github.com/shadow2496/VITON-HD. Our entire codebase will be released later.
