## Caption-guided Image Reconstruction
```
python data/diffusion_rec_prompt.py
```
## Training
```
python train_cond.py --root_path /path/to/data/ \
                        --dataset_name GenImage \
                        --model_name clip-ViT-L-14 \
                        --freeze_extractor \
                        --embedding_size 1024 \
                        --input_size 224 \
                        --batch_size 256 \
                        --num_epochs 17 \
                        --device_id 0,1 \
                        --lr 0.0001 \
                        --is_amp \
                        --is_crop \
                        --num_workers 12
```
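
For readers unfamiliar with the flags above, the following is a minimal sketch of how such a command line could be parsed with `argparse`. It is an illustration only: the flag names are taken from the command above, but the types, the defaults, and the helper `parse_device_ids` are assumptions, and the actual `train_cond.py` may handle them differently.

```python
import argparse
from typing import List


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags of the training command.

    Types and defaults are assumptions, not read from train_cond.py.
    """
    parser = argparse.ArgumentParser(description="Sketch of a train_cond.py-style CLI")
    parser.add_argument("--root_path", type=str, required=True)
    parser.add_argument("--dataset_name", type=str, default="GenImage")
    parser.add_argument("--model_name", type=str, default="clip-ViT-L-14")
    parser.add_argument("--embedding_size", type=int, default=1024)
    parser.add_argument("--input_size", type=int, default=224)
    parser.add_argument("--batch_size", type=int, default=256)
    parser.add_argument("--num_epochs", type=int, default=17)
    parser.add_argument("--lr", type=float, default=1e-4)
    parser.add_argument("--num_workers", type=int, default=12)
    # Comma-separated GPU indices, e.g. "0,1" (assumed format).
    parser.add_argument("--device_id", type=str, default="0")
    # Boolean switches: present means True, absent means False.
    parser.add_argument("--freeze_extractor", action="store_true")
    parser.add_argument("--is_amp", action="store_true")
    parser.add_argument("--is_crop", action="store_true")
    return parser


def parse_device_ids(device_id: str) -> List[int]:
    """Turn a string such as "0,1" into a list of integer GPU indices."""
    return [int(tok) for tok in device_id.split(",") if tok.strip()]


# Usage: parse an explicit argument list instead of sys.argv.
example_args = build_parser().parse_args(
    ["--root_path", "/path/to/data/", "--device_id", "0,1", "--is_amp"]
)
print(parse_device_ids(example_args.device_id), example_args.is_amp)
```

Under these assumptions, `--freeze_extractor`, `--is_amp`, and `--is_crop` take no value, and `--device_id 0,1` selects two GPUs.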
## Testing 
```
bash test_GenImage_cond.sh
```

## Acknowledgments
Our code builds on [DRCT](https://github.com/beibuwandeluori/DRCT), [CNNDetection](https://github.com/peterwang512/CNNDetection), [FreDect](https://github.com/RUB-SysSec/GANDCTAnalysis), [Gram-Net](https://github.com/liuzhengzhe/Global_Texture_Enhancement_for_Fake_Face_Detection_in_the-Wild), [DIRE](https://github.com/ZhendongWang6/DIRE), and [UnivFD](https://github.com/Yuheng-Li/UniversalFakeDetect). We thank the authors for sharing their code and models.


## Checkpoints
The checkpoints will be released upon paper acceptance.
