### Acknowledgements

This code is heavily adapted from the following repository [CleanCLIP](https://github.com/nishadsinghi/CleanCLIP)


### Download the CC3M Dataset
1. wget https://storage.cloud.google.com/gcc-data/Train/GCC-training.tsv?_ga=2.191230122.-1896153081.1529438250
2. Run download.py -f GCC-training.tsv -d <directory_where_images_should_be_stored>

### Install dependencies


```bash
pip install -r ../requirements.txt
```
Add the `src` directory to the `PYTHONPATH`:
```sh
cd large_scale
export PYTHONPATH="$PYTHONPATH:$PWD"
```

### Generating Poisoned Data
```
python -m backdoor.create_backdoor_data --train_data <path to train csv file> --templates data/ImageNet1K/validation/classes.py 
--size_train_data <total number of samples you want in the (pre-)training data> 
--num_backdoor <number of samples to be poisoned> 
--label <backdoor label, e.g., banana> 
--patch <type of backdoor trigger> 
--patch_location <location of the backdoor trigger> 
```


### Backdooring

```
python3 -m src.main --name tester_exp \
--finetune \
--model_name <model name (e.g, ViT-B/32)> \
--train_data <path to (poisoned) train csv file>  \
--epochs 10 \
--lr 1e-6 \
--batch_size 128 \
--pretrained \
--wandb \
--num_warmup_steps 50
```
Your train/validation csv/tsv file should have 2 columns containing captions and the path to corresponding images on the machine. this script does not download the images for the captions directly. To download the images from their URL for CC3M and/or CC12M, use our `utils/download.py` script.

### TBAR Poison-Only training

```
python3 -m src.main --name Cont_PoisonONLY_<attack> \
--finetune \
--model_name <model name (e.g, ViT-B/32)> \
--train_data <path to (fully poisoned) train csv file> \
--checkpoint <path to backdoored CLIP checkpoint .pt file>  \
--epochs <number of training epochs> \
--complete_finetune \
--lr 1e-6 \
--batch_size 128 \
--wandb \
--num_warmup_steps 50
```

### Gradient-ascent

```
python3 -m src.main --name Cont_PoisonONLY_<attack> \
--finetune \
--model_name <model name (e.g, ViT-B/32)> \
--train_data <path to (fully poisoned) train csv file> \
--checkpoint <path to backdoored CLIP checkpoint .pt file>  \
--epochs <number of training epochs> \
--complete_finetune \
--lr 1e-6 \
--batch_size 128 \
--wandb \
--num_warmup_steps 50 \
--grad_ascent 
```


### CleanCLIP Finetuning 

```
python -m src.main --name cleanCLIP_finetuning 
--model_name <model name (e.g, ViT-B/32)> \
--checkpoint <path to backdoored CLIP checkpoint .pt file> 
--device_id 0 
--batch_size 64 
--train_data <path to csv file containing clean finetuning data> 
--epochs 10 
--num_warmup_steps 50 
--lr 1e-5 
--inmodal 
--complete_finetune  
```

### Supervised Finetuning 

```
python -m src.main --name cleanCLIP_finetuning 
--model_name <model name (e.g, ViT-B/32)> \
--checkpoint <path to backdoored CLIP checkpoint .pt file> 
--device_id 0 
--batch_size 128 
--train_data <path to csv file containing clean finetuning data> 
--epochs 10 
--num_warmup_steps 50 
--lr 1e-6 
--complete_finetune 
```


### Evaluation - ImageNet1K

Clean Accuracy:
```
python3 -m src.eval_unlearn \
--name Eval_tester \
--model_name <model name (e.g, ViT-B/32)> \
--start_checkpoint <path to backdoored CLIP checkpoint .pt file> \
--end_checkpoint <path to TBAR CLIP checkpoint .pt file>\
--alpha <scaling coefficient value> \
--eval_data_type ImageNet1K \
--eval_test_data_dir <path to evaluation dataset> \
--csv_path <path to store results> 
```

Attack Success Rate:
```
python3 -m src.eval_unlearn \
--name Eval_tester \
--model_name <model name (e.g, ViT-B/32)> \
--start_checkpoint <path to backdoored CLIP checkpoint .pt file> \
--end_checkpoint <path to TBAR CLIP checkpoint .pt file>\
--alpha <scaling coefficient value> \
--eval_data_type ImageNet1K \
--eval_test_data_dir <path to evaluation dataset> \
--csv_path <path to store results> \
--patch <type of backdoor trigger> 
--patch_location <location of the backdoor trigger> 
--add_backdoor \
--asr 
```