
# Image Generation

Below is a guide on how to use the command, along with detailed explanations of each parameter for our image generation code.

## **Command Usage**

Execute the following command in your terminal to image generation:

```bash
python scaled_img_generation.py --csv-path ./captions_and_hn_3M.csv \
                                --save-folder ./images \
                                --tti-name SANA \
                                --tti-dtype bfloat16 \
                                --task cap \
                                --batch-size 16 \
                                --dp-size 8 \
                                --save-size 256 \
                                --exist-ok
```

## **Parameters Explained**

- `--csv-path`: Path of the captions' csv

- `--save-folder`: Folder where the generated images will be saved.

- `--tti-name`: The selected model between "SD15", "SD2", "SANA", "SDXLT_16b". A subfolder with such name will be created under `--save-folder`

- `--tti-dtype`: "float16" can be chosen for every model. "bfloat16" can be chosen for SANA

- `--task`: "cap" for the generating the images from the base captions, "hn" for generating the images from the hard negative captions. A subfolder with such name will be created under `--save-folder`/`--tti-name`.

- `--batch-size`: Per-gpu batch size

- `--dp-size`: Number of gpu for data parallelization.

- `--save-size`: Resolution of the stored image.

- `--exist-ok`: If set, filters out captions which images are already present in the correct folder