# Reward-Agnostic Prompt Optimization for Text-to-Image Diffusion Models

### Description

The default search algorithm for RATTPO can be found in the class `LLMQueryHintSearch` of `search_algorithm.py`. Datasets and preprocessing codes are in `dataset` directory, for example `eval_prompt_lexica.jsonl`.

### Usage
An example code usage, for Promptist Reward on Lexica dataset, is shown below:
```
OLLAMA_URL="localhost:11434"
python main.py  --config configs/diffusion_configs/sd1.4.json \
                        configs/search_configs/rattpo.json \
                        configs/verifier_configs/prompt_adaptation/lexica_sd1.4_seed42.json \
                --dataset lexica \
                --seed 42 \
                --exp_name example_exp_name
```
For other experiment setups, compose three configs (diffusion, search, and verifier) as desired. 

Note that the (local) ollama server should be running.
We refer to [official ollama repo](https://github.com/ollama/ollama) for setting up LLM server.

### Running UniDet Evaluator
For UniDet evaluators, detectron and expert weights (for detph estimation and object detection) are required. Below commands would be sufficient, but we refer to [official T2I-Compbench repo](https://github.com/Karine-Huang/T2I-CompBench) for details. 
```
python -m pip install 'git+https://github.com/facebookresearch/detectron2.git' --user
mkdir -p UniDet_eval/experts/expert_weights
cd UniDet_eval/experts/expert_weights
wget https://huggingface.co/shikunl/prismer/resolve/main/expert_weights/Unified_learned_OCIM_RS200_6x%2B2x.pth
wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/annotator/ckpts/dpt_hybrid-midas-501f0c75.pt
pip install gdown
gdown https://docs.google.com/uc?id=1C4sgkirmgMumKXXiLOPmCKNTZAc3oVbq
```