# Code_Banner

To run this code, please download the MIMO dataset. Place the logos in poster_logos, put the initial prompts in poster_data_prompt, and put the target images in poster_data_renamed. Then configure the settings in the config file.

## Prerequisites

- Python 3.10+
- Config files under `config/`: `config.ini`, `config_llm.ini` (API keys and endpoints)
- Data under project root: `poster_data_prompt/` (or `poster_logos/` with `*_prompt.txt`), `poster_data_renamed/` (ground-truth images)

## Running Experiments

### 1. Configure

Edit `poster_qa_experiments/poster_experiment_config.json`:

```json
{
  "modes": ["no_user", "Naive_Agent_Fixed_Binary", "Naive_Agent_Flexible", "MPQC_Adaptive"],
  "num_samples": 2,
  "start_index": 1,
  "max_qa_cycles": 12,
  "max_questions_per_batch": 1,
  "model_version": "gemini25",
  "question_agent_model_version": "gpt52", // or gemini25
  "answer_agent_model_version": "gemini25"
}
```

- **modes**: `"all"` or a list of mode IDs (e.g. `no_user`, `Naive_Agent_Fixed_Binary`, `Naive_Agent_Fixed_MultiChoice`, `Naive_Agent_Fixed_OpenText`, `Naive_Agent_Free_Ask`, `Naive_Agent_Flexible`, `MPQC_Adaptive`, `RAG_Flexible`).
- **num_samples** / **start_index**: which samples to run.
- **max_qa_cycles** / **max_questions_per_batch**: QA loop limits.

### 2. Run all configured modes

From the **Code_Banner** directory:

```bash
cd poster_qa_experiments
python test_all_modes.py --config-file poster_experiment_config.json
```

Optional overrides:

```bash
python test_all_modes.py --config-file poster_experiment_config.json --num-samples 1 --max-qa-cycles 2
```

### 3. Output layout

Per-run output is under the directory passed as `--output-dir` (or the one chosen by `test_all_modes.py`, e.g. `gpt_gm_results_final_12_1/test_<mode>/`). For each sample you get:

- `{sample_id}_{brand}/`: folder for that sample  
  - `poster.png`: generated banner  
  - `ground_truth.png`: reference image  
  - `plan_history.json`: design plan over cycles  
  - `qa_conversation.json`: Q&A log  
  - `judge_evaluation.json`: similarity scores (if judge ran)

Batch run also produces `batch_results.json` in the same output directory.

### Judge scores

Similarity is scored 1–5 on: Overall_Color, Layout_Composition, Button_Style, Image_Content, Text_Content. Check `judge_evaluation.json` in each sample folder or the aggregated stats in `batch_results.json`.

## Structure

```
Code_Banner/
  config/                 # config.ini, config_llm.ini
  poster_qa_experiments/
    poster_experiment_config.json
    test_all_modes.py     # run all modes
    src/
      poster_experiments.py   # single-run entry
      poster_qa_manager.py
      poster_adaptive_qa.py
      poster_config.py
      poster_data_loader.py
      poster_judge_evaluator.py
      poster_rag_agent.py
    prompts/              # paper-aligned prompt templates
    tools/
      analyze_poster_results.py
  poster_data_prompt/     # or poster_logos/ with *_prompt.txt
  poster_data_renamed/    # ground-truth images
  poster_logos/           # optional, for logo-based setup
```
