# DialectGen: Benchmarking & Improving Dialect Robustness in Multimodal Generation  
*Anonymous code release for ICLR 2026*

DialectGen provides a **reproducible** benchmark and mitigation pipeline for assessing how well multimodal generative models handle English dialectal variation.

---

## 🚀 Quick‑start (TL;DR)

```bash
# 1. Enter the repository root
cd DialectGen

# 2. Create & activate the conda environment (Python 3.10)
mamba env create -f environment.yml   # or: conda env create -f environment.yml
conda activate dialect                # env name is set inside the .yml

# 3. Multimodal generation (replace {model_name} with a script in that folder)
python src/benchmark/img_generation/{model_name}.py --dialects aae bre che sge ine --mode concise

# 4. Evaluate generated artifacts
python src/benchmark/basic_eval/vqa_understanding.py

# 5. Aggregate results
python src/benchmark/basic_eval/aggragate_model_scores.py
```

All logs, metrics, and artifacts are written under `out/` with deterministic timestamps.
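
The generation step can also be scripted, for example to queue several models in a row. The following is a minimal sketch and not part of the release: `build_generation_command` is a hypothetical helper, and `"my_model"` is a placeholder for whichever script actually exists under `src/benchmark/img_generation/`. Only the script folder and the `--dialects` and `--mode` flags are taken from the quick-start above.

```python
import sys
from pathlib import Path

# Dialect codes as they appear in the quick-start command.
DIALECTS = ["aae", "bre", "che", "sge", "ine"]


def build_generation_command(model_name, dialects=DIALECTS, mode="concise"):
    """Return the argv list for one generation run (hypothetical helper)."""
    script = Path("src/benchmark/img_generation") / f"{model_name}.py"
    return [sys.executable, str(script), "--dialects", *dialects, "--mode", mode]


# "my_model" is a placeholder, not a script shipped with the repository.
cmd = build_generation_command("my_model")
print(" ".join(cmd))

# To launch the run from the repository root, uncomment:
# import subprocess
# subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids shell quoting issues when it is passed to `subprocess.run`.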



## Repository layout

```
DialectGen/
├── src/                # source code
│   ├── benchmark/      # generation and evaluation scripts
│   └── mitigation/     # mitigation methods
├── data/               # benchmark prompts & metadata
├── environment.yml     # conda environment definition
├── out/                # folder to store outputs
└── README.md           # you are here
```



---

## Dataset

The benchmark contains prompts across six dialects:

* African American English (AAE)  
* Singaporean English (SgE)  
* British English (BrE)  
* Chicano English (ChE)  
* Indian English (InE)  
* Standard American English (SAE)  

CSV files live in `data/text/{mode}/{dialect}.csv` with columns  
`Dialect_Word,SAE_Word,Dialect_Prompt,SAE_Prompt,person_in_prompt,dialect_word_count,polysemic,Polysemy_Prompt`.
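
As a rough illustration of how these files might be consumed, the sketch below reads one CSV with Python's standard `csv` module and returns paired dialect and SAE prompts. The helper names `dialect_csv` and `load_prompt_pairs` are hypothetical and not part of the repository. It also assumes that the files carry a header row with the column names listed above, and that `{dialect}` uses the lowercase codes from the quick-start command (for example `bre`).

```python
import csv
from pathlib import Path

# Column names copied from the README; the helpers below are assumptions.
EXPECTED_COLUMNS = [
    "Dialect_Word", "SAE_Word", "Dialect_Prompt", "SAE_Prompt",
    "person_in_prompt", "dialect_word_count", "polysemic", "Polysemy_Prompt",
]


def dialect_csv(mode, dialect, root="data/text"):
    """Build the path data/text/{mode}/{dialect}.csv (hypothetical helper)."""
    return Path(root) / mode / f"{dialect}.csv"


def load_prompt_pairs(csv_path):
    """Read one dialect CSV and return (dialect_prompt, sae_prompt) tuples."""
    with open(csv_path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        # Fail early if the header does not match the documented columns.
        missing = set(EXPECTED_COLUMNS) - set(reader.fieldnames or [])
        if missing:
            raise ValueError(f"{csv_path}: missing columns {sorted(missing)}")
        return [(row["Dialect_Prompt"], row["SAE_Prompt"]) for row in reader]
```

From the repository root, a call such as `load_prompt_pairs(dialect_csv("concise", "bre"))` would then yield one pair per benchmark row, assuming that file exists under that name.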



---

## License

Released under the MIT License (see `LICENSE`).

---

## Acknowledgements

All third‑party model checkpoints remain under their original licenses.  
This repository distributes only *dialect prompt data* and evaluation code.

*For questions during review, please use the ICLR 2026 anonymous comment system.*
