# Data Cleaning 
## Codebase
- `part1.ipynb`: handles data cleaning of ttcw, creativity_index, creative_short_story, dat, and aut
- `part2.ipynb`: handles data cleaning of neocoder, cs4, creative_math, 
- `part3.ipynb`: handels data cleaning of ttct
- `raw`: contains raw data (all publicly available)
- `processed`: cleaned data for each task

## Data Access:
- [CS4](https://github.com/anirudhlakkaraju/cs4_benchmark)
- [NeoCoder](https://github.com/JHU-CLSP/NeoCoder)
- [CreativeMath](https://github.com/JunyiYe/CreativeMath)
- [CreativityIndex](https://github.com/GXimingLu/creativity_index)
- [DAT](https://github.com/DingNLab/probing_creativity)
- [TTCW](https://github.com/salesforce/creativity_eval)
- [CreativeShortStory](https://github.com/mismayil/creative-story-gen)