# MagiC: Evaluating Multimodal Cognition Toward Grounded Visual Reasoning

### Minimum Requirements
- NVIDIA GPU with at least 80GB of memory (A100/H100/H200/etc.)
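
As a quick sanity check before running anything, the sketch below reads the total memory of each visible GPU from `nvidia-smi` and reports whether any of them meets the requirement. It is not part of this repository: the function names and the `MIN_MEMORY_MIB` threshold are assumptions made for illustration. The threshold is set slightly below 80 × 1024 MiB because 80GB cards can report a somewhat smaller total.

```python
import shutil
import subprocess

# Assumed threshold (hypothetical, not from this repo): 80GB-class cards may
# report a little less than 80 * 1024 MiB, so a looser bound is used here.
MIN_MEMORY_MIB = 80_000


def parse_gpu_memory_mib(smi_output: str) -> list[int]:
    """Parse one integer (MiB) per line, skipping blank or non-numeric lines."""
    values = []
    for line in smi_output.splitlines():
        token = line.strip()
        if token.isdigit():
            values.append(int(token))
    return values


def has_large_enough_gpu(memory_mib: list[int], minimum: int = MIN_MEMORY_MIB) -> bool:
    """Return True if at least one GPU reports `minimum` MiB or more."""
    return any(m >= minimum for m in memory_mib)


def query_gpu_memory_mib() -> list[int]:
    """Ask nvidia-smi for total memory per GPU; return [] if it is unavailable."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.total", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return []
    return parse_gpu_memory_mib(result.stdout)


if __name__ == "__main__":
    totals = query_gpu_memory_mib()
    print("GPU memory (MiB):", totals)
    print("Meets requirement:", has_large_enough_gpu(totals))
```

If the script reports that the requirement is not met, check that the NVIDIA driver is installed and that the intended GPU is visible to the current user.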

### Image Pre-Processing
Download the full GQA dataset and run [this script](./src/util/process_image.py) to populate all necessary images.


Please refer to each subfolder for more information about the specific module. Instructions for setting up the environment can be found [here](./src/eval/README.md#pre-requisites---install-required-packages).
- [Eval](./src/eval/README.md)
- [Inference](./src/inference/README.md)