# Structured and Abstractive Reasoning on Multi-modal Relational Knowledge Images

## Model Training and Evaluation
- First, you should install `LLaMA-Factory` and `vLLM` in your python environment.
- Second, you need to download the MLLMs used in the experiments including Qwen2.5-VL-3B/7B/32B, LLaVA-1.5-7B, and LLaVA-NEXT-34B
- Next, run `bash train.sh` to fine-tune MLLMs with `LLaMA-Factory`.
- Finally, use vLLM to conduct inference on the trained MLLMs to obtain the results and calculate the metrics.