### Code for benchmark construction:
- Diagram cleaning
- Diagram tagging
- Diagram annotation
  - Triple
  - QA
- Benchmark Running

### Code for benchmark check:
- Diagram consistency checking

### Code for experiments:
- Shortcut experiments

Code snippet for running the benchmark: 
```
cd benchmark_construction/benchmark
python run.py \
    --bench_dir BENCH_DIR \
    --lvlm_path llava-hf/llava-v1.6-34b-hf \
    --modality real
```

