Supplementary Material
======================

> This archive accompanies our XLLM-Reason-Plan'25 submission, "_Are
> General-Purpose LLMs Ready for Planning? A Large-Scale Evaluation in
> PDDL_", and contains additional materials that may be useful.


File Hierarchy
--------------

This archive mainly contains the benchmarks we used and the results
obtained from the 20 LLMs we reviewed. The hierarchy is as follows:

- `README.md`: this file
- `Domain/`
   - `action_gen_raw.json`
   - `action_gen_score.csv`
- `Problem/`
   - `planetarium_test_raw.json`
   - `problem_gen_score.csv`
- `Plan/`
   - `plan_gen_raw.json`
   - `plan_gen_score.csv`
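
The files listed above can be loaded with the Python standard library alone. The following is a minimal sketch, not part of the archive: `load_archive` and `ARCHIVE_FILES` are hypothetical names introduced here for illustration, and nothing is assumed about the internal schema of the JSON files or the columns of the CSV files.

```python
import csv
import json
from pathlib import Path

# Relative paths exactly as listed in the hierarchy above.
ARCHIVE_FILES = [
    "Domain/action_gen_raw.json",
    "Domain/action_gen_score.csv",
    "Problem/planetarium_test_raw.json",
    "Problem/problem_gen_score.csv",
    "Plan/plan_gen_raw.json",
    "Plan/plan_gen_score.csv",
]


def load_archive(root):
    """Load every listed file found under `root` (hypothetical helper).

    JSON files are parsed with json.load; CSV files become a list of rows,
    each row a list of strings. Files that are missing are skipped.
    """
    loaded = {}
    for rel in ARCHIVE_FILES:
        path = Path(root) / rel
        if not path.is_file():
            continue
        if path.suffix == ".json":
            with path.open(encoding="utf-8") as f:
                loaded[rel] = json.load(f)
        else:
            with path.open(newline="", encoding="utf-8") as f:
                loaded[rel] = list(csv.reader(f))
    return loaded


if __name__ == "__main__":
    # Assumes the script is run from the archive's top-level directory.
    for rel, content in load_archive(".").items():
        print(f"{rel}: loaded as {type(content).__name__}")
```

Because the CSV rows are returned as plain lists of strings, the first row can be inspected to see the actual header before any column is interpreted.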
