Language Models For PDDL Planning: Generating Sound and Programmatic Policies

Dillon Ze Chen; Johannes Zenn; Tristan Cinquin; Sheila A. McIlraith

Language Models For PDDL Planning: Generating Sound and Programmatic Policies

Dillon Ze Chen, Johannes Zenn, Tristan Cinquin, Sheila A. McIlraith

Published: 17 Jul 2025, Last Modified: 07 Oct 2025EWRL 2025 PosterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: planning, PDDL, language models, programs, value functions, policies

Abstract: We study the usage of language models (LMs) for planning over world models specified in the Planning Domain Definition Language (PDDL). We prompt LMs to generate Python programs that serve as generalised policies for solving PDDL problems from a given domain. Notably, our approach synthesises policies that are provably sound relative to the PDDL domain without reliance on external verifiers. We conduct experiments on competition benchmarks which show that our policies can solve more PDDL problems than PDDL planners and recent LM approaches within a fixed time and memory constraint. Our approach manifests in the LMPlan planner which can solve planning problems with several hundreds of relevant objects. Surprisingly, we observe that LMs used in our framework sometimes plan more effectively over PDDL problems written in meaningless symbols in place of natural language; e.g. rewriting `(at dog kitchen)` as `(p2 o1 o3)`. This finding challenges hypotheses that LMs reason over word semantics and memorise solutions from its training corpus, and is worth further exploration.

Confirmation: I understand that authors of each paper submitted to EWRL may be asked to review 2-3 other submissions to EWRL.

Serve As Reviewer: ~Dillon_Ze_Chen1

Track: Regular Track: unpublished work

Submission Number: 113

Loading