Evaluating Language Models Planning Capabilities on Goal Ordering Challenges

Eran Hirsch; Guy Uziel; Ateret Anaby Tavor

Evaluating Language Models Planning Capabilities on Goal Ordering Challenges

Eran Hirsch, Guy Uziel, Ateret Anaby Tavor

Published: 10 Oct 2024, Last Modified: 25 Dec 2024NeurIPS'24 Compositional Learning Workshop PosterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: planning, goal, llm

TL;DR: We examine LLMs' ability to handle goal ordering in planning, revealing struggles with reasonable orderings.

Abstract: Planning involves the composition of primitive actions to achieve specific goals within a given environment. Classical planning research has well-established different types of goal-ordering challenges which have implications on the planning heuristics. In this study, we investigate the performance of Large Language Models (LLMs) in identifying if an order between two goals hold. We distinguish between three types of goal orderings challenges: reasonable, necessary, and optimal. Our findings reveal that LLMs predominantly struggle with reasonable goal ordering tasks compared to necessary and optimal goal orderings. Advancing this area could lead to improvements in the planning abilities of LLMs.

Submission Number: 15

Loading