Learning General Policies for Planning through GPT Models

Published: 12 Feb 2024, Last Modified: 06 Mar 2024ICAPS 2024EveryoneRevisionsBibTeXCC BY 4.0
Keywords: Deep Learning for Planning, GPT, General Policies, Transformers for planning
TL;DR: The paper proposes an approach based on GPT and a system implementing it to learn general policies for automated planning.
Abstract: Transformer-based architectures, such as T5, BERT and GPT, have shown revolutionary capabilities in Natural Language Processing. Several studies showed that deep learning models using these architectures not only possess a remarkable linguistic knowledge, but they also exhibit forms of factual knowledge, common sense, and even programming skills. However, the scientific community still debates about their reasoning capabilities, which have been recently tested in the context of automated AI planning; the literature presents mixed results, and the prevailing view is that current transformer-based models may not be adequate for planning. In this paper, we addresses this challenge differently. We introduce a GPT-based model customised for planning (PlanGPT) to learn a general policy for classical planning by training the model from scratch with a dataset of solved planning instances. Once PlanGPT has been trained for a domain, it can be used to generate a solution plan for an input problem instance in that domain. Our training procedure exploits automated planning knowledge to enhance the performance of the trained model. We build and evaluate our GPT model with several planning domains, and we compare its performance w.r.t. other recent deep learning techniques for generalised planning, demonstrating the effectiveness of the proposed approach.
Primary Keywords: Learning
Category: Long
Student: Graduate
Supplemtary Material: pdf
Submission Number: 367
Loading