Learning Generalised Policies for Numeric Planning

Ryan Xiao Wang; Sylvie Thiebaux

Learning Generalised Policies for Numeric Planning

Ryan Xiao Wang, Sylvie Thiebaux

Published: 12 Feb 2024, Last Modified: 06 Mar 2024ICAPS 2024EveryoneRevisionsBibTeXCC BY 4.0

Keywords: Generalised Policies, Action Schema Networks, Numeric Planning

TL;DR: We propose a neural network architecture for learning generalised policies for numeric planning.

Abstract: We extend Action Schema Networks (ASNets) to learn generalised policies for numeric planning, which features quantitative numeric state variables, preconditions and effects. We propose a neural network architecture that can reason about the numeric variables both directly and in context of other variables. We also develop a dynamic exploration algorithm for more efficient training, by better balancing the exploration versus learning tradeoff to account for the greater computational demand of numeric teacher planners. Experimentally, we find that the learned generalised policies are capable of outperforming traditional numeric planners on some domains, and the dynamic exploration algorithm to be on average much faster at learning effective generalised policies than the original ASNets training algorithm.

Primary Keywords: Learning

Category: Long

Student: Undergraduate

Submission Number: 227

Loading