Action Policy Explanations in Oversubscription PlanningDownload PDF

Published: 01 May 2023, Last Modified: 21 Jul 2023HAXP 2023Readers: Everyone
Keywords: policy, explanations, OSP
Abstract: Oversubscription planning (OSP) is a planning formalism that can deal with planning scenarios where not all goals can be achieved. If the global optimization goal is not fixed, an iterative process in which users refine their preferences based on the sample plans is suitable. To help the users, the planning systems should be able to provide answers to questions such as ``Why is goal $g$ not satisfied by the sample plan?'' In this paper, we focus on explaining the behavior of a given black-box policy under the aforementioned planning scenario. That is, we assume that sample plans are provided by a state-dependent deterministic policy $\phi$ (in our case a neural network policy), and we try to automatically answer the question ``Why is the goal $g$ not satisfied by $\phi$?'' by providing information how much the policy $\phi$ would need to diverge from its decision in order to satisfy the goal $g$. Moreover, we also provide information in which state and how the policy should diverge. To this end, we extended action policies based on action schema networks to support OSP tasks and design an algorithm that is able to provide the desired explanations. We evaluate the performance of the proposed algorithm and provide a case study with sample explanations.
0 Replies

Loading