SAGE: Generating Symbolic Goals for Myopic Models in Deep Reinforcement Learning

Andrew Chester, Michael Dann, Fabio Zambetta, John Thangarajah

Published: 01 Jan 2023, Last Modified: 21 Oct 2024AI (2) 2023EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Model-based reinforcement learning algorithms are typically more sample efficient than their model-free counterparts, especially in sparse reward problems. Unfortunately, many interesting domains are too complex to specify complete models, and learning a model takes a large number of environment samples. If we could specify an incomplete model and allow the agent to learn how best to use it, we could take advantage of our partial understanding of many domains. In this work we propose SAGE, an algorithm combining learning and planning to exploit a previously unusable class of incomplete models. This combines the strengths of symbolic planning and neural learning approaches in a novel way that outperforms competing methods on variations of taxi world and Minecraft.