Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings

Rushang Karia; Pulkit Verma; Alberto Speranzon; Siddharth Srivastava

Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings

Rushang Karia, Pulkit Verma, Alberto Speranzon, Siddharth Srivastava

Published: 12 Feb 2024, Last Modified: 06 Mar 2024ICAPS 2024EveryoneRevisionsBibTeXCC BY 4.0

Keywords: Learning, Model-learning, Reinforcement Learning, Relational Reinforcement Learning, Symbolic Models, Planning and Learning, Non-stationarity

TL;DR: We propose an approach that integrates intelligent data gathering, planning and learning for efficient symbolic RL

Abstract: This paper introduces a new approach for continual planning and model learning in non-stationary stochastic environments expressed using relational representations. Such capabilities are essential for the deployment of sequential decision-making systems in the uncertain, constantly evolving real world. Working in such practical settings with unknown (and non-stationary) transition systems and changing tasks, the proposed framework models gaps in the agent's current state of knowledge and uses them to conduct focused, investigative explorations. Data collected using these explorations is used for learning generalizable probabilistic models for solving the current task despite continual changes in the environment dynamics. Empirical evaluations on several benchmark domains show that this approach significantly outperforms planning and RL baselines in terms of sample complexity in non-stationary settings. Theoretical results show that the system reverts to exhibit desirable convergence properties when stationarity holds.

Primary Keywords: Learning, Knowledge Representation/Engineering

Category: Long

Student: Graduate

Supplemtary Material: zip

Submission Number: 153

Loading