Probabilistic Counterexample Guidance for Safer Reinforcement Learning

Xiaotong Ji, Antonio Filieri

Published: 2023, Last Modified: 20 Jan 2026QEST 2023EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Safe exploration aims at addressing the limitations of Reinforcement Learning (RL) in safety-critical scenarios, where failures during trial-and-error learning may incur high costs. Several methods exist to incorporate external knowledge or to use proximal sensor data to limit the exploration of unsafe states. However, reducing exploration risks in unknown environments, where an agent must discover safety threats during exploration, remains challenging.