Probabilistic Counterexample Guidance for Safer Reinforcement Learning

Published: 01 Jan 2023, Last Modified: 15 May 2025QEST 2023EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Safe exploration aims at addressing the limitations of Reinforcement Learning (RL) in safety-critical scenarios, where failures during trial-and-error learning may incur high costs. Several methods exist to incorporate external knowledge or to use proximal sensor data to limit the exploration of unsafe states. However, reducing exploration risks in unknown environments, where an agent must discover safety threats during exploration, remains challenging.
Loading