Learning Robust Reward Machines from Noisy Labels

Published: 01 Jan 2024, Last Modified: 13 May 2025KR 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: This paper presents PROB-IRM, an approach that learns robust reward machines (RMs) for reinforcement learning (RL) agents from noisy execution traces. The key aspect of RM-driven RL is the explo...
Loading