Published: 2020, Last Modified: 12 May 2023ICML 2020Readers: Everyone
Abstract:We seek to align agent behavior with a user’s objectives in a reinforcement learning setting with unknown dynamics, an unknown reward function, and unknown unsafe states. The user knows the rewards...