Learning Human Objectives by Evaluating Hypothetical BehaviorDownload PDFOpen Website

Published: 2020, Last Modified: 12 May 2023ICML 2020Readers: Everyone
Abstract: We seek to align agent behavior with a user’s objectives in a reinforcement learning setting with unknown dynamics, an unknown reward function, and unknown unsafe states. The user knows the rewards...
0 Replies

Loading