Dueling Posterior Sampling for Preference-Based Reinforcement Learning

Published: 01 Jan 2020, Last Modified: 12 May 2023, Venue: UAI 2020, Readers: Everyone
Abstract: In preference-based reinforcement learning (RL), an agent interacts with the environment while receiving preferences instead of absolute feedback. While there is increasing research activity in pre...