2020 (modified: 05 Nov 2022)UAI 2020Readers: Everyone
Abstract:Human feedback is widely used to train agents in many domains. However, previous works rarely consider the uncertainty when humans provide feedback, especially in cases that the optimal actions are...