PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning

2021 (modified: 25 Apr 2023), ICML 2021, Readers: Everyone
Abstract: We study reinforcement learning (RL) with no-reward demonstrations, a setting in which an RL agent has access to additional data from the interaction of other agents with the same environment. Howe...