Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from ObservationsDownload PDFOpen Website

2019 (modified: 11 Nov 2022)ICML 2019Readers: Everyone
Abstract: A critical flaw of existing inverse reinforcement learning (IRL) methods is their inability to significantly outperform the demonstrator. This is because IRL typically seeks a reward function that ...
0 Replies

Loading