Align-RUDDER: Learning From Few Demonstrations by Reward RedistributionDownload PDFOpen Website

2022 (modified: 24 Apr 2023)ICML 2022Readers: Everyone
Abstract: Reinforcement learning algorithms require many samples when solving complex hierarchical tasks with sparse and delayed rewards. For such complex tasks, the recently proposed RUDDER uses reward redi...
0 Replies

Loading