- Abstract: Learning to control a robot by directly applying model-free Reinforcement Learning (RL) is prone to failure due to extreme sample inefficiency. We propose to address this issue with several techniques that improve sample complexity. In simulation we employ reward shaping, multi-task learning, and apprenticeship learning. To transfer the learned policy to the real robot, we use domain randomization techniques to improve the policy's robustness. In subsequent phases we plan to use learned domain randomization to target performance on the real system rather than robustness alone.
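The domain randomization step can be illustrated with a minimal sketch: before each simulated episode, physics parameters are resampled so the policy cannot overfit to one simulator configuration. The parameter names (`mass`, `friction`, `motor_gain`) and their ranges below are illustrative assumptions, not values from this work.

```python
import random

def sample_dynamics(rng):
    """Draw randomized physics parameters for one simulated episode.

    Parameter names and ranges are illustrative assumptions; in practice
    they would match the tunable quantities of the chosen simulator.
    """
    return {
        "mass": rng.uniform(0.8, 1.2),        # +/-20% around nominal link mass
        "friction": rng.uniform(0.5, 1.5),    # wide friction coefficient range
        "motor_gain": rng.uniform(0.9, 1.1),  # small actuator-gain jitter
    }

# Resample dynamics at the start of each training episode.
rng = random.Random(0)
params = [sample_dynamics(rng) for _ in range(3)]
```

Training across many such draws encourages a policy that is robust to the simulator-to-reality gap; the "learned" variant mentioned above would instead adapt these sampling distributions based on real-system feedback.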