Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

Yuanpei Chen; Tianhao Wu; Shengjie Wang; Xidong Feng; Jiechuan Jiang; Zongqing Lu; Stephen Marcus McAleer; Hao Dong; Song-Chun Zhu; Yaodong Yang

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

Yuanpei Chen, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuan Jiang, Zongqing Lu, Stephen Marcus McAleer, Hao Dong, Song-Chun Zhu, Yaodong Yang

Published: 17 Sept 2022, Last Modified: 08 Feb 2026NeurIPS 2022 Datasets and Benchmarks Readers: Everyone

Keywords: Multi-Agent RL, Robotics, Reinforcement Learning

Abstract: Achieving human-level dexterity is an important open problem in robotics. However, tasks of dexterous hand manipulation even at the baby level are challenging to solve through reinforcement learning (RL). The difficulty lies in the high degrees of freedom and the required cooperation among heterogeneous agents (e.g., joints of fingers). In this study, we propose the Bimanual Dexterous Hands Benchmark (Bi-DexHands), a simulator that involves two dexterous hands with tens of bimanual manipulation tasks and thousands of target objects. Tasks in Bi-DexHands are first designed to match human-level motor skills according to literature in cognitive science, and then are built in Issac Gym; this enables highly efficient RL trainings, reaching 30,000+ FPS by only one single NVIDIA RTX 3090. We provide a comprehensive benchmark for popular RL algorithms under different settings; this includes multi-agent RL, offline RL, multi-task RL, and meta RL. Our results show that PPO type on-policy algorithms can learn to solve simple manipulation tasks that are equivalent up to 48-month human baby (e.g., catching a flying object, opening a bottle), while multi-agent RL can further help to learn manipulations that require skilled bimanual cooperation (e.g., lifting a pot, stacking blocks). Despite the success on each individual task, when it comes to mastering multiple manipulation skills, existing RL algorithms fail to work in most of the multi-task and the few-shot learning tasks, which calls for more future development from the RL community. Our project is open-sourced at https://github.com/PKU-MARL/DexterousHands.

Author Statement: Yes

TL;DR: We propose a bimanual dexterous manipulation benchmark according to literature from cognitive science for comprehensive reinforcement learning research.

URL: https://github.com/PKU-MARL/DexterousHands

Dataset Url: https://github.com/PKU-MARL/DexterousHands You can go to our github repository page to access our dataset and benchmark. For specific steps, please refer to the README.md in our repository.

License: All dataset or code are under a Apache License 2.0.

Supplementary Material: zip

Contribution Process Agreement: Yes

In Person Attendance: Yes

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 2 code implementations](https://www.catalyzex.com/paper/towards-human-level-bimanual-dexterous/code)

31 Replies

Loading