Adapting Humanoid Locomotion over Challenging Terrain via Two-Phase Training

Wenhao Cui; Shengtao Li; Huaxing Huang; Bangyu Qin; Tianchu Zhang; hanjinchao; Liang Zheng; Ziyang Tang; Chenxu Hu; NING Yan; Jiahao Chen; Zheyuan Jiang

Adapting Humanoid Locomotion over Challenging Terrain via Two-Phase Training

Wenhao Cui, Shengtao Li, Huaxing Huang, Bangyu Qin, Tianchu Zhang, hanjinchao, Liang Zheng, Ziyang Tang, Chenxu Hu, NING Yan, Jiahao Chen, Zheyuan Jiang

Published: 05 Sept 2024, Last Modified: 08 Nov 2024CoRL 2024EveryoneRevisionsBibTeXCC BY 4.0

Keywords: humanoid robots, locomotion, reinforcement learning, curriculum, sim-to-real

TL;DR: A training framework that leverages a 2 phase reinforcement learning and command curriculum learning, also refining DreamWaQ for mitigating joint oscillations, culminating in the successful sim-to-real transfer.

Abstract: Humanoid robots are a key focus in robotics, with their capacity to navigate tough terrains being essential for many uses. While strides have been made, creating adaptable locomotion for complex environments is still tough. Recent progress in learning-based systems offers hope for robust legged locomotion, but challenges persist, such as tracking accuracy at high speeds and on uneven ground, and joint oscillations in actual robots. This paper proposes a novel training framework to address these challenges by employing a two-phase training paradigm with reinforcement learning. The proposed framework is further enhanced through the integration of command curriculum learning, refining the precision and adaptability of our approach. Additionally, we adapt DreamWaQ to our humanoid locomotion system and improve it to mitigate joint oscillations. Finally, we achieve the sim-to-real transfer of our method. A series of empirical results demonstrate the superior performance of our proposed method compared to state-of-the-art methods.

Spotlight Video: mp4

Website: https://sites.google.com/view/adapting-humanoid-locomotion/two-phase-training

Publication Agreement: pdf

Student Paper: no

Supplementary Material: zip

Submission Number: 661

Loading