MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control

Nolan Wagener; Andrey Kolobov; Felipe Vieira Frujeri; Ricky Loynd; Ching-An Cheng; Matthew Hausknecht

MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control

Nolan Wagener, Andrey Kolobov, Felipe Vieira Frujeri, Ricky Loynd, Ching-An Cheng, Matthew Hausknecht

Published: 17 Sept 2022, Last Modified: 04 Aug 2025NeurIPS 2022 Datasets and Benchmarks Readers: Everyone

Keywords: mocap, motion capture, hierarchical reinforcement learning, humanoid control, task transfer, motion completion

Abstract: Simulated humanoids are an appealing research domain due to their physical capabilities. Nonetheless, they are also challenging to control, as a policy must drive an unstable, discontinuous, and high-dimensional physical system. One widely studied approach is to utilize motion capture (MoCap) data to teach the humanoid agent low-level skills (e.g., standing, walking, and running) that can then be re-used to synthesize high-level behaviors. However, even with MoCap data, controlling simulated humanoids remains very hard, as MoCap data offers only kinematic information. Finding physical control inputs to realize the demonstrated motions requires computationally intensive methods like reinforcement learning. Thus, despite the publicly available MoCap data, its utility has been limited to institutions with large-scale compute. In this work, we dramatically lower the barrier for productive research on this topic by training and releasing high-quality agents that can track over three hours of MoCap data for a simulated humanoid in the dm_control physics-based environment. We release MoCapAct (Motion Capture with Actions), a dataset of these expert agents and their rollouts, which contain proprioceptive observations and actions. We demonstrate the utility of MoCapAct by using it to train a single hierarchical policy capable of tracking the entire MoCap dataset within dm_control and show the learned low-level component can be re-used to efficiently learn downstream high-level tasks. Finally, we use MoCapAct to train an autoregressive GPT model and show that it can control a simulated humanoid to perform natural motion completion given a motion prompt. Videos of the results and links to the code and dataset are available at https://microsoft.github.io/MoCapAct.

Author Statement: Yes

TL;DR: We release a dataset of experts and their rollouts for tracking 3.5 hours of MoCap data in dm_control.

URL: https://microsoft.github.io/MoCapAct/

Dataset Url: https://microsoft.github.io/MoCapAct/

License: Dataset: CDLA Permissive 2.0 Code: MIT License

Supplementary Material: pdf

Contribution Process Agreement: Yes

In Person Attendance: Yes

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 2 code implementations](https://www.catalyzex.com/paper/mocapact-a-multi-task-dataset-for-simulated/code)

30 Replies

Loading