MotionGlot: A Multi-Embodied Motion Generation Model

Published: 10 Nov 2024, Last Modified: 10 Nov 2024 · CoRL X-Embodiment Workshop 2024 Poster · CC BY 4.0
Keywords: motion generation, multi-task learning
TL;DR: Techniques from multilingual LLMs can be adapted to train a GPT for motion generation across multiple embodiments with different action dimensions.
Abstract: This paper introduces MotionGlot, a model that can generate motion across multiple embodiments with different action dimensions, such as quadruped robots and human bodies. Building on training procedures well established for large language models (LLMs), we introduce an instruction-tuning template designed specifically for motion-related tasks. Our approach shows that the principles underlying LLM training can be successfully adapted to learn a wide range of motion generation tasks across multiple embodiments with different action dimensions. We evaluate MotionGlot on a set of six tasks and report an average improvement of 35.3% across tasks. Additionally, we contribute two new datasets: (1) a dataset of expert-controlled quadruped locomotion with approximately 48,000 trajectories paired with direction-based text annotations, and (2) a dataset of over 23,000 situational text prompts for human motion generation tasks. Finally, we conduct hardware experiments to validate the capabilities of our system in real-world applications.
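The abstract does not spell out the instruction-tuning template, but the analogy to multilingual LLM training suggests a serialization along these lines: each training example pairs a text instruction with an embodiment tag and a sequence of discretized motion tokens drawn from an embodiment-specific vocabulary. The sketch below is a minimal, hypothetical illustration; the special tokens, field names, and `MotionSample` structure are assumptions for exposition, not the paper's actual template.

```python
# Hypothetical sketch of an instruction-tuning template for motion tasks.
# The special tokens, field names, and embodiment tags below are illustrative
# assumptions; the paper's actual template may differ.

from dataclasses import dataclass
from typing import List


@dataclass
class MotionSample:
    instruction: str          # natural-language prompt, e.g. "walk forward, then turn left"
    embodiment: str           # e.g. "human" or "quadruped"
    motion_tokens: List[int]  # discretized motion sequence (e.g. from a motion tokenizer)


def format_sample(sample: MotionSample) -> str:
    """Serialize one sample into a single training string for a GPT-style model.

    Motion tokens are rendered as embodiment-specific vocabulary entries so a
    shared decoder can generate actions of different dimensionality, analogous
    to how multilingual LLMs share one vocabulary across languages.
    """
    motion_str = " ".join(f"<{sample.embodiment}_{tok}>" for tok in sample.motion_tokens)
    return (
        f"<|task|> {sample.instruction} "
        f"<|embodiment|> {sample.embodiment} "
        f"<|motion|> {motion_str} <|end|>"
    )


if __name__ == "__main__":
    example = MotionSample(
        instruction="trot forward and veer right",
        embodiment="quadruped",
        motion_tokens=[12, 87, 87, 305, 44],
    )
    print(format_sample(example))
```

Under this reading, swapping the embodiment tag (and its token vocabulary) is all that changes between quadruped and human samples, which is what lets one autoregressive model serve both action spaces.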
Previous Publication: No
Submission Number: 29