Supplemental#
This is the page documenting and showcasing various demonstration videos of the ManiSkill3 system. Best viewed by opening the supplemental.html file in a web browser such as Chrome, otherwise one can look through the videos folder for the individual videos and follow this document for descriptions of each video.
Parallel Rendering#
Heterogeneous Simulation#
Fast Visual Training Speed#
Vision-Based Zero-shot Sim2Real Manipulation#
We demonstrate some zero-shot sim2real manipulation results using the low-cost $300 Koch v1.1. robot arm and 🤗 LeRobot code for robot hardware interface/control. Policy is trained with PPO on RGB camera inputs and robot proprioceptive data for about an hour on a single 4090 GPU on a domain randomized simulation environment.
Real World Uncut Evaluation#
Interestingly there are some untrained behaviors such as being able to pick up non cube-shaped objects, although we do not claim this kind of generalization always works.
Reset Distributions#
Real2Sim Evaluation Environments#
We port over some of the Real2Sim evaluation environments from the SIMPLER project. The videos below show 4 different vision language action (VLA) models being evaluated on 4 different tasks (videos are originally from SIMPLER). These videos are subsets of the 128 environments that are being simulated and rendered in parallel to evaluate VLAs.
Teleoperation#
We provide a VR based teleoperation system for various robot configurations that is integrated with ManiSkill3, enabling low-latency teleoperation with 4K stereo video streaming to the user at 60 Hz. The video shows a teleoperator controlling two dextrous 5-fingered hands to pick up an object from the YCB dataset in simulation via the Meta Quest 3.