Scaling Robot-Learning by Crowdsourcing Simulation Environments

Published: 26 Jun 2024, Last Modified: 09 Jul 2024DGR@RSS2024 PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Real-to-Sim-to-Real, Continual Data Collection, Scaling-up
Abstract: Scaling robot learning requires data collection pipelines that scale favorably with human effort to ensure a sufficient diversity and quality of expert data. In this work, we propose *Scaling, Crowdsourcing and Amortizing Real-to-Sim-to-Real* **SCAR**, a pipeline for scaling up data collection and learning generalist policies where human effort scales sublinearly with the number of environments where data is collected. The key idea is to crowdsource digital twins of real-world scenes using 3D reconstruction techniques and collect large-scale data in these simulation scenes, rather than in the real-world. Data collection in simulation is initially driven by reinforcement learning bootstrapped with human demonstrations. However, as the training of a generalist policy progresses across environments, the generalization capabilities of the learned generalist policy can be used to replace human effort with model generated demonstrations. This results in a pipeline where environments are easily sourced from non-experts through 3D capture, while behavioral data is collected with continually reducing amounts of human effort. We analyze the zero-shot and few-shot scaling laws of **SCAR** on two real-world tasks: *placing mugs/bowls/cups into a sink* and *placing boxes on a shelf* across a diverse range of environments. We also demonstrate the capabilities of the **SCAR** pipeline to finetune trained policies in a target scenario using a novel unsupervised fine-tuning technique that can improve behavior simply using 3D environments scans at test time, without requiring additional human demonstrations.
Submission Number: 28
Loading