Keywords: Model-based Reinforcement Learning, Manipulation, Generalization
TL;DR: We present a framework for generalizable robotic insertion with visual world models, and demonstrate that our approach enables zero-shot generalization to unseen objects by training a single world model on up to 90 insertion tasks.
Abstract: Robotic assembly in high-mix settings requires adaptable systems that can handle diverse parts, yet current approaches typically rely on policies specialized to each insertion task. Although such specialization can achieve high success rates, it makes deploying systems for new problems tedious and time-consuming. We present a framework for generalizable insertion using world models that combine robot proprioceptive information with raw visual observations captured by a wrist-mounted camera. Our model-based approach trains a single world model on up to 90 insertion tasks with geometrically diverse parts, achieving 56% zero-shot success on unseen objects with unknown geometry, compared to just 7% for a model-free baseline. Importantly, performance improves as more objects are included in the training dataset, demonstrating strong scalability. Lastly, finetuning the generalist model on held-out objects significantly improves data efficiency compared to training from scratch and, in some cases, achieves better asymptotic performance. To our knowledge, this is the first system capable of assembling unseen objects in an entirely data-driven manner, and it thus represents a significant step toward scalable, generalizable robotic assembly systems.
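To make the described architecture concrete, below is a minimal PyTorch sketch of a latent world model that fuses a wrist-camera image with robot proprioception and rolls the latent state forward under candidate actions. This is not the authors' implementation: all module names, dimensions (e.g., 64x64 images, 14-dimensional proprioception, 7-dimensional actions), and the recurrent latent-dynamics structure are illustrative assumptions.

```python
import torch
import torch.nn as nn


class VisualWorldModel(nn.Module):
    """Hypothetical world model: encodes (image, proprioception) into a latent
    state and predicts future latents and rewards from actions."""

    def __init__(self, proprio_dim=14, action_dim=7, latent_dim=256):
        super().__init__()
        # Convolutional encoder for 64x64 RGB wrist-camera frames (assumed resolution).
        self.image_encoder = nn.Sequential(
            nn.Conv2d(3, 32, 4, stride=2), nn.ReLU(),
            nn.Conv2d(32, 64, 4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 128, 4, stride=2), nn.ReLU(),
            nn.Flatten(),
        )
        self.proprio_encoder = nn.Sequential(nn.Linear(proprio_dim, 64), nn.ReLU())
        # Fuse visual and proprioceptive features into a single latent state.
        self.fuse = nn.Linear(128 * 6 * 6 + 64, latent_dim)
        # Recurrent latent dynamics: predict the next latent from (latent, action).
        self.dynamics = nn.GRUCell(action_dim, latent_dim)
        # Reward head used for planning or policy learning in model-based RL.
        self.reward_head = nn.Linear(latent_dim, 1)

    def encode(self, image, proprio):
        feats = torch.cat(
            [self.image_encoder(image), self.proprio_encoder(proprio)], dim=-1
        )
        return torch.tanh(self.fuse(feats))

    def step(self, latent, action):
        next_latent = self.dynamics(action, latent)
        reward = self.reward_head(next_latent)
        return next_latent, reward


# Example imagined rollout for a candidate action sequence.
model = VisualWorldModel()
image = torch.randn(1, 3, 64, 64)   # wrist-camera frame
proprio = torch.randn(1, 14)        # joint positions/velocities (assumed layout)
latent = model.encode(image, proprio)
for _ in range(5):
    action = torch.randn(1, 7)
    latent, reward = model.step(latent, action)
```

Such imagined rollouts are what would let a single model trained across many insertion tasks be reused for planning on unseen parts; the specific planner or policy-learning scheme is left open here.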
Submission Number: 14