Multi-Objective Photoreal Simulation (MOPS) Dataset for Computer Vision in Robotic Manipulation

Published: 21 Jun 2025, Last Modified: 21 Jun 2025SWOMO RSS25 PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Affordance Detection, Synthetic Dataset, Simulation
TL;DR: MOPS generates synthetic data for computer vision tasks such as part segmentation and affordance detection.
Abstract: We introduce the Multi-Object Photoreal Simulation (MOPS) dataset, addressing the lack of computer vision datasets specifically designed for robot manipulation. MOPS provides photorealistic simulated environments with comprehensive ground truth annotations, using a zero-shot asset augmentation pipeline based on large language models. This pipeline annotates 3D assets at the part level and normalizes assets across libraries. The dataset delivers pixel-level segmentations for critical robotics tasks including part segmentation and affordance prediction. By combining detailed annotations with photorealistic simulation, MOPS generates diverse indoor scenes to accelerate progress in robot perception, manipulation, and interaction with real-world environments. The dataset and generation framework will be made publicly available.
Submission Number: 2
Loading