HomeRobot: Open-Vocabulary Mobile Manipulation

Published: 21 Oct 2023, Last Modified: 01 Nov 2023LangRob @ CoRL 2023 PosterEveryoneRevisionsBibTeX
Keywords: Open Vocabulary, Mobile Manipulation, Benchmark, Sim2Real
TL;DR: We proposed a combined simulation and real-world benchmark on the problem of Open-Vocabulary Mobile Manipulation (OVMM).
Abstract: Abstract: HomeRobot (noun): An affordable compliant robot that navigates homes and manipulates a wide range of objects in order to complete everyday tasks. Open-Vocabulary Mobile Manipulation (OVMM) is the problem of picking any object in any unseen environment and placing it in a commanded location. This is a foundational challenge for robots to be useful assistants in human environments because it involves tackling sub-problems from across robotics: perception, language understanding, navigation, and manipulation are all essential to OVMM. In addition, integration of the solutions to these sub-problems poses its own substantial challenges. To drive research in this area, we introduce the HomeRobot OVMM benchmark, where an agent navigates household environments to grasp novel objects and place them on target receptacles. HomeRobot has two components: a simulation component, which uses a large and diverse curated object set in new, high-quality multi-room home environments; and a real-world component, providing a software stack for the low-cost Hello Robot Stretch to encourage replication of real-world experiments across labs. We implement both reinforcement learning and heuristic (model-based) baselines and show evidence of sim-to-real transfer of the nav and place skills. Our baselines achieve a 20% success rate in the real world; our experiments identify ways future work can improve performance.
Submission Number: 32
Loading