SceneScore: Learning a Cost Function for Object Arrangement

Published: 23 Oct 2023, Last Modified: 05 Nov 2023CoRL23-WS-LEAP PosterEveryoneRevisionsBibTeX
Keywords: Object Rearrangement, Graph Neural Networks, Energy-Based Models, Task Planning
TL;DR: Learn an energy-based model which predicts a lower cost for more desirable object arrangements, represented with a graph abstraction. At test time, use this to determine low-cost goal poses and satisfy constraints.
Abstract: Arranging objects correctly is a key capability for robots which unlocks a wide range of useful tasks. A prerequisite for creating successful arrangements is the ability to evaluate the desirability of a given arrangement. Our method "SceneScore" learns a cost function for arrangements, such that desirable, human-like arrangements have a low cost. We learn the distribution of training arrangements offline using an energy-based model, solely from example images without requiring environment interaction or human supervision. Our model is represented by a graph neural network which learns object-object relations, using graphs constructed from images. Experiments demonstrate that the learned cost function can be used to predict poses for missing objects, generalise to novel objects using semantic features, and can be composed with other cost functions to satisfy constraints at inference time.
Submission Number: 10
Loading