Pose Estimation from RGB Images of Highly Symmetric Objects using a Novel Multi-Pose Loss and Differential Rendering

Stefan Hein Bengtson, Hampus Åström, Thomas B. Moeslund, Elin A. Topp, Volker Krüger

Published: 16 Dec 2021, Last Modified: 27 Feb 20262021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: We propose a novel multi-pose loss function to train a neural network for 6D pose estimation, using synthetic data and evaluating it on real images. Our loss is inspired by the VSD (Visible Surface Discrepancy) metric and relies on a differentiable renderer and CAD models. This novel multi-pose approach produces multiple weighted pose estimates to avoid getting stuck in local minima. Our method resolves pose ambiguities without using predefined symmetries. It is trained only on synthetic data. We test on real-world RGB images from the T-LESS dataset, containing highly symmetric objects common in industrial settings. We show that our solution can be used to replace the codebook in a state-of-the-art approach. So far, the codebook approach has had the shortest inference time in the field. Our approach reduces inference time further while a) avoiding discretization, b) requiring a much smaller memory footprint and c) improving pose recall.

External IDs:doi:10.1109/iros51168.2021.9636839