A Shared Pose Regression Network for Pose Estimation of Objects from RGB Images

Stefan Hein Bengtson, Hampus Åström, Thomas B. Moeslund, Elin A. Topp, Volker Krueger

Published: 10 Apr 2023, Last Modified: 27 Feb 2026 2022 16th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: In this paper we propose a shared regression network to jointly estimate the pose of multiple objects, replacing multiple object-specific solutions. We demonstrate that this shared network can outperform other similar approaches that rely on multiple object-specific models by evaluating it on the TLESS dataset using the VSD (Visible Surface Discrepancy). Our approach offers a less complex solution, with fewer parameters, lower memory consumption and less training required. Furthermore, it inherently handles symmetric objects by using a depth-based loss during training and can predict in real-time. Finally, we show how our proposed pipeline can be used for fine-tuning a feature extractor jointly on all objects while training the shared pose regression network. This fine-tuning process improves the pose estimation performance.

External IDs:doi:10.1109/sitis57111.2022.00022