Unsupervised Depth and Confidence Prediction from Monocular Images using Bayesian Inference

Vishal Bhutani, Madhu Babu Vankadari, Omprakash Jha, Anima Majumder, Swagat Kumar, Samrat Dutta

Published: 01 Jan 2020, Last Modified: 05 Nov 2023IROS 2020Readers: Everyone

Abstract: In this paper, we propose an unsupervised deep learning framework with Bayesian inference for improving the accuracy of per-pixel depth prediction from monocular RGB images. The proposed framework predicts confidence map along with depth and pose information for a given input image. The depth hypotheses from previous frames are propagated forward and fused with the depth hypothesis of the current frame by using Bayesian inference mechanism. The ground truth information required for training the confidence map prediction is constructed using image reconstruction loss thereby obviating the need for explicit ground truth depth information used in supervised methods. The resulting unsupervised framework is shown to outperform the existing state-of-the-art methods for depth prediction on the publicly available KITTI outdoor dataset. The usefulness of the proposed framework is further established by demonstrating a real-world robotic pick-and-place application where the pose of the robot end-effector is computed using the depth predicted from an eye-in-hand monocular camera. The design choices made for the proposed framework is justified through extensive ablation studies.

0 Replies