ReBaR: Reference-Based Reasoning for Robust Human Pose and Shape Estimation from Monocular Images

Yongkang Cheng; Shaoli Huang; Jifeng Ning; Xiaohang Zhan; Ying Shan

ReBaR: Reference-Based Reasoning for Robust Human Pose and Shape Estimation from Monocular Images

Yongkang Cheng, Shaoli Huang, Jifeng Ning, Xiaohang Zhan, Ying Shan

15 Sept 2023 (modified: 11 Feb 2024)Submitted to ICLR 2024EveryoneRevisionsBibTeX

Primary Area: representation learning for computer vision, audio, language, and other modalities

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics.

Keywords: Soft-Attention-Guided; Avatar; Deepth Error

Submission Guidelines: I certify that this submission complies with the submission instructions as described on https://iclr.cc/Conferences/2024/AuthorGuide.

TL;DR: Solve depth error and occlusion problems in 3D human body reconstruction through body reference features

Abstract: This paper introduces a novel method, ReBaR (Reference-Based Reasoning for Robust Human Pose and Shape Estimation), designed to estimate human body shape and pose from single-view images. ReBaR effectively addresses the challenges of occlusions and depth ambiguity by learning reference features for part regression reasoning. Our approach starts by extracting features from both body and part regions using an attention-guided mechanism. Subsequently, these features are used to encode additional part-body dependencies for individual part regression, with part features serving as queries and the body feature as a reference. This reference-based reasoning allows our network to infer the spatial relationships of occluded parts with the body, utilizing visible parts and body reference information. ReBaR outperforms existing state-of-the-art methods on two benchmark datasets, demonstrating significant improvement in handling depth ambiguity and occlusion. These results strongly support the effectiveness of our reference-based framework for estimating human body shape and pose from single-view images.

Anonymous Url: I certify that there is no URL (e.g., github page) that could be used to find authors' identity.

Supplementary Material: zip

No Acknowledgement Section: I certify that there is no acknowledgement section in this submission for double blind review.

Submission Number: 99

Loading