Structural Equivariance Self-Supervised Learning for Facial Pose Estimation

Yaoxing Wang, Heng Zhou, Zhendong Li, Xian Mo, Hao Liu

Published: 01 Jan 2023, Last Modified: 13 Nov 2024ICME 2023EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: In this paper, we propose a self-supervised learning method for robust facial pose estimation. Conventional methods usually split the coherent head motion into discrete and finite outputs, likely leading to bias prediction because the performance of head pose estimation highly relies on structural facial appearance. To address this issue, our model achieves structural equivariance to poses through a self-supervised learning strategy from extrinsic attributes of face neighbors and underlying local associations. Specifically, we construct a complete neighbor graph to capture the extrinsic properties of face neighbors, where different latent semantic attributes are assigned to each subgraph. Accordingly, we design a set of proxy tasks based on different attribute subgraphs, where the model is encouraged to learn the underlying relation of local features under pose variation. Extensive experimental results on the challenging, widely evaluated datasets indicate the effectiveness of our model compared with the state of the arts.