Self Decoupling-Reconstruction Network for Facial Expression Recognition

Published: 2024, Last Modified: 05 Feb 2026, Venue: IJCNN 2024, License: CC BY-SA 4.0
Abstract: Facial Expression Recognition (FER) is challenging because imaging conditions vary widely, including head pose, lighting, resolution, and occlusion. Personal attributes such as age, gender, and racial background add further complexity. To extract meaningful expression features despite these interfering factors, and thereby improve both recognition accuracy and generalization, we propose a Self Decoupling-Reconstruction Network (SDRNet). Our approach involves two learning phases. In the first phase, the network is trained to decouple an expressive facial image into expression and neutral components. This phase reconstructs both the neutral facial image and the original input, which ensures that the decoupled expression component retains meaningful expression information free of interference. In the second phase, we employ simple convolutional neural networks (CNNs) to recognize the extracted expression components. Our method achieves state-of-the-art results on multiple widely used datasets, and cross-database evaluations demonstrate its robust generalization.
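The two phases described in the abstract can be made concrete with a rough formulation. This is an illustrative sketch under stated assumptions, not the paper's actual objective: the encoder $E$, decoder $D$, classifier $C$, the neutral target image $x_{\mathrm{neu}}$, the additive recombination of components, the L1 norm, and the weight $\lambda$ are all hypothetical, since the abstract does not specify them.

```latex
% Illustrative only: every symbol and loss choice here is an assumption.
\begin{align}
  % Phase 1: decouple the input x into expression and neutral components.
  (z_{\mathrm{exp}},\, z_{\mathrm{neu}}) &= E(x) \\
  % Reconstruct the neutral face from z_neu and the original input from both.
  \mathcal{L}_{\mathrm{phase\,1}} &=
      \lVert D(z_{\mathrm{neu}}) - x_{\mathrm{neu}} \rVert_{1}
      + \lambda \, \lVert D(z_{\mathrm{exp}} + z_{\mathrm{neu}}) - x \rVert_{1} \\
  % Phase 2: a simple CNN classifier C sees only the expression component;
  % y is the ground-truth expression label (cross-entropy loss).
  \mathcal{L}_{\mathrm{phase\,2}} &= -\log C(z_{\mathrm{exp}})_{y}
\end{align}
```

Under this reading, the two reconstruction terms play different roles: the first pushes identity and imaging factors into $z_{\mathrm{neu}}$, while the second prevents $z_{\mathrm{exp}}$ from collapsing, since the original input can only be recovered when the expression information is kept.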