Modality Interference Decoupling and Representation Alignment for Caricature-Visual Face Recognition

Yang Xu, Junyi Wu, Yan Yan, Xinsheng Du, Huiji Zhang, Jianqiang Zhao, Zhipeng Gao

Published: 01 Jan 2023, Last Modified: 15 Apr 2024PRCV (1) 2023Readers: Everyone

Abstract: Cross-modality face recognition aims to match facial images across different modalities. This task becomes very challenging when one of the modalities is the facial caricature, which enhances instinctive facial features through extreme distortions and exaggerations with diverse styles by artists. In this paper, we develop a novel modality interference decoupling and representation alignment (MIR) method for visual-caricature face recognition. Our MIR method consists of a backbone network, an identity-interference orthogonal decoupling (IIOD) module, and a modality feature alignment (MFA) module. The IIOD module adopts a three-branch structure to decouple the deep semantic features extracted by the backbone network into identity features and modality features. In IIOD, we design an identity subspace alignment (ISA) module to align the identity features from different branches. Moreover, we design the MFA module to perform feature alignment between the modality feature from the IIOD module and that from the pre-trained modality interference information encoder (MIE) via adversarial learning, extracting the modality-specific information. Based on the above designs, we can effectively alleviate the interference of modality differences and style differences, improving the final performance. Extensive experimental results on multiple datasets show that our proposed method outperforms several state-of-the-art cross-modality face recognition methods.

0 Replies