Disentangling Identity Features from Interference Factors for Cloth-Changing Person Re-identification
Abstract: Cloth-Changing Person Re-Identification (CC-ReID) aims to accurately identify a target person in the more realistic surveillance scenario where clothes of the pedestrian may change drastically, which is critical in public security systems for tracking down disguised criminal suspects. Existing methods mainly transform the CC-ReID problem into cross-modality feature alignment from the data-driven perspective, without modelling the interference factors such as clothes and camera view changes meticulously. This may lead to over-consideration or under-consideration of the influence of these factors on the extraction of robust and discriminative identity features. This paper proposes a novel algorithm for thoroughly disentangling identity features from interference factors brought by clothes and camera view changes while ensuring the robustness and discriminativeness. It adopts a dual-stream identity feature learning framework consisting of a raw image stream and a cloth-erasing stream, to explore discriminative and cloth-irrelevant identity feature representations. Specifically, an adaptive cloth-irrelevant contrastive objective is introduced to contrast features extracted by the two streams, aiming to suppress the fluctuation caused by clothes textures in the identity feature space. Moreover, we innovatively mitigate the influence of the interference factors through a generative adversarial interference factor decoupling network. This network is targeted at capturing identity-related information residing in the interference factors and disentangling the identity features from such information. Extensive experimental results demonstrate the effectiveness of the proposed method, achieving superior performances to state-of-the-art methods. Our source code is available in the supplementary materials.
Primary Subject Area: [Engagement] Multimedia Search and Recommendation
Secondary Subject Area: [Engagement] Multimedia Search and Recommendation, [Content] Vision and Language
Relevance To Conference: Person Re-Identification (ReID) is a crucial task in multimedia processing such as surveillance and security, involving the identification or re-identification of individuals across different images or video sequences. This task becomes particularly challenging when individuals change their clothing, as most conventional ReID systems rely heavily on appearance-based features, such as clothing, to distinguish between different pedestrians. Our research into disentangling identity features from interference factors, such as clothing changes, aligns directly with advancing the capabilities of multimedia processing systems in handling more complex, real-world scenarios.
Supplementary Material: zip
Submission Number: 501
Loading