Decoupling feature-driven and multimodal fusion attention for clothing-changing person re-identification

Hao Yuan

Published: 12 May 2025, Last Modified: 17 Mar 2026Artificial Intelligence ReviewEveryoneCC BY-NC-ND 4.0

Abstract: Person Re-Identification (ReID) plays a crucial role in intelligent surveillance, public safety, and intelligent transportation systems. However, clothing variation remains a significant challenge in this field. To address this issue, this paper introduces a method named Decoupling Feature-Driven and Multimodal Fusion Attention for Clothing-Changing Person Re-Identification (DM-ReID). The proposed approach employs a dual-stream feature extraction framework, consisting of a global RGB image feature stream and a clothing-irrelevant feature enhancement stream. These streams respectively capture comprehensive appearance information and identity features independent of clothing. Additionally, two feature fusion strategies are proposed: firstly, an initial fusion of RGB features and clothing-irrelevant features is achieved through the Hadamard product in the mid-network stage to enhance feature complementarity; secondly, a multimodal fusion attention mechanism is integrated at the network’s end to dynamically adjust feature weights, further improving feature representation capabilities. To optimize model performance, a composite loss function combining identity loss and triplet loss is utilized, effectively enhancing the model’s discriminative ability and feature distinctiveness. Experimental results on multiple public datasets, including PRCC, LTCC, and VC-Clothes, demonstrate that DM-ReID surpasses most existing mainstream methods in Rank-1 accuracy and mean Average Precision (mAP) metrics under clothing-changing scenarios. These findings validate the method’s effectiveness and robustness in handling complex clothing variations, highlighting its promising prospects for practical applications.