Abstract: Domain generalization person reidentification (DG ReID) is a critical research topic for intelligent video surveillance, aiming at building a robust model generalizing to the unseen domain. Particularly in single-source domain generalization (SDG) scenarios, the monotonous data style presents a significant challenge. To address this issue, existing domain generalization (DG) methods typically employ image- or feature-level data augmentation techniques. However, these approaches may have limitations when dealing with the complexities of real-world scenarios due to their reliance on a single augmentation strategy. To overcome these limitations, we propose a novel method called multilevel feature perturbation (MLFP) for SDG ReID. MLFP integrates two complementary augmentation modules: random background perturbation (RBP) at the image level and the uncertain sampling normalization module (USNM) at the feature level. Specifically, RBP is designed to perform a random mixture of image backgrounds at the image level, while preserving the integrity of the pedestrian subject. By focusing on the pedestrian objectives instead of the backgrounds, RBP encourages the model to learn domain-invariant, causal features and reduce the bias toward the source domain. USNM is designed to diversify the feature-level representations through manipulating feature statistics. To further enhance the learning process, we introduce a novel loss function called improved whitening loss (IW Loss). This loss function is designed to capture consistent representations and explore invariant patterns among diverse feature distributions. Extensive experimental results demonstrate that our MLFP outperforms state-of-the-art methods across several SDG ReID benchmarks, highlighting its effectiveness in addressing the challenges of domain generalization.
External IDs:dblp:journals/tim/ChenLCSW25
Loading