Abstract: Single-camera-training person reidentification (SCT re-ID) aims to train a reidentification (re-ID) model using single-camera-training (SCT) datasets where each person appears in only one camera. The main challenge of SCT re-ID is to learn camera-invariant feature representations without cross-camera same-person (CCSP) data as supervision. Previous methods address it by assuming that the most similar person should be found in another camera. However, this assumption is not guaranteed to be correct. In this article, we propose a novel solution: the camera-invariant meta-learning network (CIMN) for SCT re-ID. CIMN operates under the premise that camera-invariant feature representations should remain robust despite changes in camera settings. To achieve this, we partition the training data into a meta-train set and a meta-test set based on camera IDs. We then conduct a cross-camera simulation (CCS) using a meta-learning strategy, aiming to enforce the feature representations learned from the meta-train set to be robust when applied to the meta-test set. We further introduce three specific loss functions to leverage potential identity relations between the meta-train set and the meta-test set. Through the CCS and the introduced loss functions, CIMN can extract feature representations that are both camera-invariant and identity-discriminative even in the absence of CCSP data. Our experimental results demonstrate that CIMN can extract feature representations that are both camera-invariant and identity-discriminative, even in the absence of CCSP data. our method achieves comparable performance with and without the use of CCSP data, and outperforms state-of-the-art methods on three SCT re-ID benchmarks.
External IDs:dblp:journals/iotj/PeiJMWLW25
Loading