Pixel-Level Contrastive Pretrainer for Industrial Image Representation

Published: 01 Jan 2024, Last Modified: 13 Nov 2024IEEE Trans. Instrum. Meas. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Industrial quality inspection aims to identify defective parts in industrial production processes. Commonly used methods for industrial quality inspection rely on feature representations that have been pretrained on natural image datasets, such as ImageNet. However, these pretrained models are not specifically tailored for industrial scenarios and therefore do not transfer well to downstream industrial tasks. In this study, we have curated a large-scale industrial production dataset called Ind-2M, which is specifically collected from industrial scenarios. This dataset serves to enhance the industrial representation of pretraining models. Additionally, we propose a Pixel-level COntrastive (PiCO) pretrainer for industrial image representation. PiCO not only improves the global industrial representation through industrial production classification, but also enhances the local industrial representation through pixel-level self-supervision. Experimental results demonstrate that PiCO effectively transfers to downstream industrial tasks, such as multilabel defect classification and anomaly detection, outperforming existing pretrained methods. We hope PiCO can initiate a new paradigm for industrial image pretraining.
Loading