IRAD: Input-Reference Joint Driven Reconstruction for Unified Anomaly Detection

Zixin Chen, Xincheng Yao, Yan Luo, Zhenyu Liu, Baozhu Zhang, Chongyang Zhang

Published: 2024, Last Modified: 14 Feb 2026VCIP 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Unified (multi-class and cross-class) anomaly detection (AD) is a growing area of interest in real-world applications. However, the popular reconstruction-based AD approach usually faces two significant challenges: the "identical shortcut" issue (copying the input as output) and the lack of class adaptability (the AD model cannot be directly applied to new classes). To address these challenges, we propose a novel unified AD method, named IRAD (Input-Reference Joint Driven). Our core insight is to effectively incorporate both input and references into the reconstruction process. Our IRAD consists of three components: 1) A Suspicious Anomaly Substituting module that replaces the potential abnormal regions of input with anomaly-free reference patches to prevent abnormal information leakage, effectively addressing the "identical shortcut". 2) An Input-Reference Fusing module that merges reference embeddings with input, which urges the subsequent Decoder to effectively utilize the normal reference patterns to reconstruct anomaly-free samples, making our model more class-adaptive. 3) A Rich Feature Preserving Decoder that efficiently preserves low-level details, mitigating low-level information degradation during reverse construction from high to low level. In multi-class AD, IRAD achieves better results on Mvtec-AD, BTAD, and VisA. In cross-class AD, IRAD also outperforms the baesline methods on Mvtec-AD and VisA.

External IDs:dblp:conf/vcip/ChenYLLZZ24