EHAN: An explicitly high-order attention network for accurate camouflaged object detection

Published: 31 Mar 2025, Last Modified: 22 Apr 2025OpenReview Archive Direct UploadEveryoneCC BY 4.0
Abstract: Wild animals often change their appearance such as color and textures to seamlessly blend into the surrounding environments to avoid predators and enemies, which poses extreme challenges to the task of Camouflaged Object Detection (COD). Recently, most COD methods have struggled to improve the detection performance using attention mechanisms to guide deep networks to focus on the subtle differences between camouflaged objects and backgrounds. However, these approaches tend to suffer from the lower orders in attention, which have limited abilities to model the subtle differences, as observed by applying the Taylor expansion to attention mappings. To solve this problem, we propose Explicitly High-order Attention (EHA) based on the tensor decomposition theory to effectively detect hidden objects, especially at edges. Furthermore, we use EHA as a basic block to design a new COD network EHAN, which consists of four modules: Cross-layer High-order Fusion (CHF), High-order Semantic Attention (HSA), High-order Edge Attention (HEA), and High-order Reverse Attention (HRA). Specifically, CHF progressively improves the features of a backbone by merging three of them at adjacent levels at a time. HSA roughly locates the disguised objects using the features of CHF at the three highest levels, whereas HEA integrates the features of CHF at the two lowest levels to predict edges accurately. HRA aims to discover distractions and incorporate the predicted edges for edge refinement. Extensive experiments on four public datasets demonstrate the superiority of our EHAN over state-of-the-art methods in quality and quantity.
Loading