Proposal Feature Learning Using Proposal Relations for Weakly Supervised Object Detection

Published: 01 Jan 2024, Last Modified: 30 Jan 2025ICME 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Weakly Supervised Object Detection (WSOD) trains detectors using only image-level annotations. Most existing WSOD models are based on pre-computed proposals and do not fully explore the relations of proposals. In this work, we address this limitation by proposing two approaches of Proposal Feature Learning for WSOD (PFL-WSOD), which effectively capture intra-proposal relations and inter-proposal relations respectively, thus improving proposal representation. To extract intra-proposal relations, we propose to utilize Self-Attention on Single Proposal for capturing relations inside each proposal. For inter-proposal relations, we propose Salient Region Banks by capturing a unique type of inter-proposal relation called deep inclusion, which significantly improves proposal representation when used in synergy with contrastive learning. Experimental results on benchmarks demonstrate the effectiveness of our methods.
Loading