Abstract: Weakly Supervised Object Detection (WSOD) trains detectors using only image-level annotations. Most existing WSOD models rely on pre-computed proposals and do not fully explore the relations within and among those proposals. In this work, we address this limitation with two Proposal Feature Learning approaches for WSOD (PFL-WSOD), which capture intra-proposal and inter-proposal relations, respectively, and thereby improve proposal representation. For intra-proposal relations, we propose Self-Attention on Single Proposal, which captures relations inside each proposal. For inter-proposal relations, we propose Salient Region Banks, which capture a distinctive type of inter-proposal relation called deep inclusion and significantly improve proposal representation when combined with contrastive learning. Experimental results on benchmarks demonstrate the effectiveness of our methods.