Libra-SOD: Balanced label assignment for small object detection

Published: 2024, Last Modified: 13 Nov 2024. Knowl. Based Syst. 2024. CC BY-SA 4.0
Abstract: Small object detection (SOD) is a notoriously challenging task in the computer vision community. Because small instances occupy very small regions and have limited overlap with priors (anchors or points), strict label assignment based on pre-defined IoU thresholds usually leaves small objects without sufficient training samples. Although center sampling and IoU statistic-based label assignment strategies mitigate this imbalance, they struggle to deliver consistent gains for small, medium and large objects simultaneously. In this paper, we propose Libra-SOD, a novel model with a balanced label assignment (BLA) strategy for SOD in complex scenes. First, BLA considers both classification confidence and localization quality in the assignment process, and assigns the same number of positive samples to each ground-truth object. Second, to work closely with BLA, we introduce a task-aware head, which makes the assignment results more reliable by interweaving the classification and regression tasks. Finally, a task-aware loss is designed to dynamically assign weight factors and labels when supervising predictions, allowing the framework to focus more on valuable samples. Extensive experiments are performed on four challenging datasets. On DIOR (object DetectIon in Optical Remote sensing images), Libra-SOD achieves a state-of-the-art performance of 73.7 mAP with ResNet-50 as the backbone. To the best of our knowledge, Libra-SOD is the first single-stage framework to exceed 30 AP on SODA-D (Small Object Detection dAtasets).
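The contrast between threshold-based assignment and assigning an equal number of positives per ground truth can be illustrated with a minimal sketch. The abstract does not state the paper's actual quality metric, so the product form `conf**alpha * iou**beta`, the function names `pairwise_iou` and `topk_assign`, and the parameters `k`, `alpha` and `beta` are all assumptions made for illustration; this is not the authors' implementation.

```python
import numpy as np

def pairwise_iou(gt, boxes):
    """IoU of every ground-truth box against every predicted box.
    Boxes are (x1, y1, x2, y2); returns an array of shape (G, N)."""
    gt = gt[:, None, :]        # (G, 1, 4)
    boxes = boxes[None, :, :]  # (1, N, 4)
    x1 = np.maximum(gt[..., 0], boxes[..., 0])
    y1 = np.maximum(gt[..., 1], boxes[..., 1])
    x2 = np.minimum(gt[..., 2], boxes[..., 2])
    y2 = np.minimum(gt[..., 3], boxes[..., 3])
    inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
    area_gt = (gt[..., 2] - gt[..., 0]) * (gt[..., 3] - gt[..., 1])
    area_bx = (boxes[..., 2] - boxes[..., 0]) * (boxes[..., 3] - boxes[..., 1])
    return inter / np.maximum(area_gt + area_bx - inter, 1e-9)

def topk_assign(cls_scores, pred_boxes, gt_boxes, gt_labels, k=2, alpha=1.0, beta=1.0):
    """Pick the k highest-quality predictions for each ground truth.
    ASSUMPTION: quality = confidence**alpha * IoU**beta (hypothetical metric).
    A prediction may be picked by several ground truths; conflicts are not resolved here."""
    iou = pairwise_iou(gt_boxes, pred_boxes)   # (G, N) localization quality
    conf = cls_scores[:, gt_labels].T          # (G, N) score for each GT's class
    quality = conf**alpha * iou**beta
    topk = np.argsort(-quality, axis=1)[:, :k] # (G, k) indices of positives
    return topk, iou

# Toy scene: one large object (class 0) and one 8x8 small object (class 1).
gt_boxes = np.array([[0, 0, 100, 100], [200, 200, 208, 208]], dtype=float)
gt_labels = np.array([0, 1])
pred_boxes = np.array([
    [0, 0, 100, 100], [10, 10, 100, 100], [0, 0, 90, 110],            # near the large object
    [196, 196, 212, 212], [200, 200, 216, 216], [204, 204, 220, 220], # near the small object
], dtype=float)
cls_scores = np.array([  # (N, num_classes) predicted confidences
    [0.9, 0.1], [0.8, 0.1], [0.3, 0.1],
    [0.1, 0.6], [0.1, 0.9], [0.1, 0.9],
])

topk, iou = topk_assign(cls_scores, pred_boxes, gt_boxes, gt_labels, k=2)
threshold_positives = (iou >= 0.5).sum(axis=1)  # positives per GT under a fixed IoU threshold
print("IoU >= 0.5 positives per GT:", threshold_positives.tolist())
print("top-k positives per GT:", topk.tolist())
```

In this toy scene the fixed IoU threshold gives the small object no positives at all, because even well-placed 16x16 predictions overlap an 8x8 box with IoU of at most 0.25, while the top-k rule gives both objects the same number of positives.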