Abstract: In unmanned aerial systems, especially in complex environments, accurately detecting tiny objects is crucial. Resizing images is a common strategy to improve detection accuracy, particularly for small objects. However, simply enlarging images significantly increases computational costs and the number of negative samples, severely degrading detection performance and limiting its applicability. This article proposes a dynamic pooling network (DPNet) for tiny object detection to mitigate these issues. DPNet employs a flexible down-sampling strategy by introducing a factor (df) to relax the fixed down-sampling process of the feature map to an adjustable one. Furthermore, we design a lightweight predictor to predict df for each input image, which will be used to decrease the resolution of feature map in backbone. Thus, we achieve input-aware down-sampling. We design an adaptive normalization module (ANM) to make a unified detector well compatible with different dfs. At the same time, we also design a guidance loss to supervise the predictor’s training. DPNet realizes the dynamic allocation of computing resources to tradeoff detection accuracy and efficiency through this. Experiments on the TinyCOCO and TinyPerson datasets show that our DPNet can save over 35% and 25% GFLOPs, respectively, while maintaining comparable detection performance. The code will be made publicly available.
External IDs:dblp:journals/iotj/GongCCYLZH25
Loading