A Lightweight Hybrid Network for Object Detection in Remote Sensing Images Balancing Global and Local Information

Shuting Huang, Ge Zhang, Huanzun Zhang, Hui Xu, Guangzhen Yao, Sandong Zhu, Long Zhang, Jun Kong

Published: 01 Jan 2025, Last Modified: 04 Feb 2026IEEE Geoscience and Remote Sensing LettersEveryoneRevisionsCC BY-SA 4.0

Abstract: In recent years, hybrid convolutional neural networks (CNNs) and Transformer-based object detection technologies have achieved remarkable success. In the field of remote sensing image detection, since remote sensing systems rely on the large-scale deployment of edge devices, detection models need to be lightweight with low parameter complexity to adapt to resource-constrained environments. However, existing lightweight models often struggle with an imbalance in extracting low-frequency global and high-frequency local information. In particular, when processing high-frequency local information (such as edges, textures, and fine structures), these models often lack in-depth analysis, leading to insufficient extraction of local features and reduced detection accuracy. To address the imbalance between low-frequency global information and high-frequency local information in lightweight remote sensing models, we propose an efficient and lightweight hybrid network detection framework, which mainly consists of the global–local balance (GLB) module and the detail-aware feature fusion (DAFF) module. The GLB module adopts dynamic weight adjustment and context-aware mechanisms to effectively aggregate high-frequency local information in the image. The DAFF module further enhances feature fusion and detail refinement, improving the model’s performance and generalization ability. Experimental results on remote sensing datasets, including RSOD, NWPU VHR-10, and LEVIR datasets, demonstrate that our proposed method achieves a well-balanced tradeoff between model size and detection accuracy, reaching state-of-the-art performance.

External IDs:doi:10.1109/lgrs.2025.3588788