Abstract: Highlights•LESOD achieves better detection performance with fewer parameters.•We design a cross-modal integratio module to enable multi-modal fusion.•We devise a multi-level feature enhancement module to generate the predicted map.•The model has 2.9M parameters, and it shows high computational efficiency.
External IDs:dblp:journals/pr/ZhongSWS26
Loading