A complementary dual-backbone transformer extracting and fusing weak cues for object detection in extremely dark videos
Abstract: Highlights
• A dual-backbone structure strengthens feature extraction for low-light images and videos.
• Temporal feature aggregation fuses information from multiple frames in dark videos.
• Illumination-aware feature fusion adapts to dark data with high dynamic range and varying darkness levels.
• The overall hybrid-backbone model outperforms existing methods for object detection in extremely dark videos.
External IDs: dblp:journals/inffus/ZhangSD23