Abstract: Highlights
• Proposed an encoder–decoder architecture that integrates DCNNs and Transformer.
• Proposed a TMF block that dynamically fuses semantic information in high-level features.
• Proposed a UA block that combines spatial attention and channel attention.