Abstract: Highlights
• Fusing local and global information improves estimation accuracy.
• A multi-level framework aggregates information from multiple areas.
• Fusing spatial and channel attention improves feature representation.
• The attention mechanism effectively fuses multi-modal information.