Abstract: Highlights•Global multiscale features are implicitly extracted with limited computational cost.•Correlations across modalities are first used to decouple the segmentation pipeline.•Our method achieves the current state-of-the-art segmentation accuracy.