Multi-dimensional Fusion and Consistency for Semi-supervised Medical Image Segmentation

Yixing Lu, Zhaoxin Fan, Min Xu

Published: 2024, Last Modified: 10 Nov 2025MMM (2) 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: In this paper, we introduce a novel semi-supervised learning framework tailored for medical image segmentation. Central to our approach is the innovative Multi-scale Text-aware ViT-CNN Fusion scheme. This scheme adeptly combines the strengths of both ViTs and CNNs, capitalizing on the unique advantages of both architectures as well as the complementary information in vision-language modalities. Further enriching our framework, we propose the Multi-Axis Consistency framework for generating robust pseudo labels, thereby enhancing the semi-supervised learning process. Our extensive experiments on several widely-used datasets unequivocally demonstrate the efficacy of our approach.

External IDs:dblp:conf/mmm/LuFX24