MITER: Medical Image-TExt joint adaptive pretRaining with multi-level contrastive learning

Published: 2024, Last Modified: 06 Nov 2025Expert Syst. Appl. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•Novel joint pretraining framework for medical image/text via contrastive learning.•Uni-modal procedure effectively exploits spontaneous relationship.•Effective negative sampling mechanism using Alignment and Uniformity properties.•Significantly outperforms SoTA models on uni-modal and cross-modal tasks.
Loading