MITER: Medical Image-TExt joint adaptive pretRaining with multi-level contrastive learning

Published: 01 Jan 2024, Last Modified: 11 Apr 2025Expert Syst. Appl. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•Novel joint pretraining framework for medical image/text via contrastive learning.•Uni-modal procedure effectively exploits spontaneous relationship.•Effective negative sampling mechanism using Alignment and Uniformity properties.•Significantly outperforms SoTA models on uni-modal and cross-modal tasks.
Loading