Abstract: Highlights•Novel joint pretraining framework for medical image/text via contrastive learning.•Uni-modal procedure effectively exploits spontaneous relationship.•Effective negative sampling mechanism using Alignment and Uniformity properties.•Significantly outperforms SoTA models on uni-modal and cross-modal tasks.
Loading