Graph-Based Multimodal Learning for Early Sepsis Prediction in Resource-Constrained Clinical Settings

Published: 22 Sept 2025, Last Modified: 22 Sept 2025WiML @ NeurIPS 2025EveryoneRevisionsBibTeXCC BY 4.0
Keywords: Graph Neural Networks, Multimodal fusion, Sepsis prediction, Temporal modeling, Cross-patient reasoning, Low resource settings
Abstract: Sepsis continues to represent a major challenge in intensive care units (ICUs), with mortality risk increasing by 4–8% for every hour of delayed intervention [1]. While recent deep learning approaches have shown promise in predicting sepsis onset from electronic health records and physiological time series [2,3], their reliance on complete, regularly sampled data renders them ill-suited to real-world low resource settings characterized by heterogeneous, asynchronous, and frequently missing modalities [4]. Moreover, existing multimodal fusion methods often neglect temporal irregularities and inter-modality dependencies that are central to clinical reasoning [5]. To address these limitations, we introduce a graph-based multimodal framework that explicitly models both temporal dynamics and cross-modality relationships. Each patient is represented as a temporal-modality graph in which clinical variables—including vital signs, laboratory biomarkers, demographic attributes, and chest radiographs—form modality nodes connected by edges encoding temporal proximity and physiological correlation. We develop a time-aware, resource-weighted message passing mechanism that adaptively prioritizes recent and reliable modalities, thereby mitigating the information loss associated with irregular sampling. In addition, a cross-patient graph propagation layer leverages population-level similarities to enhance prediction under severe modality sparsity, a scenario frequently observed in low- and middle-income healthcare systems. We evaluate the framework on a curated dataset integrating MIMIC-IV and MIMIC-CXR, encompassing 1,248 patients with balanced positive and negative sepsis outcomes. The proposed model achieves an AUROC of 0.945 and an AUPRC of 0.918, substantially outperforming strong baselines including dynamic temporal GNNs, interpretable sepsis risk scores, and static multimodal graph models. Ablation studies confirm that time-aware message passing, modality-level attention, and cross-patient reasoning each provide significant performance, with the latter proving essential for robustness under missing data. Importantly, the model maintains superior performance even when 50% of modalities are absent, underscoring its applicability in resource-constrained clinical workflows. References: [1] Seymour, C., Gesten, F., Prescott, H., Friedrich, M., Iwashyna, T., Phillips, G., Lemeshow, S., Osborn, T., Terry, K. & Levy, M. Time to treatment and mortality during mandated emergency care for sepsis. New England Journal Of Medicine. 376, 2235-2244 (2017). [2] Wang, Z. & Yao, B. Multi-branching temporal convolutional network for sepsis prediction. IEEE Journal Of Biomedical And Health Informatics. 26, 876-887 (2021). [3] He, Z., Chen, X., Fang, Z., Yi, W., Wang, C., Jiang, L., Tong, Z., Bai, Z., Li, Y. & Pan, Y. Early sepsis prediction using ensemble learning with features extracted from LSTM recurrent neural network. 2019 Computing In Cardiology (CinC). pp. Page-1 (2019). [4] Agnello, L., Vidali, M., Padoan, A., Lucis, R., Mancini, A., Guerranti, R., Plebani, M., Ciaccio, M. & Carobene, A. Machine learning algorithms in sepsis. Clinica Chimica Acta. 553 pp. 117738 (2024). [5] Lin, C., Zhang, Y., Ivy, J., Capan, M., Arnold, R., Huddleston, J. & Chi, M. Early diagnosis and prediction of sepsis shock by combining static and dynamic information using convolutional-LSTM. 2018 IEEE International Conference On Healthcare Informatics (ICHI). pp. 219-228 (2018)
Submission Number: 229
Loading