Predicting Dementia Risk Using Longitudinal Electronic Health Records Data

Published: 23 Sept 2025, Last Modified: 18 Oct 2025TS4H NeurIPS 2025EveryoneRevisionsBibTeXCC BY 4.0
Keywords: Dementia, Risk Screening, LLMs, GNNs, time-course data, UK Biobank, EHR
Abstract: EHR data are collected as part of routine clinical practice and can be used to train predictive models that estimate and stratify disease risk at the population level. This is particularly valuable for conditions like dementia, where advances in disease-modifying treatments and early interventions have the potential to substantially reduce disease burden. Longitudinal analysis of EHR data can provide valuable insights into dementia risk. In this work, we investigate the utility of statistical learning, graph neural networks, and large language modelling approaches to predict dementia five years before diagnosis using time-course medical history data. We evaluate the performance and utility of five different modelling approaches using data from the UK Biobank (n=9,537) and present a risk stratification model.
Submission Number: 7
Loading