VARIATIONAL MODELLING OF TEMPORAL EHRS FOR PHENOTYPIC CLUSTERINGDownload PDF

Published: 01 Mar 2023, Last Modified: 22 Apr 2023ICLR 2023 TSRL4H PosterReaders: Everyone
Keywords: Unsupervised Learning, Clustering EHR, Multi-dimensional Time-Series, Probabilistic Methods
TL;DR: We propose a generative model to cluster EHR data based on identifying clinically meaningful phenotypes with regard to patient outcome prediction and physiological trajectory.
Abstract: The recent availability of Electronic Health Records (EHR) has allowed for the development of algorithms predicting inpatient risk of deterioration and trajectory evolution. However, prediction of disease progression with EHR is challenging since these data are sparse, heterogeneous, multi-dimensional, and multimodal time-series. Clustering is an alternative approach to identify similar groups within the patient cohort that can be leveraged to predict patient status evolution. In particular, the identification of phenotypically separable clusters has proven very useful in improving healthcare delivery. Some clustering models have been proposed to identify phenotypically separable clusters; however, these have struggled in clinical settings characterised by several, highly imbalanced event classes. To that end, we propose a generative model to cluster EHR data based on identifying clinically meaningful phenotypes with regard to patient outcome prediction and physiological trajectory. We introduce a novel probabilistic method that is capable of simultaneously a) generating observation data, b) modelling temporal cluster assignments, and c) predicting admission outcomes. Our results show performance similar to state-of-the-art methods, with increased clustering separability, and capability to generate observation data.
Anonymity: We agree to keep the submission (including any supplements and/or code) anonymous.
Formatting: We confirm that we read and complied with the author's instructions.
0 Replies

Loading