Confirmation: I have read and agree with the IEEE BHI 2025 conference submission's policy on behalf of myself and my co-authors.
Keywords: Injury Severity Score (ISS), trauma severity, ensemble regression models, imbalanced data, data augmentation, generative models
TL;DR: This paper presents a machine learning framework using ensemble regression and generative models to accurately predict Injury Severity Scores from imbalanced clinical, demographic, and vehicle data collected at a trauma care center.
Abstract: The Injury Severity Score (ISS) is a critical metric in trauma care, widely used for assessing injury severity, guiding clinical decisions, and evaluating patient outcomes. Despite the practical challenges of computing the score, due to its clinical significance, we propose a machine learning (ML) framework to predict ISS using structured clinical, demographic, and vehicle data, routinely documented in trauma registries and hospitals. We evaluate four ensemble-based regression models such as Random Forest Regressor (RFR), Gradient Boosting Regressor (GBR), eXtreme Gradient Boosting (XGBoost) regressor, and Gradient Boosting Machine (LightGBM). We identify GBR as the best performer when applied on a dataset generated at our clinical site, with coefficient of determination $R^2 = 0.78$. However, the original dataset exhibits substantial imbalance, with most cases concentrated in low-severity scores. To address the challenges of skewed ISS distribution, we implement various data augmentation techniques including transformations of target, resampling, interpolation, and noise-based strategies. Moreover, we develop two generative models Conditional Variational Autoencoder (cVAE) and Conditional Generative Adversarial Network (cGAN) to synthesize data from underrepresented severity ranges. The cVAE-augmented model achieves the highest performance of $R^2 = 0.94$, demonstrating the value of generative augmentation in enhancing regression accuracy under data imbalance.
Track: 4. Clinical Informatics
Registration Id: F6NT6B5WBST
Submission Number: 219
Loading