Validation of a Diabetes Subtype Classification Model Using Data from U.S. Adults Before and After the COVID-19 Pandemic

Published: 19 Mar 2026, Last Modified: 07 May 2026 | Metabolites | CC BY-SA 4.0
Abstract: Background: We and others have previously identified five clinically distinct diabetes subtypes. Few models for identifying these subtypes are currently readily accessible. Further, while COVID-19 has been associated with an increased risk of new-onset diabetes, it remains unknown whether the pandemic is also associated with changes in the distribution of diabetes subtypes. Methods: We used the electronic health records of patients diagnosed with diabetes from 2010 to 2019 at the Kirklin Clinic of the University of Alabama at Birmingham (UAB) to train models to assign the diabetes subtypes previously identified by hierarchical clustering. We then applied the trained model in a retrospective cluster analysis of the electronic health records of patients diagnosed with diabetes from 2020 to 2024 at UAB. We further validated our findings using data from the 2015–2023 National Health and Nutrition Examination Surveys (NHANES). Results: The trained classification model had an average specificity of 98% and an average sensitivity of 93%. Using the model, we identified a significant difference in the distribution of type 2 diabetes subtypes between the earlier and later periods, both in patients at UAB and in NHANES participants. In particular, the proportion of patients with the severe insulin-dependent diabetes or severe insulin-resistant diabetes subtypes increased from 42% to 61% at UAB and from 31% to 40% in NHANES. Conclusions: The model presented here can facilitate the identification of diabetes subtypes. The proportion of patients with severe diabetes subtypes appears to have increased in the years following the onset of the pandemic. Further studies are required to determine the potential causes of this phenomenon.
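The abstract reports an average specificity and an average sensitivity for a multi-class subtype classifier but does not state how the averages were formed. The following is a minimal sketch of one common convention, assuming one-vs-rest per-class metrics that are then macro-averaged (unweighted mean across classes). The function names and the three-class confusion matrix are hypothetical and purely illustrative; they are not the study's data or the authors' code.

```python
def per_class_sensitivity_specificity(confusion):
    """Return a (sensitivity, specificity) pair for each class, one-vs-rest.

    confusion[i][j] is the number of samples whose true class is i
    and whose predicted class is j.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    results = []
    for k in range(n):
        tp = confusion[k][k]
        fn = sum(confusion[k]) - tp                        # true k, predicted other
        fp = sum(confusion[i][k] for i in range(n)) - tp   # true other, predicted k
        tn = total - tp - fn - fp
        sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
        specificity = tn / (tn + fp) if (tn + fp) else float("nan")
        results.append((sensitivity, specificity))
    return results


def macro_average(results):
    """Unweighted mean of per-class sensitivity and specificity."""
    n = len(results)
    mean_sensitivity = sum(s for s, _ in results) / n
    mean_specificity = sum(sp for _, sp in results) / n
    return mean_sensitivity, mean_specificity


# Made-up confusion matrix for illustration only (not study data).
toy_confusion = [
    [8, 1, 1],
    [2, 7, 1],
    [0, 1, 9],
]

per_class = per_class_sensitivity_specificity(toy_confusion)
avg_sensitivity, avg_specificity = macro_average(per_class)
print(f"macro sensitivity={avg_sensitivity:.2f}, macro specificity={avg_specificity:.2f}")
```

Macro-averaging weights every subtype equally regardless of how many patients fall into it. A prevalence-weighted average would be a different, equally plausible reading of "average" and can give different numbers when the subtypes are imbalanced.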