Classifying polyneuropathy and myopathy patients on Electronic Health Records

Md Shamim Ahmed, Nicolai Dinh Khang Truong, Elisabeth Nyoungui, Jiawei Zhao, Hanna Wedemeyer, Rudolf Mayer, Verena Schuster, Jana Zschüntzsch, Richard Röttger

Published: 12 Dec 2025, Last Modified: 14 Jan 2026CrossrefEveryoneRevisionsCC BY-SA 4.0
Abstract: h3>Abstract</h3> <h3>Background</h3> <p>Rare neuromuscular diseases such as polyneuropathy (PN) and myopathy (MY) often share symptomatic characteristics, leading to diagnostic challenges and delays. Machine learning applied to routine care data of electronic health records (EHRs) offers the potential for accelerating accurate diagnosis.</p><h3>Objective</h3> <p>To develop and evaluate machine learning models to distinguish between patients with PN and MY using EHR data, as a step toward tools that could support improved diagnostic processes.</p><h3>Methods</h3> <p>We analyzed EHR data from 2,181 patients (1,853 PN, 328 MY) provided by the Medical Data Integration Center of the University of Göttingen. The features were curated according to the recommendations of the physicians, the literature, and statistical analysis. We implemented Logistic Regression, Random Forest, and XGBoost models, optimized with Grid Search, and addressed class imbalance using SMOTE.</p><h3>Results</h3> <p>Random Forest and XGBoost models achieved the best performance with F1 Macro scores of 0.82-0.84 and AUC-ROC scores of 0.92-0.93 when trained on demographic data, feature-engineered variables, laboratory test results, and ICD-10 codes. Patient age emerged as a significant predictive factor, with MY patients typically diagnosed at younger ages (mean=51.39) than PN patients (mean=67.11).</p><h3>Conclusion</h3> <p>Machine learning models can effectively differentiate between PN and MY patients using EHR data with low data depth, potentially accelerating diagnostic processes for these rare neuromuscular diseases.</p><h3>Availability and implementation</h3> <p>https://gitlab.sdu.dk/screen4care/classifying-pn-and-my</p>
Loading