Linking Electrocardiogram and Echocardiogram: Comparing Classical Machine Learning and Deep Learning Neural Networks for the Detection of Regional Wall Motion Abnormalities
Abstract: Historically, electrocardiogram (ECG) datasets have been created based on physicians’ interpretation of the ECG, which may introduce human biases and errors. Nightingale Open Science provides an open-source ECG dataset linking to an imaging marker, regional wall motion abnormality (RWMA), that is primarily associated with myocardial ischemia or infarction. RWMA refers to an area of the heart muscle that does not contract properly and can be identified on an imaging modality such as an echocardiogram or cardiac magnetic resonance imaging (MRI). Our study aimed to predict RWMA using both classical machine learning (ML) methods and a one-dimensional convolutional neural network (1D CNN) model on the 12-lead ECG data from 3,750 unique patients provided by the Nightingale Open Science platform. Of the 3,750 patients, 341 (9.1%) were diagnosed with positive RWMA on their echocardiogram within a year of the ECG. After filtering out patients with missing data, we identified 313 patients with positive RWMA label and randomly undersampled 313 from the remaining patients to keep the data balanced. We repeated the experiments with five random selections of the 313 patients without RWMA to ensure the robustness of the models. Our results indicate that the optimized 1D CNN model achieved an area under the receiver operating characteristic curve (AUROC) of 73.40% (1D CNN) versus 63.58% (Random Forest), an area under the precision-recall curve (AUPRC) of 72.45% (1D CNN) versus 72.64% (Random Forest), and accuracy of 71.15% (1D CNN) versus 63.58% (Random Forest). The Random Forest model revealed that QT duration and maximal amplitude differential of ST elevation or depression as the key features associated with RWMA. This study demonstrated that CNN models and classical ML methods offer complementary advantages. Additionally, we compared the performance, training time and model complexity of 1D CNN model with the latest foundation model, HuBERT-ECG. This is a preliminary step for further elucidation of the relationship between ECG and echocardiogram. Future studies are needed to determine the optimal CNN architecture that balances the target classifier performance and computational efficiency for ease of deployment with larger datasets.
Loading