Electrocardiogram arrhythmia detection with novel signal processing and persistent homology-derived predictors
Many approaches to computer-aided electrocardiogram (ECG) arrhythmia detection have been performed, several of which combine persistent homology and machine learning. We present a novel ECG signal processing pipeline and method of constructing predictor variables for use in statistical models. Speci...
Saved in:
Published in | Data Science Vol. 7; no. 1; pp. 29 - 53 |
---|---|
Main Author | |
Format | Journal Article |
Language | English |
Published |
26.06.2024
|
Online Access | Get full text |
ISSN | 2451-8484 2451-8492 |
DOI | 10.3233/DS-240061 |
Cover
Loading…
Summary: | Many approaches to computer-aided electrocardiogram (ECG) arrhythmia detection have been performed, several of which combine persistent homology and machine learning. We present a novel ECG signal processing pipeline and method of constructing predictor variables for use in statistical models. Specifically, we introduce an isoelectric baseline to yield non-trivial topological features corresponding to the P, Q, S, and T-waves (if they exist) and utilize the N-most persistent 1-dimensional homological features and their corresponding area-minimal cycle representatives to construct predictor variables derived from the persistent homology of the ECG signal for some choice of N. The binary classification of (1) Atrial Fibrillation vs. Non-Atrial Fibrillation, (2) Arrhythmia vs. Normal Sinus Rhythm, and (3) Arrhythmias with Morphological Changes vs. Sinus Rhythm with Bradycardia and Tachycardia Treated as Non-Arrhythmia was performed using Logistic Regression, Linear Discriminant Analysis, Quadratic Discriminant Analysis, Naive Bayes, Random Forest, Gradient Boosted Decision Tree, K-Nearest Neighbors, and Support Vector Machine with a linear, radial, and polynomial kernel Models with stratified 5-fold cross validation. The Gradient Boosted Decision Tree Model attained the best results with a mean F1-score and mean Accuracy of [Formula: see text], [Formula: see text], and [Formula: see text] across the five folds for binary classifications of (1), (2), and (3), respectively. |
---|---|
ISSN: | 2451-8484 2451-8492 |
DOI: | 10.3233/DS-240061 |