Electrocardiogram arrhythmia detection with novel signal processing and persistent homology-derived predictors

Many approaches to computer-aided electrocardiogram (ECG) arrhythmia detection have been performed, several of which combine persistent homology and machine learning. We present a novel ECG signal processing pipeline and method of constructing predictor variables for use in statistical models. Speci...

Full description

Saved in:
Bibliographic Details
Published inData Science Vol. 7; no. 1; pp. 29 - 53
Main Author Dlugas, Hunter
Format Journal Article
LanguageEnglish
Published 26.06.2024
Online AccessGet full text
ISSN2451-8484
2451-8492
DOI10.3233/DS-240061

Cover

Loading…
More Information
Summary:Many approaches to computer-aided electrocardiogram (ECG) arrhythmia detection have been performed, several of which combine persistent homology and machine learning. We present a novel ECG signal processing pipeline and method of constructing predictor variables for use in statistical models. Specifically, we introduce an isoelectric baseline to yield non-trivial topological features corresponding to the P, Q, S, and T-waves (if they exist) and utilize the N-most persistent 1-dimensional homological features and their corresponding area-minimal cycle representatives to construct predictor variables derived from the persistent homology of the ECG signal for some choice of N. The binary classification of (1) Atrial Fibrillation vs. Non-Atrial Fibrillation, (2) Arrhythmia vs. Normal Sinus Rhythm, and (3) Arrhythmias with Morphological Changes vs. Sinus Rhythm with Bradycardia and Tachycardia Treated as Non-Arrhythmia was performed using Logistic Regression, Linear Discriminant Analysis, Quadratic Discriminant Analysis, Naive Bayes, Random Forest, Gradient Boosted Decision Tree, K-Nearest Neighbors, and Support Vector Machine with a linear, radial, and polynomial kernel Models with stratified 5-fold cross validation. The Gradient Boosted Decision Tree Model attained the best results with a mean F1-score and mean Accuracy of [Formula: see text], [Formula: see text], and [Formula: see text] across the five folds for binary classifications of (1), (2), and (3), respectively.
ISSN:2451-8484
2451-8492
DOI:10.3233/DS-240061