Evaluating Binary Classifiers for Cardiovascular Disease Prediction: Enhancing Early Diagnostic Capabilities

Cardiovascular disease (CVD) is a significant global health concern and the leading cause of death in many countries. Early detection and diagnosis of CVD can significantly reduce the risk of complications and mortality. Machine learning methods, particularly classification algorithms, have demonstr...

Full description

Saved in:

Bibliographic Details
Published in	Journal of cardiovascular development and disease Vol. 11; no. 12; p. 396
Main Authors	Iacobescu, Paul, Marina, Virginia, Anghel, Catalin, Anghele, Aurelian-Dumitrache
Format	Journal Article
Language	English
Published	Switzerland MDPI AG 01.12.2024 MDPI
Subjects	Accuracy Algorithms artificial intelligence artificial intelligence in medical diagnosis Blood vessels Cardiovascular disease Cardiovascular diseases Data collection Datasets Disease prevention Exercise Health care reform Health risks Heart Machine learning Magnetic resonance imaging Medical research Medicine, Experimental Mortality Neural networks Optimization Regression analysis Review Romania Support vector machines Taiwan World health Romania Taiwan artificial intelligence in medical diagnosis cardiovascular diseases machine learning artificial intelligence
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Cardiovascular disease (CVD) is a significant global health concern and the leading cause of death in many countries. Early detection and diagnosis of CVD can significantly reduce the risk of complications and mortality. Machine learning methods, particularly classification algorithms, have demonstrated their potential to accurately predict the risk of cardiovascular disease (CVD) by analyzing patient data. This study evaluates seven binary classification algorithms, including Random Forests, Logistic Regression, Naive Bayes, K-Nearest Neighbors (kNN), Support Vector Machines, Gradient Boosting, and Artificial Neural Networks, to understand their effectiveness in predicting CVD. Advanced preprocessing techniques, such as SMOTE–ENN for addressing class imbalance and hyperparameter optimization through Grid Search Cross-Validation, were applied to enhance the reliability and performance of these models. Standard evaluation metrics, including accuracy, precision, recall, F1-score, and Area Under the Receiver Operating Characteristic Curve (ROC-AUC), were used to assess predictive capabilities. The results show that kNN achieved the highest accuracy (99%) and AUC (0.99), surpassing traditional models like Logistic Regression and Gradient Boosting. The study examines the challenges encountered when working with datasets related to cardiovascular diseases, such as class imbalance and feature selection. It demonstrates how addressing these issues enhances the reliability and applicability of predictive models. These findings emphasize the potential of kNN as a reliable tool for early CVD prediction, offering significant improvements over previous studies. This research highlights the value of advanced machine learning techniques in healthcare, addressing key challenges and laying a foundation for future studies aimed at improving predictive models for CVD prevention.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 ObjectType-Review-3 content type line 23
ISSN:	2308-3425 2308-3425
DOI:	10.3390/jcdd11120396