Predictive modeling of 30-day readmission risk of diabetes patients by logistic regression, artificial neural network, and EasyEnsemble

Objective: To determine the most influential data features and to develop machine learning approaches that best predict hospital readmissions among patients with diabetes. Methods: In this retrospective cohort study, we surveyed patient statistics and performed feature analysis to identify the most...

Full description

Saved in:
Bibliographic Details
Published inAsian Pacific journal of tropical medicine Vol. 14; no. 9; pp. 417 - 428
Main Authors Xiang, Xiayu, Liu, Chuanyi, Zhang, Yanchun, Xiang, Wei, Fang, Binxing
Format Journal Article
LanguageEnglish
Published Wolters Kluwer India Pvt. Ltd 01.09.2021
Wolters Kluwer Medknow Publications
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Objective: To determine the most influential data features and to develop machine learning approaches that best predict hospital readmissions among patients with diabetes. Methods: In this retrospective cohort study, we surveyed patient statistics and performed feature analysis to identify the most influential data features associated with readmissions. Classification of all-cause, 30-day readmission outcomes were modeled using logistic regression, artificial neural network, and EasyEnsemble. F1 statistic, sensitivity, and positive predictive value were used to evaluate the model performance. Results: We identified 14 most influential data features (4 numeric features and 10 categorical features) and evaluated 3 machine learning models with numerous sampling methods (oversampling, undersampling, and hybrid techniques). The deep learning model offered no improvement over traditional models (logistic regression and EasyEnsemble) for predicting readmission, whereas the other two algorithms led to much smaller differences between the training and testing datasets. Conclusions: Machine learning approaches to record electronic health data offer a promising method for improving readmission prediction in patients with diabetes. But more work is needed to construct datasets with more clinical variables beyond the standard risk factors and to fine-tune and optimize machine learning models.
ISSN:1995-7645
2352-4146
2352-4146
DOI:10.4103/1995-7645.326254