Statistical Analysis and Machine Learning Prediction of Disease Outcomes for COVID-19 and Pneumonia Patients

Bibliographic Details
Published in Frontiers in cellular and infection microbiology Vol. 12; p. 838749
Main Authors Zhao, Yu; Zhang, Rusen; Zhong, Yi; Wang, Jingjing; Weng, Zuquan; Luo, Heng; Chen, Cunrong
Format Journal Article
Language English
Published Switzerland Frontiers Media S.A. 19.04.2022
Summary: Coronavirus Disease 2019 (COVID-19) has spread worldwide and affected many people's lives. COVID-19 and other types of pneumonia share some characteristics and differ in others, which initially made it difficult for doctors to distinguish and understand them. Here we present a retrospective analysis of both COVID-19 and other types of pneumonia that combines COVID-19 clinical data with the eICU and MIMIC-III databases. Machine learning models, including logistic regression, random forest, XGBoost and deep learning neural networks, were developed to predict the severity of COVID-19 infections and the mortality of pneumonia patients in intensive care units (ICUs). Statistical analysis and feature interpretation, including analysis of two-level attention mechanisms over both temporal and non-temporal features, were used to understand the associations between clinical variables and disease outcomes. For the COVID-19 data, the XGBoost model achieved the best performance on the test set (AUROC = 1.000 and AUPRC = 0.833). On the MIMIC-III and eICU pneumonia datasets, our deep learning model (Bi-LSTM_Attn) was able to identify clinical variables associated with death of pneumonia patients (AUROC = 0.924 and AUPRC = 0.802 for a 24-hour observation window and a 12-hour prediction window). The results highlight clinical indicators, such as lymphocyte counts, that may help doctors predict disease progression and outcomes for both COVID-19 and other types of pneumonia.
Reviewed by: Prajoy Podder, Bangladesh University of Engineering and Technology, Bangladesh; Sina Ardabili, University of Mohaghegh Ardabili, Iran
Edited by: Christoph M. Friedrich, Dortmund University of Applied Sciences and Arts, Germany
These authors have contributed equally to this work
This article was submitted to Virus and Host, a section of the journal Frontiers in Cellular and Infection Microbiology
ISSN:2235-2988
DOI:10.3389/fcimb.2022.838749