82 A Comparison of Machine Learning and Logistic Regression Performance for the Prediction of In-Hospital Mortality Among Acute Biliary Pancreatitis Patients

INTRODUCTION:For population databases, traditional multivariate analysis is accepted standard for predictive modelling of mortality in acute biliary pancreatitis (ABP). Machine Learning (ML) and artificial intelligence (AI) are novel methodologies to analyze databases. The gradient boosting machine...

Full description

Saved in:

Bibliographic Details
Published in	The American journal of gastroenterology Vol. 114; no. 1; pp. S48 - S49
Main Authors	Luthra, Anjuli K., Porter, Kyle, Hinton, Alice, Krishna, Somashekar G.
Format	Journal Article
Language	English
Published	New York Wolters Kluwer Health Medical Research, Lippincott Williams & Wilkins 01.10.2019
Subjects	Algorithms Artificial intelligence Machine learning Mortality Pancreatitis Regression analysis
Online Access	Get full text

Cover

Loading…

More Information
Summary:	INTRODUCTION:For population databases, traditional multivariate analysis is accepted standard for predictive modelling of mortality in acute biliary pancreatitis (ABP). Machine Learning (ML) and artificial intelligence (AI) are novel methodologies to analyze databases. The gradient boosting machine (GBM) is a tree-based ML approach. Tree-based algorithms partition the data based on predictors in an attempt to accurately classify the outcome. The gradient boosting approach employs successive trees to improve upon misclassifications of the earlier trees. Alternatively, logistic regression is a statistical approach, in which predicted probabilities are generated based on parameters estimated by maximum likelihood estimation. With the advent of universal electronic medical records, patient databases are more complex, large, and interlinked to multiple institutions. Hence, we sought to compare traditional statistical methods and ML in building prediction models for mortality in ABP.METHODS:Using the Nationwide Readmission Database (2010–2014), we identified all patients (age > 18 years) with index admissions for ABP. This data was randomly divided into a training set (70%) and a test set (30%), stratified by outcome in order to get similar event rates in both sets. The same training and test sets were used for GBM learning and logistic regression. The area under the ROC curve (AUROC) was calculated for both GBM and logistic regression modelling for predicting mortality in ABP.RESULTS:A total of 97,027 patients were hospitalized for ABP from 2010–2014 in the US. The in-hospital mortality rate was 0.97% (n = 944). Using the same set of predictor variables (demographic, hospital, etiological, comorbidities, severity of acute pancreatitis (AP), procedures, and duration of stay), ML and logistic regression performed similarly in predicting mortality with GBM achieving an AUROC of 0.955 and logistic regression AUROC 0.948 (Figure 1). Key predictors of mortality for AP using logistic regression included: Severe AP, sepsis, increasing age, and failure to perform cholecystectomy. Foremost predictors using GBM included length of stay, age, severe AP and income quartile.CONCLUSION:Notwithstanding emergence of novel AI methodologies, traditional logistic regression is comparable to novel ML algorithms in the predictive modelling of hospital outcomes in ABP and can continue to be utilized in large population databases.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0002-9270 1572-0241
DOI:	10.14309/01.ajg.0000589860.52730.1c