Variable importance evaluation with personalized odds ratio for machine learning model interpretability with applications to electronic health records‐based mortality prediction

The interpretability of machine learning models, even though with an excellent prediction performance, remains a challenge in practical applications. The model interpretability and variable importance for well‐performed supervised machine learning models are investigated in this study. With the comm...

Full description

Saved in:

Bibliographic Details
Published in	Statistics in medicine Vol. 42; no. 6; pp. 761 - 780
Main Authors	Yu, Duo, Wu, Hulin
Format	Journal Article
Language	English
Published	Hoboken, USA John Wiley & Sons, Inc 15.03.2023 Wiley Subscription Services, Inc
Subjects	Electronic health records interpretable machine learning Machine learning predictive modeling variable importance variable importance electronic health records predictive modeling interpretable machine learning
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The interpretability of machine learning models, even though with an excellent prediction performance, remains a challenge in practical applications. The model interpretability and variable importance for well‐performed supervised machine learning models are investigated in this study. With the commonly accepted concept of odds ratio (OR), we propose a novel and computationally efficient Variable Importance evaluation framework based on the Personalized Odds Ratio (VIPOR). It is a model‐agnostic interpretation method that can be used to evaluate variable importance both locally and globally. Locally, the variable importance is quantified by the personalized odds ratio (POR), which can account for subject heterogeneity in machine learning. Globally, we utilize a hierarchical tree to group the predictors into five groups: completely positive, completely negative, positive dominated, negative dominated, and neutral groups. The relative importance of predictors within each group is ranked based on different statistics of PORs across subjects for different application purposes. For illustration, we apply the proposed VIPOR method to interpreting a multilayer perceptron (MLP) model, which aims to predict the mortality of subarachnoid hemorrhage (SAH) patients using real‐world electronic health records (EHR) data. We compare the important variables derived from MLP with other machine learning models, including tree‐based models and the L1‐regularized logistic regression model. The top importance variables are consistently identified by VIPOR across different prediction models. Comparisons with existing interpretation methods are also conducted and discussed based on publicly available data sets.
Bibliography:	Funding information Cancer Prevention and Research Institute of Texas, Grant/Award Number: RP170668; NIH Texas D‐CFAR, Grant/Award Number: P30 AI161943 ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ISSN:	0277-6715 1097-0258 1097-0258
DOI:	10.1002/sim.9642