Laboratory parameter‐based machine learning model for excluding non‐alcoholic fatty liver disease (NAFLD) in the general population

Summary Background Non‐alcoholic fatty liver disease (NAFLD) affects 20%‐40% of the general population in developed countries and is an increasingly important cause of hepatocellular carcinoma. Electronic medical records facilitate large‐scale epidemiological studies, existing NAFLD scores often req...

Full description

Saved in:
Bibliographic Details
Published inAlimentary pharmacology & therapeutics Vol. 46; no. 4; pp. 447 - 456
Main Authors Yip, T. C.‐F., Ma, A. J., Wong, V. W.‐S., Tse, Y.‐K., Chan, H. L.‐Y., Yuen, P.‐C., Wong, G. L.‐H.
Format Journal Article
LanguageEnglish
Published England Wiley Subscription Services, Inc 01.08.2017
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Summary Background Non‐alcoholic fatty liver disease (NAFLD) affects 20%‐40% of the general population in developed countries and is an increasingly important cause of hepatocellular carcinoma. Electronic medical records facilitate large‐scale epidemiological studies, existing NAFLD scores often require clinical and anthropometric parameters that may not be captured in those databases. Aim To develop and validate a laboratory parameter‐based machine learning model to detect NAFLD for the general population. Methods We randomly divided 922 subjects from a population screening study into training and validation groups; NAFLD was diagnosed by proton‐magnetic resonance spectroscopy. On the basis of machine learning from 23 routine clinical and laboratory parameters after elastic net regulation, we evaluated the logistic regression, ridge regression, AdaBoost and decision tree models. The areas under receiver‐operating characteristic curve (AUROC) of models in validation group were compared. Results Six predictors including alanine aminotransferase, high‐density lipoprotein cholesterol, triglyceride, haemoglobin A1c, white blood cell count and the presence of hypertension were selected. The NAFLD ridge score achieved AUROC of 0.87 (95% CI 0.83‐0.90) and 0.88 (0.84‐0.91) in the training and validation groups respectively. Using dual cut‐offs of 0.24 and 0.44, NAFLD ridge score achieved 92% (86%‐96%) sensitivity and 90% (86%‐93%) specificity with corresponding negative and positive predictive values of 96% (91%‐98%) and 69% (59%‐78%), and 87% of overall accuracy among 70% of classifiable subjects in the validation group; 30% of subjects remained indeterminate. Conclusions NAFLD ridge score is a simple and robust reference comparable to existing NAFLD scores to exclude NAFLD patients in epidemiological studies. Linked ContentThis article is linked to Gallacher et al and McPherson and Yip papers. To view these articles visit https://doi.org/10.1111/apt.14217 and https://doi.org/10.1111/apt.14234.
Bibliography:Funding information
This article is linked to Gallacher et al and McPherson and Yip papers. To view these articles visit
This study was funded in part by the direct grant of The Chinese University of Hong Kong (project reference number: 2015.1.033) to Vincent Wong.
and
https://doi.org/10.1111/apt.14217
https://doi.org/10.1111/apt.14234
Linked Content
.
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ObjectType-Undefined-3
ISSN:0269-2813
1365-2036
1365-2036
DOI:10.1111/apt.14172