Survival analysis for pediatric heart transplant patients using a novel machine learning algorithm: A UNOS analysis

Impact of pretransplantation risk factors on mortality in the first year after heart transplantation remains largely unknown. Using machine learning algorithms, we selected clinically relevant identifiers that could predict 1-year mortality after pediatric heart transplantation. Data were obtained f...

Full description

Saved in:
Bibliographic Details
Published inThe Journal of heart and lung transplantation Vol. 42; no. 10; pp. 1341 - 1348
Main Authors Ashfaq, Awais, Gray, Geoffrey M., Carapelluci, Jennifer, Amankwah, Ernest K., Rehman, Mohamed, Puchalski, Michael, Smith, Andrew, Quintessenza, James A., Laks, Jessica, Ahumada, Luis M., Asante-Korang, Alfred
Format Journal Article
LanguageEnglish
Published United States Elsevier Inc 01.10.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Impact of pretransplantation risk factors on mortality in the first year after heart transplantation remains largely unknown. Using machine learning algorithms, we selected clinically relevant identifiers that could predict 1-year mortality after pediatric heart transplantation. Data were obtained from the United Network for Organ Sharing Database for years 2010-2020 for patients 0-17 years receiving their first heart transplant (N = 4150). Features were selected using subject experts and literature review. Scikit-Learn, Scikit-Survival, and Tensorflow were used. A train:test split of 70:30 was used. N-repeated k-fold validation was performed (N = 5, k = 5). Seven models were tested, Hyperparameter tuning performed using Bayesian optimization and the concordance index (C-index) was used for model assessment. A C-index above 0.6 for test data was considered acceptable for survival analysis models. C-indices obtained were 0.60 (Cox proportional hazards), 0.61 (Cox with elastic net), 0.64 (gradient boosting), 0.64 (support vector machine), 0.68 (random forest), 0.66 (component gradient boosting), and 0.54 (survival trees). Machine learning models show an improvement over the traditional Cox proportional hazards model, with random forest performing the best on the test set. Analysis of the feature importance for the gradient boosted model found that the top 5 features were the most recent serum total bilirubin, the travel distance from the transplant center, the patient body mass index, the deceased donor terminal Serum glutamic pyruvic transaminase/Alanine transaminase (SGPT/ALT), and the donor PCO2. Combination of machine learning and expert-based methodology of selecting predictors of survival for pediatric heart transplantation provides a reasonable prediction of 1- and 3-year survival outcomes. SHapley Additive exPlanations can be an effective tool for modeling and visualizing nonlinear interactions.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1053-2498
1557-3117
DOI:10.1016/j.healun.2023.06.006