Survival analysis for pediatric heart transplant patients using a novel machine learning algorithm: A UNOS analysis
Published in | The Journal of heart and lung transplantation Vol. 42; no. 10; pp. 1341 - 1348 |
---|---|
Format | Journal Article |
Language | English |
Published | United States: Elsevier Inc, 01.10.2023 |
Summary: | Impact of pretransplantation risk factors on mortality in the first year after heart transplantation remains largely unknown. Using machine learning algorithms, we selected clinically relevant identifiers that could predict 1-year mortality after pediatric heart transplantation.
Data were obtained from the United Network for Organ Sharing Database for the years 2010-2020 for patients aged 0-17 years receiving their first heart transplant (N = 4150). Features were selected using subject experts and literature review. scikit-learn, scikit-survival, and TensorFlow were used. A train:test split of 70:30 was used. Repeated k-fold cross-validation was performed (5 repeats, k = 5 folds). Seven models were tested. Hyperparameter tuning was performed using Bayesian optimization, and the concordance index (C-index) was used for model assessment.
A C-index above 0.6 on the test data was considered acceptable for survival analysis models. The C-indices obtained were 0.60 (Cox proportional hazards), 0.61 (Cox with elastic net), 0.64 (gradient boosting), 0.64 (support vector machine), 0.68 (random forest), 0.66 (component gradient boosting), and 0.54 (survival trees). Most machine learning models improved on the traditional Cox proportional hazards model, with random forest performing best on the test set; survival trees (0.54) were the exception. Analysis of feature importance for the gradient boosting model found that the top 5 features were the most recent serum total bilirubin, the travel distance from the transplant center, the patient body mass index, the deceased donor terminal serum glutamic pyruvic transaminase/alanine transaminase (SGPT/ALT), and the donor PCO2.
Combination of machine learning and expert-based methodology of selecting predictors of survival for pediatric heart transplantation provides a reasonable prediction of 1- and 3-year survival outcomes. SHapley Additive exPlanations can be an effective tool for modeling and visualizing nonlinear interactions. |
---|---|
ISSN: | 1053-2498 1557-3117 |
DOI: | 10.1016/j.healun.2023.06.006 |
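
The summary reports model quality as a concordance index (C-index). As a rough illustration of what that number measures, the sketch below computes Harrell's C-index in plain Python on invented toy data. It is not the authors' code: the function name `harrell_c_index`, the toy values, and the simplified handling of tied survival times (tied times are treated as not comparable) are all assumptions made for this example. The study itself names scikit-survival, which provides its own implementation.

```python
def harrell_c_index(times, events, risk_scores):
    """Return Harrell's C-index for right-censored survival data.

    times       -- observed follow-up time per subject
    events      -- True if the event (death) was observed, False if censored
    risk_scores -- model output; a higher score means higher predicted risk

    Simplifying assumption for this sketch: a pair is comparable only when
    the subject with the strictly shorter time had an observed event.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored subject cannot be the earlier member of a pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0  # earlier event was given the higher risk
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5  # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs in the data")
    return concordant / comparable


# Hypothetical toy data: five subjects, two of them censored.
toy_times = [5, 8, 12, 3, 9]
toy_events = [True, False, True, True, False]
toy_risk = [0.7, 0.4, 0.2, 0.6, 0.8]

c_index = harrell_c_index(toy_times, toy_events, toy_risk)
print(f"C-index on toy data: {c_index:.3f}")
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which is why the summary treats values above 0.6 as acceptable and 0.54 as weak.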