Preeclampsia Prediction Using Machine Learning and Polygenic Risk Scores From Clinical and Genetic Risk Factors in Early and Late Pregnancies

Preeclampsia, a pregnancy-specific condition associated with new-onset hypertension after 20-weeks gestation, is a leading cause of maternal and neonatal morbidity and mortality. Predictive tools to understand which individuals are most at risk are needed. We identified a cohort of N=1125 pregnant i...

Full description

Saved in:
Bibliographic Details
Published inHypertension (Dallas, Tex. 1979) Vol. 81; no. 2; pp. 264 - 272
Main Authors Kovacheva, Vesela P, Eberhard, Braden W, Cohen, Raphael Y, Maher, Matthew, Saxena, Richa, Gray, Kathryn J
Format Journal Article
LanguageEnglish
Published United States 01.02.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Preeclampsia, a pregnancy-specific condition associated with new-onset hypertension after 20-weeks gestation, is a leading cause of maternal and neonatal morbidity and mortality. Predictive tools to understand which individuals are most at risk are needed. We identified a cohort of N=1125 pregnant individuals who delivered between May 2015 and May 2022 at Mass General Brigham Hospitals with available electronic health record data and linked genetic data. Using clinical electronic health record data and systolic blood pressure polygenic risk scores derived from a large genome-wide association study, we developed machine learning (XGBoost) and logistic regression models to predict preeclampsia risk. Pregnant individuals with a systolic blood pressure polygenic risk score in the top quartile had higher blood pressures throughout pregnancy compared with patients within the lowest quartile systolic blood pressure polygenic risk score. In the first trimester, the most predictive model was XGBoost, with an area under the curve of 0.74. In late pregnancy, with data obtained up to the delivery admission, the best-performing model was XGBoost using clinical variables, which achieved an area under the curve of 0.91. Adding the systolic blood pressure polygenic risk score to the models did not improve the performance significantly based on De Long test comparing the area under the curve of models with and without the polygenic score. Integrating clinical factors into predictive models can inform personalized preeclampsia risk and achieve higher predictive power than the current practice. In the future, personalized tools can be implemented to identify high-risk patients for preventative therapies and timely intervention to improve adverse maternal and neonatal outcomes.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0194-911X
1524-4563
1524-4563
DOI:10.1161/HYPERTENSIONAHA.123.21053