Serum biomarker-based early detection of pancreatic ductal adenocarcinomas with ensemble learning

Earlier detection of pancreatic ductal adenocarcinoma (PDAC) is key to improving patient outcomes, as it is mostly detected at advanced stages which are associated with poor survival. Developing non-invasive blood tests for early detection would be an important breakthrough. The primary objective of...

Full description

Saved in:
Bibliographic Details
Published inCommunications medicine Vol. 3; no. 1; p. 10
Main Authors Nené, Nuno R, Ney, Alexander, Nazarenko, Tatiana, Blyuss, Oleg, Johnston, Harvey E, Whitwell, Harry J, Sedlak, Eva, Gentry-Maharaj, Aleksandra, Apostolidou, Sophia, Costello, Eithne, Greenhalf, William, Jacobs, Ian, Menon, Usha, Hsuan, Justin, Pereira, Stephen P, Zaikin, Alexey, Timms, John F
Format Journal Article
LanguageEnglish
Published England Springer Nature B.V 20.01.2023
Nature Publishing Group UK
Nature Portfolio
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Earlier detection of pancreatic ductal adenocarcinoma (PDAC) is key to improving patient outcomes, as it is mostly detected at advanced stages which are associated with poor survival. Developing non-invasive blood tests for early detection would be an important breakthrough. The primary objective of the work presented here is to use a dataset that is prospectively collected, to quantify a set of cancer-associated proteins and construct multi-marker models with the capacity to predict PDAC years before diagnosis. The data used is part of a nested case-control study within the UK Collaborative Trial of Ovarian Cancer Screening and is comprised of 218 samples, collected from a total of 143 post-menopausal women who were diagnosed with pancreatic cancer within 70 months after sample collection, and 249 matched non-cancer controls. We develop a stacked ensemble modelling technique to achieve robustness in predictions and, therefore, improve performance in newly collected datasets. Here we show that with ensemble learning we can predict PDAC status with an AUC of 0.91 (95% CI 0.75-1.0), sensitivity of 92% (95% CI 0.54-1.0) at 90% specificity, up to 1 year prior to diagnosis, and at an AUC of 0.85 (95% CI 0.74-0.93) up to 2 years prior to diagnosis (sensitivity of 61%, 95% CI 0.17-0.83, at 90% specificity). The ensemble modelling strategy explored here outperforms considerably biomarker combinations cited in the literature. Further developments in the selection of classifiers balancing performance and heterogeneity should further enhance the predictive capacity of the method.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:2730-664X
2730-664X
DOI:10.1038/s43856-023-00237-5