Constructing diagnostic signature of serum microRNAs using machine learning for early pan-cancer detection
Background Cancer is a major public health concern and the second leading cause of death worldwide. Various studies have reported the use of serum microRNAs (miRNAs) as non-invasive biomarkers for cancer detection. However, large-scale pan-cancer studies based on serum miRNAs have been relatively sc...
Saved in:
Published in | Discover. Oncology Vol. 15; no. 1; pp. 263 - 14 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
New York
Springer US
04.07.2024
Springer Nature B.V Springer |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Background
Cancer is a major public health concern and the second leading cause of death worldwide. Various studies have reported the use of serum microRNAs (miRNAs) as non-invasive biomarkers for cancer detection. However, large-scale pan-cancer studies based on serum miRNAs have been relatively scarce.
Methods
An optimized machine learning workflow, combining least absolute shrinkage and selection operator (LASSO) analyses, recursive feature elimination (RFE), and fourteen kinds of machine learning algorithms, was use to screen out candidate miRNAs from 2540 serum miRNAs and constructed a potent diagnostic signature (Cancer-related Serum miRNA Signatures) for pan-cancer detection, based on a serum miRNA expression dataset of 38,223 samples.
Result
Cancer-related Serum miRNA Signatures performed well in pan-cancer detection with an area under curve (AUC) of 0.999, 94.51% sensitivity, and 99.49% specificity in the external validation cohort, and represented an acceptable diagnostic performance for identifying early-stage tumors. Furthermore, the ability of multi-classification of tumors by serum miRNAs in pancreatic, colorectal, and biliary tract cancers was lower than that in other cancers, which showed accuracies of 59%, 58.5%, and 28.9%, respectively, indicating that the difference in serum miRNA expression profiles among a small number of tumor subtypes was not as significant as that between cancer samples and non-cancer controls.
Conclusion
We have developed a serum miRNA signature using machine learning that may be a cost-effective risk tool for pan-cancer detection. Our findings will benefit not only the predictive diagnosis of cancer but also a preventive and more personalized screening plan. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 2730-6011 2730-6011 |
DOI: | 10.1007/s12672-024-01139-1 |