Constructing diagnostic signature of serum microRNAs using machine learning for early pan-cancer detection

Background Cancer is a major public health concern and the second leading cause of death worldwide. Various studies have reported the use of serum microRNAs (miRNAs) as non-invasive biomarkers for cancer detection. However, large-scale pan-cancer studies based on serum miRNAs have been relatively sc...

Full description

Saved in:
Bibliographic Details
Published inDiscover. Oncology Vol. 15; no. 1; pp. 263 - 14
Main Authors Xu, Yuyan, Liao, Wei, Chen, Huanwei, Pan, Mingxin
Format Journal Article
LanguageEnglish
Published New York Springer US 04.07.2024
Springer Nature B.V
Springer
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Background Cancer is a major public health concern and the second leading cause of death worldwide. Various studies have reported the use of serum microRNAs (miRNAs) as non-invasive biomarkers for cancer detection. However, large-scale pan-cancer studies based on serum miRNAs have been relatively scarce. Methods An optimized machine learning workflow, combining least absolute shrinkage and selection operator (LASSO) analyses, recursive feature elimination (RFE), and fourteen kinds of machine learning algorithms, was use to screen out candidate miRNAs from 2540 serum miRNAs and constructed a potent diagnostic signature (Cancer-related Serum miRNA Signatures) for pan-cancer detection, based on a serum miRNA expression dataset of 38,223 samples. Result Cancer-related Serum miRNA Signatures performed well in pan-cancer detection with an area under curve (AUC) of 0.999, 94.51% sensitivity, and 99.49% specificity in the external validation cohort, and represented an acceptable diagnostic performance for identifying early-stage tumors. Furthermore, the ability of multi-classification of tumors by serum miRNAs in pancreatic, colorectal, and biliary tract cancers was lower than that in other cancers, which showed accuracies of 59%, 58.5%, and 28.9%, respectively, indicating that the difference in serum miRNA expression profiles among a small number of tumor subtypes was not as significant as that between cancer samples and non-cancer controls. Conclusion We have developed a serum miRNA signature using machine learning that may be a cost-effective risk tool for pan-cancer detection. Our findings will benefit not only the predictive diagnosis of cancer but also a preventive and more personalized screening plan.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:2730-6011
2730-6011
DOI:10.1007/s12672-024-01139-1