Salivary metabolomics with machine learning for colorectal cancer detection

As the worldwide prevalence of colorectal cancer (CRC) increases, it is vital to reduce its morbidity and mortality through early detection. Saliva‐based tests are an ideal noninvasive tool for CRC detection. Here, we explored and validated salivary biomarkers to distinguish patients with CRC from t...

Full description

Saved in:
Bibliographic Details
Published inCancer science Vol. 113; no. 9; pp. 3234 - 3243
Main Authors Kuwabara, Hiroshi, Katsumata, Kenji, Iwabuchi, Atsuhiro, Udo, Ryutaro, Tago, Tomoya, Kasahara, Kenta, Mazaki, Junichi, Enomoto, Masanobu, Ishizaki, Tetsuo, Soya, Ryoko, Kaneko, Miku, Ota, Sana, Enomoto, Ayame, Soga, Tomoyoshi, Tomita, Masaru, Sunamura, Makoto, Tsuchida, Akihiko, Sugimoto, Masahiro, Nagakawa, Yuichi
Format Journal Article
LanguageEnglish
Published Tokyo John Wiley & Sons, Inc 01.09.2022
John Wiley and Sons Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:As the worldwide prevalence of colorectal cancer (CRC) increases, it is vital to reduce its morbidity and mortality through early detection. Saliva‐based tests are an ideal noninvasive tool for CRC detection. Here, we explored and validated salivary biomarkers to distinguish patients with CRC from those with adenoma (AD) and healthy controls (HC). Saliva samples were collected from patients with CRC, AD, and HC. Untargeted salivary hydrophilic metabolite profiling was conducted using capillary electrophoresis–mass spectrometry and liquid chromatography–mass spectrometry. An alternative decision tree (ADTree)‐based machine learning (ML) method was used to assess the discrimination abilities of the quantified metabolites. A total of 2602 unstimulated saliva samples were collected from subjects with CRC (n = 235), AD (n = 50), and HC (n = 2317). Data were randomly divided into training (n = 1301) and validation datasets (n = 1301). The clustering analysis showed a clear consistency of aberrant metabolites between the two groups. The ADTree model was optimized through cross‐validation (CV) using the training dataset, and the developed model was validated using the validation dataset. The model discriminating CRC + AD from HC showed area under the receiver‐operating characteristic curves (AUC) of 0.860 (95% confidence interval [CI]: 0.828‐0.891) for CV and 0.870 (95% CI: 0.837‐0.903) for the validation dataset. The other model discriminating CRC from AD + HC showed an AUC of 0.879 (95% CI: 0.851‐0.907) and 0.870 (95% CI: 0.838‐0.902), respectively. Salivary metabolomics combined with ML demonstrated high accuracy and versatility in detecting CRC. Saliva‐based colorectal cancer tests have been developed. Machine learning and metabolomics were used.
Bibliography:Funding information
This work was supported by JSPS KAKENHI (Grant Numbers 26462027, 15K08751, 16H05408, 16K10554, 20H05743, and 21K07228) and research grants from the Yamagata Prefectural Government and the City of Tsuruoka.
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1347-9032
1349-7006
1349-7006
DOI:10.1111/cas.15472