Machine learning multi-omics analysis reveals cancer driver dysregulation in pan-cancer cell lines compared to primary tumors

Cancer cell lines have been widely used for decades to study biological processes driving cancer development, and to identify biomarkers of response to therapeutic agents. Advances in genomic sequencing have made possible large-scale genomic characterizations of collections of cancer cell lines and...

Full description

Saved in:
Bibliographic Details
Published inCommunications biology Vol. 5; no. 1; p. 1367
Main Authors Sanders, Lauren M., Chandra, Rahul, Zebarjadi, Navid, Beale, Holly C., Lyle, A. Geoffrey, Rodriguez, Analiz, Kephart, Ellen Towle, Pfeil, Jacob, Cheney, Allison, Learned, Katrina, Currie, Rob, Gitlin, Leonid, Vengerov, David, Haussler, David, Salama, Sofie R., Vaske, Olena M.
Format Journal Article
LanguageEnglish
Published London Nature Publishing Group UK 13.12.2022
Nature Publishing Group
Nature Portfolio
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Cancer cell lines have been widely used for decades to study biological processes driving cancer development, and to identify biomarkers of response to therapeutic agents. Advances in genomic sequencing have made possible large-scale genomic characterizations of collections of cancer cell lines and primary tumors, such as the Cancer Cell Line Encyclopedia (CCLE) and The Cancer Genome Atlas (TCGA). These studies allow for the first time a comprehensive evaluation of the comparability of cancer cell lines and primary tumors on the genomic and proteomic level. Here we employ bulk mRNA and micro-RNA sequencing data from thousands of samples in CCLE and TCGA, and proteomic data from partner studies in the MD Anderson Cell Line Project (MCLP) and The Cancer Proteome Atlas (TCPA), to characterize the extent to which cancer cell lines recapitulate tumors. We identify dysregulation of a long non-coding RNA and microRNA regulatory network in cancer cell lines, associated with differential expression between cell lines and primary tumors in four key cancer driver pathways: KRAS signaling, NFKB signaling, IL2/STAT5 signaling and TP53 signaling. Our results emphasize the necessity for careful interpretation of cancer cell line experiments, particularly with respect to therapeutic treatments targeting these important cancer pathways. Using a support vector machine learning approach and multi-omics data, dysregulation of key cancer driver pathways is revealed in cancer cell lines compared to primary tumors.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:2399-3642
2399-3642
DOI:10.1038/s42003-022-04075-4