Cell-type-specific co-expression inference from single cell RNA-sequencing data
The inference of gene co-expressions from microarray and RNA-sequencing data has led to rich insights on biological processes and disease mechanisms. However, the bulk samples analyzed in most studies are a mixture of different cell types. As a result, the inferred co-expressions are confounded by v...
Saved in:
Published in | bioRxiv : the preprint server for biology |
---|---|
Main Authors | , , , , , |
Format | Journal Article |
Language | English |
Published |
United States
15.12.2022
|
Online Access | Get more information |
Cover
Loading…
Abstract | The inference of gene co-expressions from microarray and RNA-sequencing data has led to rich insights on biological processes and disease mechanisms. However, the bulk samples analyzed in most studies are a mixture of different cell types. As a result, the inferred co-expressions are confounded by varying cell type compositions across samples and only offer an aggregated view of gene regulations that may be distinct across different cell types. The advancement of single cell RNA-sequencing (scRNA-seq) technology has enabled the direct inference of co-expressions in specific cell types, facilitating our understanding of cell-type-specific biological functions. However, the high sequencing depth variations and measurement errors in scRNA-seq data present significant challenges in inferring cell-type-specific gene co-expressions, and these issues have not been adequately addressed in the existing methods. We propose a statistical approach, CS-CORE, for estimating and testing cell-type-specific co-expressions, built on a general expression-measurement model that explicitly accounts for sequencing depth variations and measurement errors in the observed single cell data. Systematic evaluations show that most existing methods suffer from inflated false positives and biased co-expression estimates and clustering analysis, whereas CS-CORE has appropriate false positive control, unbiased co-expression estimates, good statistical power and satisfactory performance in downstream co-expression analysis. When applied to analyze scRNA-seq data from postmortem brain samples from Alzheimer’s disease patients and controls and blood samples from COVID-19 patients and controls, CS-CORE identified cell-type-specific co-expressions and differential co-expressions that were more reproducible and/or more enriched for relevant biological pathways than those inferred from other methods. |
---|---|
AbstractList | The inference of gene co-expressions from microarray and RNA-sequencing data has led to rich insights on biological processes and disease mechanisms. However, the bulk samples analyzed in most studies are a mixture of different cell types. As a result, the inferred co-expressions are confounded by varying cell type compositions across samples and only offer an aggregated view of gene regulations that may be distinct across different cell types. The advancement of single cell RNA-sequencing (scRNA-seq) technology has enabled the direct inference of co-expressions in specific cell types, facilitating our understanding of cell-type-specific biological functions. However, the high sequencing depth variations and measurement errors in scRNA-seq data present significant challenges in inferring cell-type-specific gene co-expressions, and these issues have not been adequately addressed in the existing methods. We propose a statistical approach, CS-CORE, for estimating and testing cell-type-specific co-expressions, built on a general expression-measurement model that explicitly accounts for sequencing depth variations and measurement errors in the observed single cell data. Systematic evaluations show that most existing methods suffer from inflated false positives and biased co-expression estimates and clustering analysis, whereas CS-CORE has appropriate false positive control, unbiased co-expression estimates, good statistical power and satisfactory performance in downstream co-expression analysis. When applied to analyze scRNA-seq data from postmortem brain samples from Alzheimer’s disease patients and controls and blood samples from COVID-19 patients and controls, CS-CORE identified cell-type-specific co-expressions and differential co-expressions that were more reproducible and/or more enriched for relevant biological pathways than those inferred from other methods. |
Author | Shan, Xinning Xu, Zichun Cai, Biao Zhang, Jingfei Su, Chang Zhao, Hongyu |
Author_xml | – sequence: 1 givenname: Chang orcidid: 0000-0002-8704-1512 surname: Su fullname: Su, Chang – sequence: 2 givenname: Zichun orcidid: 0000-0002-4001-0321 surname: Xu fullname: Xu, Zichun – sequence: 3 givenname: Xinning orcidid: 0000-0001-6270-0094 surname: Shan fullname: Shan, Xinning – sequence: 4 givenname: Biao orcidid: 0000-0001-8972-0204 surname: Cai fullname: Cai, Biao – sequence: 5 givenname: Hongyu orcidid: 0000-0003-1195-9607 surname: Zhao fullname: Zhao, Hongyu – sequence: 6 givenname: Jingfei orcidid: 0000-0001-9700-1103 surname: Zhang fullname: Zhang, Jingfei |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/36561173$$D View this record in MEDLINE/PubMed |
BookMark | eNqFjssKwjAURLNQfP-C3B_IopbWtRTFlYK4LzGdSCAvkxbs35uFrl0NzJkDs2QT5x1mbF7WVV0U-3LBrg2M4f0YwFOA1EpLkp7jHSJS0t6RdgoRToJU9JaSdk8Dklmj2-XAE15DprmlTvRizaZKmITNN1dsezremzMPw8Oia0PUVsSx_T3Y_R18AFyoORE |
ContentType | Journal Article |
DBID | NPM |
DatabaseName | PubMed |
DatabaseTitle | PubMed |
DatabaseTitleList | PubMed |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database |
DeliveryMethod | no_fulltext_linktorsrc |
ExternalDocumentID | 36561173 |
Genre | Preprint |
GrantInformation_xml | – fundername: NIGMS NIH HHS grantid: R01 GM134005 – fundername: NIA NIH HHS grantid: R56 AG074015 |
GroupedDBID | NPM |
ID | FETCH-pubmed_primary_365611732 |
IngestDate | Wed Aug 16 02:25:38 EDT 2023 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-pubmed_primary_365611732 |
ORCID | 0000-0001-9700-1103 0000-0002-4001-0321 0000-0001-6270-0094 0000-0001-8972-0204 0000-0003-1195-9607 0000-0002-8704-1512 |
PMID | 36561173 |
ParticipantIDs | pubmed_primary_36561173 |
PublicationCentury | 2000 |
PublicationDate | 2022-Dec-15 |
PublicationDateYYYYMMDD | 2022-12-15 |
PublicationDate_xml | – month: 12 year: 2022 text: 2022-Dec-15 day: 15 |
PublicationDecade | 2020 |
PublicationPlace | United States |
PublicationPlace_xml | – name: United States |
PublicationTitle | bioRxiv : the preprint server for biology |
PublicationTitleAlternate | bioRxiv |
PublicationYear | 2022 |
Score | 3.7280138 |
Snippet | The inference of gene co-expressions from microarray and RNA-sequencing data has led to rich insights on biological processes and disease mechanisms. However,... |
SourceID | pubmed |
SourceType | Index Database |
Title | Cell-type-specific co-expression inference from single cell RNA-sequencing data |
URI | https://www.ncbi.nlm.nih.gov/pubmed/36561173 |
hasFullText | |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1bS8MwFD44BdmLKN4vIw--lYjburV7nEMZwqZsE4ovY20TFtjaIZsMf73nJG130Yn6EsoJhHC-pDn5ci4A13iqyzAMbS6rNYfbTig5HsIuD_AWJ9AguZU6gWmrXW2-2I9exVuU-9PRJVP_Jvj4Nq7kP6iiDHGlKNk_IJsNigL8RnyxRYSx_RXGDTEacSJROQVMktOPFcRczBPn1ki7Wpk0sjqMhHiBkbCIrLc67TpP_KiJLUhi1DJD1VdxZ67erdTvY0LZL1U0tYjFFW_Gy1OtcPLdWfJ-nxyGKPK06FUFw1m2CLtDw7l6SldLWjyCaL-COzWIl5mIki6KYmIxl3Q9GWtll9FOLBZNlZKfe9fSXaddOcg5Lv2y2s-tPOym4jXrX1sBvX3YS8x3VjdYHMCWiA7h6SsObAUHluHACAdmcGCEA1vFgREOR1B4uO81mtzMoT8xGUH66exKx7AdxZE4BVYuikHFCV1Rlbbt-OVaKCTeTYNKycatUXPP4GTDIOcbey4gv9D7JexIXMfiCo2iqV_QivoEM7sY8g |
link.rule.ids | 786 |
linkProvider | National Library of Medicine |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Cell-type-specific+co-expression+inference+from+single+cell+RNA-sequencing+data&rft.jtitle=bioRxiv+%3A+the+preprint+server+for+biology&rft.au=Su%2C+Chang&rft.au=Xu%2C+Zichun&rft.au=Shan%2C+Xinning&rft.au=Cai%2C+Biao&rft.date=2022-12-15&rft_id=info%3Apmid%2F36561173&rft_id=info%3Apmid%2F36561173&rft.externalDocID=36561173 |