Cell-type-specific co-expression inference from single cell RNA-sequencing data

Bibliographic Details
Published in bioRxiv: the preprint server for biology
Main Authors Su, Chang; Xu, Zichun; Shan, Xinning; Cai, Biao; Zhao, Hongyu; Zhang, Jingfei
Format Journal Article
Language English
Published United States 15.12.2022

Abstract The inference of gene co-expressions from microarray and RNA-sequencing data has led to rich insights into biological processes and disease mechanisms. However, the bulk samples analyzed in most studies are a mixture of different cell types. As a result, the inferred co-expressions are confounded by varying cell type compositions across samples and only offer an aggregated view of gene regulation that may be distinct across different cell types. Advances in single cell RNA-sequencing (scRNA-seq) technology have enabled the direct inference of co-expressions in specific cell types, facilitating our understanding of cell-type-specific biological functions. However, the large variations in sequencing depth and the measurement errors in scRNA-seq data present significant challenges in inferring cell-type-specific gene co-expressions, and these issues have not been adequately addressed by existing methods. We propose a statistical approach, CS-CORE, for estimating and testing cell-type-specific co-expressions, built on a general expression-measurement model that explicitly accounts for sequencing depth variations and measurement errors in the observed single cell data. Systematic evaluations show that most existing methods suffer from inflated false positive rates and biased co-expression estimates and clustering results, whereas CS-CORE has appropriate false positive control, unbiased co-expression estimates, good statistical power and satisfactory performance in downstream co-expression analysis. When applied to scRNA-seq data from postmortem brain samples from Alzheimer’s disease patients and controls and blood samples from COVID-19 patients and controls, CS-CORE identified cell-type-specific co-expressions and differential co-expressions that were more reproducible and/or more enriched for relevant biological pathways than those inferred from other methods.
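The abstract's point about sequencing depth can be illustrated with a small simulation. The sketch below is not the CS-CORE implementation, and the record does not spell out the paper's model; it assumes, purely for illustration, that the observed count of gene j in cell i is Poisson with mean s_i * z_ij, where s_i is the cell's sequencing depth and z_ij is the underlying expression level. All function names and parameter values are hypothetical. Two genes with independent underlying expression look correlated when raw counts are correlated naively, because both counts scale with the shared depth, while a simple moment-based estimate that models depth and Poisson noise stays close to zero.

```python
import math
import random


def poisson(lam, rng):
    """Draw one Poisson(lam) variate with Knuth's method (adequate for small lam)."""
    limit, k, p = math.exp(-lam), 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


def pearson(a, b):
    """Plain Pearson correlation of two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb)


def depth_aware_corr(x1, x2, depth):
    """Moment-based correlation of underlying expression (illustrative assumption).

    Under x_ij ~ Poisson(s_i * z_ij):
      E[x_ij]                      = s_i * mu_j
      E[(x_ij - s_i*mu_j)^2]       = s_i * mu_j + s_i^2 * var_j
      E[(x_i1 - ..)(x_i2 - ..)]    = s_i^2 * cov_12
    The variances and covariance are fitted by least squares on s_i^2;
    the common 1/sum(s^4) factor cancels in the correlation.
    """
    total_depth = sum(depth)
    mu1, mu2 = sum(x1) / total_depth, sum(x2) / total_depth
    cov = var1 = var2 = 0.0
    for a, b, s in zip(x1, x2, depth):
        e1, e2 = a - s * mu1, b - s * mu2
        cov += s * s * e1 * e2
        var1 += s * s * (e1 * e1 - s * mu1)  # subtract Poisson noise
        var2 += s * s * (e2 * e2 - s * mu2)
    return cov / math.sqrt(var1 * var2)


rng = random.Random(0)
n_cells = 5000
# Sequencing depth varies tenfold across cells (hypothetical range).
depth = [rng.randint(500, 5000) for _ in range(n_cells)]
# Independent underlying expression levels for two genes (true correlation 0).
z1 = [rng.gammavariate(4.0, 0.00125) for _ in range(n_cells)]
z2 = [rng.gammavariate(4.0, 0.00125) for _ in range(n_cells)]
x1 = [poisson(s * z, rng) for s, z in zip(depth, z1)]
x2 = [poisson(s * z, rng) for s, z in zip(depth, z2)]

naive = pearson(x1, x2)
corrected = depth_aware_corr(x1, x2, depth)
print(f"naive correlation of raw counts: {naive:.3f}")
print(f"depth-aware moment estimate:     {corrected:.3f}")
```

In this toy setting the naive estimate is a false positive driven entirely by depth variation, which is the kind of confounding the abstract says existing methods do not adequately address.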
Author_xml – sequence: 1
  givenname: Chang
  orcidid: 0000-0002-8704-1512
  surname: Su
  fullname: Su, Chang
– sequence: 2
  givenname: Zichun
  orcidid: 0000-0002-4001-0321
  surname: Xu
  fullname: Xu, Zichun
– sequence: 3
  givenname: Xinning
  orcidid: 0000-0001-6270-0094
  surname: Shan
  fullname: Shan, Xinning
– sequence: 4
  givenname: Biao
  orcidid: 0000-0001-8972-0204
  surname: Cai
  fullname: Cai, Biao
– sequence: 5
  givenname: Hongyu
  orcidid: 0000-0003-1195-9607
  surname: Zhao
  fullname: Zhao, Hongyu
– sequence: 6
  givenname: Jingfei
  orcidid: 0000-0001-9700-1103
  surname: Zhang
  fullname: Zhang, Jingfei
BackLink https://www.ncbi.nlm.nih.gov/pubmed/36561173 (View this record in MEDLINE/PubMed)
Genre Preprint
GrantInformation_xml – fundername: NIGMS NIH HHS
  grantid: R01 GM134005
– fundername: NIA NIH HHS
  grantid: R56 AG074015
IsPeerReviewed false
PMID 36561173