Detection of differentially abundant cell subpopulations in scRNA-seq data

Comprehensive and accurate comparisons of transcriptomic distributions of cells from samples taken from two different biological states, such as healthy versus diseased individuals, are an emerging challenge in single-cell RNA sequencing (scRNA-seq) analysis. Current methods for detecting differenti...

Full description

Saved in:

Bibliographic Details
Published in	Proceedings of the National Academy of Sciences - PNAS Vol. 118; no. 22; pp. 1 - 12
Main Authors	Zhao, Jun, Jaffe, Ariel, Li, Henry, Lindenbaum, Ofir, Sefik, Esen, Jackson, Ruaidhrí, Cheng, Xiuyuan, Flavell, Richard A., Kluger, Yuval
Format	Journal Article
Language	English
Published	United States National Academy of Sciences 01.06.2021
Subjects	Aging Biological Sciences Clustering Computational neuroscience COVID-19 Embryogenesis Embryonic growth stage Gene sequencing Immune checkpoint Multiscale analysis Phenotypes Subpopulations Transcriptomics RNA-seq local differential abundance single cell
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Comprehensive and accurate comparisons of transcriptomic distributions of cells from samples taken from two different biological states, such as healthy versus diseased individuals, are an emerging challenge in single-cell RNA sequencing (scRNA-seq) analysis. Current methods for detecting differentially abundant (DA) subpopulations between samples rely heavily on initial clustering of all cells in both samples. Often, this clustering step is inadequate since the DA subpopulations may not align with a clear cluster structure, and important differences between the two biological states can be missed. Here, we introduce DA-seq, a targeted approach for identifying DA subpopulations not restricted to clusters. DA-seq is a multiscale method that quantifies a local DA measure for each cell, which is computed from its k nearest neighboring cells across a range of k values. Based on this measure, DA-seq delineates contiguous significant DA subpopulations in the transcriptomic space. We apply DA-seq to several scRNA-seq datasets and highlight its improved ability to detect differences between distinct phenotypes in severe versus mildly ill COVID-19 patients, melanomas subjected to immune checkpoint therapy comparing responders to nonresponders, embryonic development at two time points, and young versus aging brain tissue. DA-seq enabled us to detect differences between these phenotypes. Importantly, we find that DA-seq not only recovers the DA cell types as discovered in the original studies but also reveals additional DA subpopulations that were not described before. Analysis of these subpopulations yields biological insights that would otherwise be undetected using conventional computational approaches.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 Contributed by Richard A. Flavell, April 6, 2021 (sent for review January 18, 2021; reviewed by Constantin F. Aliferis and Meromit Singer) Reviewers: C.F.A., University of Minnesota; and M.S., Dana-Farber Cancer Institute. 1J.Z. and A.J. contributed equally to this work. Author contributions: J.Z., A.J., X.C., R.A.F., and Y.K. designed research; J.Z. and A.J. performed research; H.L. and O.L. contributed new reagents/analytic tools; J.Z. and H.L. analyzed data; and J.Z., A.J., E.S., R.J., X.C., R.A.F., and Y.K. wrote the paper.
ISSN:	0027-8424 1091-6490 1091-6490
DOI:	10.1073/pnas.2100293118