Zero-preserving imputation of single-cell RNA-seq data

A key challenge in analyzing single cell RNA-sequencing data is the large number of false zeros, where genes actually expressed in a given cell are incorrectly measured as unexpressed. We present a method based on low-rank matrix approximation which imputes these values while preserving biologically...

Full description

Saved in:
Bibliographic Details
Published inNature communications Vol. 13; no. 1; p. 192
Main Authors Linderman, George C, Zhao, Jun, Roulis, Manolis, Bielecki, Piotr, Flavell, Richard A, Nadler, Boaz, Kluger, Yuval
Format Journal Article
LanguageEnglish
Published England Nature Publishing Group 11.01.2022
Nature Publishing Group UK
Nature Portfolio
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A key challenge in analyzing single cell RNA-sequencing data is the large number of false zeros, where genes actually expressed in a given cell are incorrectly measured as unexpressed. We present a method based on low-rank matrix approximation which imputes these values while preserving biologically non-expressed genes (true biological zeros) at zero expression levels. We provide theoretical justification for this denoising approach and demonstrate its advantages relative to other methods on simulated and biological datasets.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:2041-1723
2041-1723
DOI:10.1038/s41467-021-27729-z