Visualizing structure and transitions in high-dimensional biological data

The high-dimensional data created by high-throughput technologies require visualization tools that reveal data structure and patterns in an intuitive form. We present PHATE, a visualization method that captures both local and global nonlinear structure using an information-geometric distance between...

Full description

Saved in:
Bibliographic Details
Published inNature biotechnology Vol. 37; no. 12; pp. 1482 - 1492
Main Authors Moon, Kevin R., van Dijk, David, Wang, Zheng, Gigante, Scott, Burkhardt, Daniel B., Chen, William S., Yim, Kristina, Elzen, Antonia van den, Hirn, Matthew J., Coifman, Ronald R., Ivanova, Natalia B., Wolf, Guy, Krishnaswamy, Smita
Format Journal Article
LanguageEnglish
Published New York Nature Publishing Group US 01.12.2019
Nature Publishing Group
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The high-dimensional data created by high-throughput technologies require visualization tools that reveal data structure and patterns in an intuitive form. We present PHATE, a visualization method that captures both local and global nonlinear structure using an information-geometric distance between data points. We compare PHATE to other tools on a variety of artificial and biological datasets, and find that it consistently preserves a range of patterns in data, including continual progressions, branches and clusters, better than other tools. We define a manifold preservation metric, which we call denoised embedding manifold preservation (DEMaP), and show that PHATE produces lower-dimensional embeddings that are quantitatively better denoised as compared to existing visualization methods. An analysis of a newly generated single-cell RNA sequencing dataset on human germ-layer differentiation demonstrates how PHATE reveals unique biological insight into the main developmental branches, including identification of three previously undescribed subpopulations. We also show that PHATE is applicable to a wide variety of data types, including mass cytometry, single-cell RNA sequencing, Hi-C and gut microbiome data. PHATE, a new data visualization tool, better preserves patterns in high-dimensional data after dimensionality reduction.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
These authors contributed equally.
KM, SK, GW, and DD envisioned the project. KM, DD, SG, and GW implemented the method. KM, DD, SG, SK, and NI performed the analyses. KM, SK, GW, and NI wrote the paper. DD, SG, and DB assisted in writing. DB, WC, and KY assisted in the analysis. KM, GW, MH, and RC developed the mathematical foundations of the method. ZW, AE, and NI were responsible for data acquisition and processing.
Author Contributions
ISSN:1087-0156
1546-1696
DOI:10.1038/s41587-019-0336-3