Dimensionality reduction for visualizing high-dimensional biological data

High throughput technologies used in experimental biological sciences produce data with a vast number of variables at a rapid pace, making large volumes of high-dimensional data available. The exploratory analysis of such high-dimensional data can be aided by human interpretable low-dimensional visu...

Full description

Saved in:
Bibliographic Details
Published inBioSystems Vol. 220; p. 104749
Main Authors Malepathirana, Tamasha, Senanayake, Damith, Vidanaarachchi, Rajith, Gautam, Vini, Halgamuge, Saman
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.10.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:High throughput technologies used in experimental biological sciences produce data with a vast number of variables at a rapid pace, making large volumes of high-dimensional data available. The exploratory analysis of such high-dimensional data can be aided by human interpretable low-dimensional visualizations. This work investigates how both discrete and continuous structures in biological data can be captured using the recently proposed dimensionality reduction method SONG, and compares the results with commonly used methods UMAP and PHATE. Using simulated and real-world datasets, we observe that SONG produces insightful visualizations by preserving various patterns, including discrete clusters, continuums, and branching structures in all considered datasets. More importantly, for datasets containing both discrete and continuous structures, SONG performs better at preserving both the structures compared to UMAP and PHATE. Furthermore, our quantitative evaluation of the three methods using downstream analysis validates the on par quality of the SONG’s low-dimensional embeddings compared to the other methods.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0303-2647
1872-8324
DOI:10.1016/j.biosystems.2022.104749