Subject clustering by IF-PCA and several recent methods

Subject clustering (i.e., the use of measured features to cluster subjects, such as patients or cells, into multiple groups) is a problem of significant interest. In recent years, many approaches have been proposed, among which unsupervised deep learning (UDL) has received much attention. Two intere...

Full description

Saved in:
Bibliographic Details
Published inFrontiers in genetics Vol. 14; p. 1166404
Main Authors Chen, Dieyi, Jin, Jiashun, Ke, Zheng Tracy
Format Journal Article
LanguageEnglish
Published Switzerland Frontiers Media S.A 23.05.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Subject clustering (i.e., the use of measured features to cluster subjects, such as patients or cells, into multiple groups) is a problem of significant interest. In recent years, many approaches have been proposed, among which unsupervised deep learning (UDL) has received much attention. Two interesting questions are 1) how to combine the strengths of UDL and other approaches and 2) how these approaches compare to each other. We combine the variational auto-encoder (VAE), a popular UDL approach, with the recent idea of influential feature-principal component analysis (IF-PCA) and propose IF-VAE as a new method for subject clustering. We study IF-VAE and compare it with several other methods (including IF-PCA, VAE, Seurat, and SC3) on 10 gene microarray data sets and eight single-cell RNA-seq data sets. We find that IF-VAE shows significant improvement over VAE, but still underperforms compared to IF-PCA. We also find that IF-PCA is quite competitive, slightly outperforming Seurat and SC3 over the eight single-cell data sets. IF-PCA is conceptually simple and permits delicate analysis. We demonstrate that IF-PCA is capable of achieving phase transition in a rare/weak model. Comparatively, Seurat and SC3 are more complex and theoretically difficult to analyze (for these reasons, their optimality remains unclear).
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Reviewed by: Yuping Zhang, University of Connecticut, United States
Edited by: Jichun Xie, Duke University, United States
Wei Vivian Li, University of California, Riverside, United States
ISSN:1664-8021
1664-8021
DOI:10.3389/fgene.2023.1166404