PGAGP: Predicting pathogenic genes based on adaptive network embedding algorithm

The study of disease-gene associations is an important topic in the field of computational biology. The accumulation of massive amounts of biomedical data provides new possibilities for exploring potential relations between diseases and genes through computational strategy, but how to extract valuab...

Full description

Saved in:
Bibliographic Details
Published inFrontiers in genetics Vol. 13; p. 1087784
Main Authors Zhang, Yan, Xiang, Ju, Tang, Liang, Yang, Jialiang, Li, Jianming
Format Journal Article
LanguageEnglish
Published Switzerland Frontiers Media S.A 20.01.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The study of disease-gene associations is an important topic in the field of computational biology. The accumulation of massive amounts of biomedical data provides new possibilities for exploring potential relations between diseases and genes through computational strategy, but how to extract valuable information from the data to predict pathogenic genes accurately and rapidly is currently a challenging and meaningful task. Therefore, we present a novel computational method called PGAGP for inferring potential pathogenic genes based on an adaptive network embedding algorithm. The PGAGP algorithm is to first extract initial features of nodes from a heterogeneous network of diseases and genes efficiently and effectively by Gaussian random projection and then optimize the features of nodes by an adaptive refining process. These low-dimensional features are used to improve the disease-gene heterogenous network, and we apply network propagation to the improved heterogenous network to predict pathogenic genes more effectively. By a series of experiments, we study the effect of PGAGP's parameters and integrated strategies on predictive performance and confirm that PGAGP is better than the state-of-the-art algorithms. Case studies show that many of the predicted candidate genes for specific diseases have been implied to be related to these diseases by literature verification and enrichment analysis, which further verifies the effectiveness of PGAGP. Overall, this work provides a useful solution for mining disease-gene heterogeneous network to predict pathogenic genes more effectively.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Edited by: Leyi Wei, Shandong University, China
This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics
Reviewed by: Qiu Xiao, Hunan Normal University, China
Ping Luo, University Health Network, Canada
These authors have contributed equally to this work
ISSN:1664-8021
1664-8021
DOI:10.3389/fgene.2022.1087784