Identification of Essential Proteins Based on Edge Clustering Coefficient

Identification of essential proteins is key to understanding the minimal requirements for cellular life and important for drug design. The rapid increase of available protein-protein interaction (PPI) data has made it possible to detect protein essentiality on network level. A series of centrality m...

Full description

Saved in:

Bibliographic Details
Published in	IEEE/ACM transactions on computational biology and bioinformatics Vol. 9; no. 4; pp. 1070 - 1080
Main Authors	Wang, Jianxin, Li, Min, Wang, Huan, Pan, Yi
Format	Journal Article
Language	English
Published	United States IEEE 01.07.2012 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Accuracy Bioinformatics centrality measures Cluster Analysis Computational Biology - methods edge clustering coefficient Electronics packaging Essential proteins Genes, Essential Protein Interaction Maps - physiology protein interaction network Proteins Reproducibility of Results Ribonucleic acid RNA Saccharomyces cerevisiae Proteins - chemistry Saccharomyces cerevisiae Proteins - classification Sensitivity Tin topology
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Identification of essential proteins is key to understanding the minimal requirements for cellular life and important for drug design. The rapid increase of available protein-protein interaction (PPI) data has made it possible to detect protein essentiality on network level. A series of centrality measures have been proposed to discover essential proteins based on network topology. However, most of them tended to focus only on the location of single protein, but ignored the relevance between interactions and protein essentiality. In this paper, a new centrality measure for identifying essential proteins based on edge clustering coefficient, named as NC, is proposed. Different from previous centrality measures, NC considers both the centrality of a node and the relationship between it and its neighbors. For each interaction in the network, we calculate its edge clustering coefficient. A node's essentiality is determined by the sum of the edge clustering coefficients of interactions connecting it and its neighbors. The new centrality measure NC takes into account the modular nature of protein essentiality. NC is applied to three different types of yeast protein-protein interaction networks, which are obtained from the DIP database, the MIPS database and the BioGRID database, respectively. The experimental results on the three different networks show that the number of essential proteins discovered by NC universally exceeds that discovered by the six other centrality measures: DC, BC, CC, SC, EC, and IC. Moreover, the essential proteins discovered by NC show significant cluster effect.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1545-5963 1557-9964
DOI:	10.1109/TCBB.2011.147