A novel method for protein-protein interaction site prediction using phylogenetic substitution models
Protein–protein binding events mediate many critical biological functions in the cell. Typically, functionally important sites in proteins can be well identified by considering sequence conservation. However, protein–protein interaction sites exhibit higher sequence variation than other functional r...
Saved in:
Published in | Proteins, structure, function, and bioinformatics Vol. 80; no. 1; pp. 126 - 141 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
Hoboken
Wiley Subscription Services, Inc., A Wiley Company
01.01.2012
Wiley Subscription Services, Inc |
Subjects | |
Online Access | Get full text |
ISSN | 0887-3585 1097-0134 1097-0134 |
DOI | 10.1002/prot.23169 |
Cover
Summary: | Protein–protein binding events mediate many critical biological functions in the cell. Typically, functionally important sites in proteins can be well identified by considering sequence conservation. However, protein–protein interaction sites exhibit higher sequence variation than other functional regions, such as catalytic sites of enzymes. Consequently, the mutational behavior leading to weak sequence conservation poses significant challenges to the protein–protein interaction site prediction. Here, we present a phylogenetic framework to capture critical sequence variations that favor the selection of residues essential for protein–protein binding. Through the comprehensive analysis of diverse protein families, we show that protein binding interfaces exhibit distinct amino acid substitution as compared with other surface residues. On the basis of this analysis, we have developed a novel method, BindML, which utilizes the substitution models to predict protein–protein binding sites of protein with unknown interacting partners. BindML estimates the likelihood that a phylogenetic tree of a local surface region in a query protein structure follows the substitution patterns of protein binding interface and nonbinding surfaces. BindML is shown to perform well compared to alternative methods for protein binding interface prediction. The methodology developed in this study is very versatile in the sense that it can be generally applied for predicting other types of functional sites, such as DNA, RNA, and membrane binding sites in proteins. Proteins 2012. © 2011 Wiley Periodicals, Inc. |
---|---|
Bibliography: | National Institute of General Medical Sciences of the National Institutes of Health - No. R01 GM075004 National Science Foundation - No. DMS800568; No. IIS0915801; No. EF0850009 istex:F58BEECC0BA23C5CF4557438C891CAAE3B8F5595 ArticleID:PROT23169 ark:/67375/WNG-X5V026RF-7 ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 ObjectType-Article-2 ObjectType-Feature-1 content type line 23 |
ISSN: | 0887-3585 1097-0134 1097-0134 |
DOI: | 10.1002/prot.23169 |