A novel method for protein-protein interaction site prediction using phylogenetic substitution models

Protein–protein binding events mediate many critical biological functions in the cell. Typically, functionally important sites in proteins can be well identified by considering sequence conservation. However, protein–protein interaction sites exhibit higher sequence variation than other functional r...

Full description

Saved in:
Bibliographic Details
Published inProteins, structure, function, and bioinformatics Vol. 80; no. 1; pp. 126 - 141
Main Authors La, David, Kihara, Daisuke
Format Journal Article
LanguageEnglish
Published Hoboken Wiley Subscription Services, Inc., A Wiley Company 01.01.2012
Wiley Subscription Services, Inc
Subjects
Online AccessGet full text
ISSN0887-3585
1097-0134
1097-0134
DOI10.1002/prot.23169

Cover

More Information
Summary:Protein–protein binding events mediate many critical biological functions in the cell. Typically, functionally important sites in proteins can be well identified by considering sequence conservation. However, protein–protein interaction sites exhibit higher sequence variation than other functional regions, such as catalytic sites of enzymes. Consequently, the mutational behavior leading to weak sequence conservation poses significant challenges to the protein–protein interaction site prediction. Here, we present a phylogenetic framework to capture critical sequence variations that favor the selection of residues essential for protein–protein binding. Through the comprehensive analysis of diverse protein families, we show that protein binding interfaces exhibit distinct amino acid substitution as compared with other surface residues. On the basis of this analysis, we have developed a novel method, BindML, which utilizes the substitution models to predict protein–protein binding sites of protein with unknown interacting partners. BindML estimates the likelihood that a phylogenetic tree of a local surface region in a query protein structure follows the substitution patterns of protein binding interface and nonbinding surfaces. BindML is shown to perform well compared to alternative methods for protein binding interface prediction. The methodology developed in this study is very versatile in the sense that it can be generally applied for predicting other types of functional sites, such as DNA, RNA, and membrane binding sites in proteins. Proteins 2012. © 2011 Wiley Periodicals, Inc.
Bibliography:National Institute of General Medical Sciences of the National Institutes of Health - No. R01 GM075004
National Science Foundation - No. DMS800568; No. IIS0915801; No. EF0850009
istex:F58BEECC0BA23C5CF4557438C891CAAE3B8F5595
ArticleID:PROT23169
ark:/67375/WNG-X5V026RF-7
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ObjectType-Article-2
ObjectType-Feature-1
content type line 23
ISSN:0887-3585
1097-0134
1097-0134
DOI:10.1002/prot.23169