Prediction of protein b-residue contacts by Markov logic networks with grounding-specific weights
Motivation: Accurate prediction of contacts between b-strand residues can significantly contribute towards ab initio prediction of the 3D structure of many proteins. Contacts in the same protein are highly interdependent. Therefore, significant improvements can be expected by applying statistical re...
Saved in:
Published in | Bioinformatics (Oxford, England) Vol. 25; no. 18; pp. 2326 - 2333 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
15.09.2009
|
Online Access | Get full text |
ISSN | 1367-4803 |
DOI | 10.1093/bioinformatics/btp421 |
Cover
Summary: | Motivation: Accurate prediction of contacts between b-strand residues can significantly contribute towards ab initio prediction of the 3D structure of many proteins. Contacts in the same protein are highly interdependent. Therefore, significant improvements can be expected by applying statistical relational learners that overcome the usual machine learning assumption that examples are independent and identically distributed. Furthermore, the dependencies among b-residue contacts are subject to strong regularities, many of which are known a priori. In this article, we take advantage of Markov logic, a statistical relational learning framework that is able to capture dependencies between contacts, and constrain the solution according to domain knowledge expressed by means of weighted rules in a logical language.Results: We introduce a novel hybrid architecture based on neural and Markov logic networks with grounding-specific weights. On a non-redundant dataset, our method achieves 44.9% F(1) measure, with 47.3% precision and 42.7% recall, which is significantly better (P < 0.01) than previously reported performance obtained by 2D recursive neural networks. Our approach also significantly improves the number of chains for which b-strands are nearly perfectly paired (36% of the chains are predicted with F(1) . 70% on coarse map). It also outperforms more general contact predictors on recent CASP 2008 targets.Contact: lippi(s)i.unifi.itSupplementary information: Supplementary data are available at Bioinformatics online. |
---|---|
Bibliography: | ObjectType-Article-2 SourceType-Scholarly Journals-1 content type line 23 ObjectType-Feature-1 |
ISSN: | 1367-4803 |
DOI: | 10.1093/bioinformatics/btp421 |