Differentiable, multi-dimensional, knowledge-based energy terms for torsion angle probabilities and propensities

Rotatable torsion angles are the major degrees of freedom in proteins. Adjacent angles are highly correlated and energy terms that rely on these correlations are intensively used in molecular modeling. However, the utility of torsion based terms is not yet fully exploited. Many of these terms do not...

Full description

Saved in:
Bibliographic Details
Published inProteins, structure, function, and bioinformatics Vol. 72; no. 1; pp. 62 - 73
Main Authors Amir, El-Ad David, Kalisman, Nir, Keasar, Chen
Format Journal Article
LanguageEnglish
Published Hoboken Wiley Subscription Services, Inc., A Wiley Company 01.07.2008
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Rotatable torsion angles are the major degrees of freedom in proteins. Adjacent angles are highly correlated and energy terms that rely on these correlations are intensively used in molecular modeling. However, the utility of torsion based terms is not yet fully exploited. Many of these terms do not capture the full scale of the correlations. Other terms, which rely on lookup tables, cannot be used in the context of force‐driven algorithms because they are not fully differentiable. This study aims to extend the usability of torsion terms by presenting a set of high‐dimensional and fully‐differentiable energy terms that are derived from high‐resolution structures. The set includes terms that describe backbone conformational probabilities and propensities, side‐chain rotamer probabilities, and an elaborate term that couples all the torsion angles within the same residue. The terms are constructed by cubic spline interpolation with periodic boundary conditions that enable full differentiability and high computational efficiency. We show that the spline implementation does not compromise the accuracy of the original database statistics. We further show that the side‐chain relevant terms are compatible with established rotamer probabilities. Despite their very local characteristics, the new terms are often able to identify native and native‐like structures within decoy sets. Finally, force‐based minimization of NMR structures with the new terms improves their torsion angle statistics with minor structural distortion (0.5 Å RMSD on average). The new terms are freely available in the MESHI molecular modeling package. The spline coefficients are also available as a documented MATLAB file. Proteins 2008. © 2008 Wiley‐Liss, Inc.
Bibliography:The GeneFun consortium of the European Commission - No. LSHG-CT-2004-503567
Israeli Science Foundation - No. 289/06
ArticleID:PROT21896
ark:/67375/WNG-6GVSTFG4-5
istex:3D2FFF4671D34BEF25FFD64FA3DC53BFC4E41F18
Authors El‐Ad David Amir and Nir Kalisman contributed equally to this work.
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0887-3585
1097-0134
DOI:10.1002/prot.21896