Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field

Ab initio protein folding is one of the major unsolved problems in computational biology owing to the difficulties in force field design and conformational search. We developed a novel program, QUARK, for template‐free protein structure prediction. Query sequences are first broken into fragments of...

Full description

Saved in:
Bibliographic Details
Published inProteins, structure, function, and bioinformatics Vol. 80; no. 7; pp. 1715 - 1735
Main Authors Xu, Dong, Zhang, Yang
Format Journal Article
LanguageEnglish
Published Hoboken Wiley Subscription Services, Inc., A Wiley Company 01.07.2012
Wiley Subscription Services, Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Ab initio protein folding is one of the major unsolved problems in computational biology owing to the difficulties in force field design and conformational search. We developed a novel program, QUARK, for template‐free protein structure prediction. Query sequences are first broken into fragments of 1–20 residues where multiple fragment structures are retrieved at each position from unrelated experimental structures. Full‐length structure models are then assembled from fragments using replica‐exchange Monte Carlo simulations, which are guided by a composite knowledge‐based force field. A number of novel energy terms and Monte Carlo movements are introduced and the particular contributions to enhancing the efficiency of both force field and search engine are analyzed in detail. QUARK prediction procedure is depicted and tested on the structure modeling of 145 nonhomologous proteins. Although no global templates are used and all fragments from experimental structures with template modeling score >0.5 are excluded, QUARK can successfully construct 3D models of correct folds in one‐third cases of short proteins up to 100 residues. In the ninth community‐wide Critical Assessment of protein Structure Prediction experiment, QUARK server outperformed the second and third best servers by 18 and 47% based on the cumulative Z‐score of global distance test‐total scores in the FM category. Although ab initio protein folding remains a significant challenge, these data demonstrate new progress toward the solution of the most important problem in the field. Proteins 2012; © 2012 Wiley Periodicals, Inc.
Bibliography:ArticleID:PROT24065
The NSF Career Award - No. DBI 1027394
ark:/67375/WNG-8KL70B9L-R
The National Institute of General Medical Sciences - No. GM083107; No. GM084222
istex:C7EC21E364CFDA33330E4B2A96763F32E39B4E09
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
ISSN:0887-3585
1097-0134
1097-0134
DOI:10.1002/prot.24065