Comprehensive review and empirical analysis of hallmarks of DNA-, RNA- and protein-binding residues in protein chains

Abstract Proteins interact with a variety of molecules including proteins and nucleic acids. We review a comprehensive collection of over 50 studies that analyze and/or predict these interactions. While majority of these studies address either solely protein–DNA or protein–RNA binding, only a few ha...

Full description

Saved in:

Bibliographic Details
Published in	Briefings in bioinformatics Vol. 20; no. 4; pp. 1250 - 1268
Main Authors	Zhang, Jian, Ma, Zhiqiang, Kurgan, Lukasz
Format	Journal Article
Language	English
Published	England Oxford University Press 19.07.2019 Oxford Publishing Limited (England)
Subjects	Amino Acid Sequence Amino acids Amino Acids - chemistry Binding Binding Sites - genetics Computational Biology - methods Databases, Protein Deoxyribonucleic acid DNA DNA - metabolism DNA-Binding Proteins - chemistry DNA-Binding Proteins - genetics DNA-Binding Proteins - metabolism Empirical analysis Evolutionary conservation Humans Internet Ligands Nucleic acids Performance prediction Protein Binding Protein Interaction Domains and Motifs Proteins Residues Ribonucleic acid RNA RNA - metabolism RNA-Binding Proteins - chemistry RNA-Binding Proteins - genetics RNA-Binding Proteins - metabolism Software protein–protein interactions DNA-binding residues RNA-binding residues protein–nucleic acid interactions protein–RNA interactions protein–DNA interactions
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Abstract Proteins interact with a variety of molecules including proteins and nucleic acids. We review a comprehensive collection of over 50 studies that analyze and/or predict these interactions. While majority of these studies address either solely protein–DNA or protein–RNA binding, only a few have a wider scope that covers both protein–protein and protein–nucleic acid binding. Our analysis reveals that binding residues are typically characterized with three hallmarks: relative solvent accessibility (RSA), evolutionary conservation and propensity of amino acids (AAs) for binding. Motivated by drawbacks of the prior studies, we perform a large-scale analysis to quantify and contrast the three hallmarks for residues that bind DNA-, RNA-, protein- and (for the first time) multi-ligand-binding residues that interact with DNA and proteins, and with RNA and proteins. Results generated on a well-annotated data set of over 23 000 proteins show that conservation of binding residues is higher for nucleic acid- than protein-binding residues. Multi-ligand-binding residues are more conserved and have higher RSA than single-ligand-binding residues. We empirically show that each hallmark discriminates between binding and nonbinding residues, even predicted RSA, and that combining them improves discriminatory power for each of the five types of interactions. Linear scoring functions that combine these hallmarks offer good predictive performance of residue-level propensity for binding and provide intuitive interpretation of predictions. Better understanding of these residue-level interactions will facilitate development of methods that accurately predict binding in the exponentially growing databases of protein sequences.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 ObjectType-Review-3 content type line 23
ISSN:	1467-5463 1477-4054 1477-4054
DOI:	10.1093/bib/bbx168