Analysis of Large-Scale Mutagenesis Data To Assess the Impact of Single Amino Acid Substitutions

Mutagenesis is a widely used method for identifying protein positions that are important for function or ligand binding. Advances in high-throughput DNA sequencing and mutagenesis techniques have enabled measurement of the effects of nearly all possible amino acid substitutions in many proteins. The...

Full description

Saved in:
Bibliographic Details
Published inGenetics (Austin) Vol. 207; no. 1; pp. 53 - 61
Main Authors Gray, Vanessa E, Hause, Ronald J, Fowler, Douglas M
Format Journal Article
LanguageEnglish
Published United States Genetics Society of America 01.09.2017
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Mutagenesis is a widely used method for identifying protein positions that are important for function or ligand binding. Advances in high-throughput DNA sequencing and mutagenesis techniques have enabled measurement of the effects of nearly all possible amino acid substitutions in many proteins. The resulting large-scale mutagenesis data sets offer a unique opportunity to draw general conclusions about the effects of different amino acid substitutions. Thus, we analyzed 34,373 mutations in 14 proteins whose effects were measured using large-scale mutagenesis approaches. Methionine was the most tolerated substitution, while proline was the least tolerated. We found that several substitutions, including histidine and asparagine, best recapitulated the effects of other substitutions, even when the identity of the wild-type amino acid was considered. The effects of histidine and asparagine substitutions also correlated best with the effects of other substitutions in different structural contexts. Furthermore, highly disruptive substitutions like aspartic and glutamic acid had the most discriminatory power for detecting ligand interface positions. Our work highlights the utility of large-scale mutagenesis data, and our conclusions can help guide future single substitution mutational scans.
ISSN:1943-2631
0016-6731
1943-2631
DOI:10.1534/genetics.117.300064