Characterizing Protease Specificity: How Many Substrates Do We Need?

Calculation of cleavage entropies allows to quantify, map and compare protease substrate specificity by an information entropy based approach. The metric intrinsically depends on the number of experimentally determined substrates (data points). Thus a statistical analysis of its numerical stability...

Full description

Saved in:
Bibliographic Details
Published inPloS one Vol. 10; no. 11; p. e0142658
Main Authors Schauperl, Michael, Fuchs, Julian E, Waldner, Birgit J, Huber, Roland G, Kramer, Christian, Liedl, Klaus R
Format Journal Article
LanguageEnglish
Published United States Public Library of Science 11.11.2015
Public Library of Science (PLoS)
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Calculation of cleavage entropies allows to quantify, map and compare protease substrate specificity by an information entropy based approach. The metric intrinsically depends on the number of experimentally determined substrates (data points). Thus a statistical analysis of its numerical stability is crucial to estimate the systematic error made by estimating specificity based on a limited number of substrates. In this contribution, we show the mathematical basis for estimating the uncertainty in cleavage entropies. Sets of cleavage entropies are calculated using experimental cleavage data and modeled extreme cases. By analyzing the underlying mathematics and applying statistical tools, a linear dependence of the metric in respect to 1/n was found. This allows us to extrapolate the values to an infinite number of samples and to estimate the errors. Analyzing the errors, a minimum number of 30 substrates was found to be necessary to characterize substrate specificity, in terms of amino acid variability, for a protease (S4-S4') with an uncertainty of 5 percent. Therefore, we encourage experimental researchers in the protease field to record specificity profiles of novel proteases aiming to identify at least 30 peptide substrates of maximum sequence diversity. We expect a full characterization of protease specificity helpful to rationalize biological functions of proteases and to assist rational drug design.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Competing Interests: The authors have declared that no competing interests exist.
Conceived and designed the experiments: MS JEF RGH CK KRL. Performed the experiments: MS JEF CK. Analyzed the data: MS JEF BJW RGH CK KRL. Contributed reagents/materials/analysis tools: MS JEF RGH CK. Wrote the paper: MS JEF BJW RGH CK KRL.
Current address: Pharma Research and Early Development, Therapeutic Modalities, Roche Innovation Center Basel, Grenzacherstrasse 74, 4070, Basel, Switzerland
Current address: Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), #07–01 Matrix, 30 Biopolis Street, 138671, Singapore, Singapore
ISSN:1932-6203
1932-6203
DOI:10.1371/journal.pone.0142658