Computing Exact p-values for a Cross-correlation Shotgun Proteomics Score Function

The core of every protein mass spectrometry analysis pipeline is a function that assesses the quality of a match between an observed spectrum and a candidate peptide. We describe a procedure for computing exact p-values for the oldest and still widely used score function, SEQUEST XCorr. The procedur...

Full description

Saved in:
Bibliographic Details
Published inMolecular & cellular proteomics Vol. 13; no. 9; pp. 2467 - 2479
Main Authors Howbert, J. Jeffry, Noble, William Stafford
Format Journal Article
LanguageEnglish
Published United States Elsevier Inc 01.09.2014
The American Society for Biochemistry and Molecular Biology
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The core of every protein mass spectrometry analysis pipeline is a function that assesses the quality of a match between an observed spectrum and a candidate peptide. We describe a procedure for computing exact p-values for the oldest and still widely used score function, SEQUEST XCorr. The procedure uses dynamic programming to enumerate efficiently the full distribution of scores for all possible peptides whose masses are close to that of the spectrum precursor mass. Ranking identified spectra by p-value rather than XCorr significantly reduces variance because of spectrum-specific effects on the score. In combination with the Percolator postprocessor, the XCorr p-value yields more spectrum and peptide identifications at a fixed false discovery rate than Mascot, X!Tandem, Comet, and MS-GF+ across a variety of data sets.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1535-9476
1535-9484
DOI:10.1074/mcp.O113.036327