Experiments in Searching Small Proteins in Unannotated Large Eukaryotic Genomes

There is growing interest to use mass spectrometry data to search genome sequences directly. Previous work by other authors demonstrated that this approach is able to correct and complement available genome annotations. We discuss the practical difficulty of searching large eukaryotic genomes with p...

Full description

Saved in:
Bibliographic Details
Published inJournal of proteome research Vol. 4; no. 1; pp. 167 - 174
Main Authors Colinge, Jacques, Cusin, Isabelle, Reffas, Samia, Mahé, Eve, Niknejad, Anne, Rey, Pierre-Antoine, Mattou, Hassan, Moniatte, Marc, Bougueleret, Lydie
Format Journal Article
LanguageEnglish
Published United States American Chemical Society 01.01.2005
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:There is growing interest to use mass spectrometry data to search genome sequences directly. Previous work by other authors demonstrated that this approach is able to correct and complement available genome annotations. We discuss the practical difficulty of searching large eukaryotic genomes with peptide ion trap tandem mass spectra of small proteins (<40 kDa). The challenging problem of automatically identifying peptides that span across exon/intron boundaries is explored for the first time by using experimental data. In a human genome search, we find that roughly 30% of the peptides are missed, due to various reasons, compared to a Swiss-Prot search. We show that this percentage is significantly reduced with improved parent mass accuracy. We finally provide several examples of predicted gene structures that could be improved by proteomics data, in particular by peptides spanning across exon/intron boundaries. Keywords: bioinformatics • genome • identification • eukaryote • small protein • ion trap • exon • intron
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1535-3893
1535-3907
DOI:10.1021/pr049811i