Bank of samples from the Prof_Pat protein family, assessment of efficacy

The PROF_PAT protein pattern database has been created and maintained so as to comprise the maximal number of the SWISS-PROT + TrEMBL proteins as patterns. The present paper describes some characteristic features of PROF_PAT to assist the potential user. New amino acid sequences (10938) from the SWI...

Full description

Saved in:
Bibliographic Details
Published inMolekuliarnaia biologiia Vol. 38; no. 2; p. 256
Main Authors Nizolenko, L F, Bachinskiĭ, A G, Naumochkin, A N, Iarygin, A A, Grigorovich, D A
Format Journal Article
LanguageRussian
Published Russia (Federation) 01.03.2004
Subjects
Online AccessGet more information

Cover

Loading…
More Information
Summary:The PROF_PAT protein pattern database has been created and maintained so as to comprise the maximal number of the SWISS-PROT + TrEMBL proteins as patterns. The present paper describes some characteristic features of PROF_PAT to assist the potential user. New amino acid sequences (10938) from the SWISS-PROT database have been analyzed to determine the boundary values of the "score" parameter to distinguish random and significant similarities. Analysis through the Internet of 20 amino acid sequences having no descriptions in the TrEMBL database demonstrated that PROF_PAT, being highly competitive with its counterparts in specificity, surpasses them in amplitude and variety of proteins, working several times as fast. The real representation of protein families in the PROF_PAT database (release 1.11), which contains 50,149 patterns of 344,429 proteins, has been estimated at 31,450.
ISSN:0026-8984