Method for screening open reading frame with small peptide encoding capacity
The invention belongs to the technical field of medicines and particularly relates to a method for screening an open reading frame with small peptide encoding capacity. The method comprises the following steps: with the GC content of a genome as a basis, screening prokaryotic genomes with sORFs anno...
Saved in:
Main Authors | , , , , , , |
---|---|
Format | Patent |
Language | Chinese English |
Published |
06.11.2020
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The invention belongs to the technical field of medicines and particularly relates to a method for screening an open reading frame with small peptide encoding capacity. The method comprises the following steps: with the GC content of a genome as a basis, screening prokaryotic genomes with sORFs annotations from a database; grouping the prokaryotic genomes according to the GC contents of the genomes; then, screening out the sORFs with clear biological functions in each genome; conducting redundancy elimination processing by utilizing a CD-Hit program; sequentially taking peptide coded sORFs ineach genome as positive samples and corresponding random disordered sequences as negative samples, taking the positive samples and the negative samples as a preliminary training set to respectively predict sORFs in other genomes, and taking the genome with the best prediction effect in each genome GC content interval as a final training set source genome; and based on the screened training set, conducting training with a |
---|---|
Bibliography: | Application Number: CN202010777126 |