Redundancy removing optimizing method of gene reference sequence and system thereof

The invention discloses a redundancy removing optimizing method of a gene reference sequence and a system thereof. For aiming at a gene reference sequence, a continuous reference sequence Kmer with apreset length is obtained through traversal according to a preset step length; then dispersion and se...

Full description

Saved in:
Bibliographic Details
Main Authors SONG ZHUO, MA CHOUXIAN, LI GEN, ZHAO LIXIA, MAO HAIBO, XU XIALI, FENG BOLUN, YANG YAO, HUANG NENGCHAO
Format Patent
LanguageChinese
English
Published 19.04.2019
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The invention discloses a redundancy removing optimizing method of a gene reference sequence and a system thereof. For aiming at a gene reference sequence, a continuous reference sequence Kmer with apreset length is obtained through traversal according to a preset step length; then dispersion and selective redundancy eliminating are performed on the continuous reference sequence Kmer through a Hash barrel; and then re-assembling is performed, thereby reducing the number of the reference sequences Kmer as possible and further ensuring the quality of the continuous reference sequence Kmer. According to the method and the system, the redundancy of the gene reference sequence can be reduced under a precondition that a compression rate does not reduce, thereby obtaining a more suitable reference sequence through retrenching, realizing higher volume of the optimized gene reference sequence, realizing higher memory loading rate in using as a compression reference index, and improving compression efficiency of the ge
Bibliography:Application Number: CN201811591686