Redundancy removing optimizing method of gene reference sequence and system thereof
Format | Patent |
---|---|
Language | Chinese, English |
Published | 19.04.2019 |
Summary: | The invention discloses a redundancy-removal optimization method for a gene reference sequence, and a system thereof. The gene reference sequence is traversed with a preset step length to obtain contiguous reference-sequence k-mers (Kmer) of a preset length. The k-mers are then dispersed into hash buckets and selectively de-duplicated, and the remaining k-mers are reassembled. This reduces the number of reference k-mers as far as possible while preserving the quality of the contiguous reference k-mers. The method and system reduce the redundancy of the gene reference sequence on the precondition that the compression rate does not decrease, yielding a leaner and more suitable reference sequence, a higher capacity for the optimized gene reference sequence, a higher memory loading rate when it is used as a compression reference index, and improved compression efficiency of the ge |
---|---|
Bibliography: | Application Number: CN201811591686 |
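
The three steps named in the summary (k-mer traversal, hash-bucket de-duplication, reassembly) can be illustrated with a minimal Python sketch. It is not the patented method: the record does not state the selection criterion or the reassembly rule, so the sketch assumes that only exact duplicate k-mers are dropped, that the first occurrence is kept, and that the surviving k-mers are concatenated in their original order. The function names and the `num_buckets` parameter are hypothetical.

```python
from collections import defaultdict


def extract_kmers(reference, k, step):
    """Yield (offset, kmer) pairs of length k, advancing by `step` bases."""
    for start in range(0, len(reference) - k + 1, step):
        yield start, reference[start:start + k]


def dedupe_with_buckets(kmers, num_buckets):
    """Scatter k-mers into hash buckets; keep the first offset of each k-mer.

    Assumption: "selective redundancy elimination" is modelled here as
    dropping exact duplicates only. The patent may use another criterion.
    """
    buckets = defaultdict(dict)
    for offset, kmer in kmers:
        bucket = buckets[hash(kmer) % num_buckets]
        bucket.setdefault(kmer, offset)  # later duplicates are ignored
    return buckets


def reassemble(buckets):
    """Concatenate surviving k-mers in original order (assumed rule)."""
    kept = sorted(
        (offset, kmer)
        for bucket in buckets.values()
        for kmer, offset in bucket.items()
    )
    return "".join(kmer for _, kmer in kept)


def optimize_reference(reference, k, step, num_buckets=1024):
    """Run the three steps end to end on a reference string."""
    kmers = extract_kmers(reference, k, step)
    return reassemble(dedupe_with_buckets(kmers, num_buckets))


if __name__ == "__main__":
    print(optimize_reference("ACGTACGTTTTTACGT", k=4, step=4))
```

With `step` equal to `k` the k-mers do not overlap, so concatenation yields a shorter reference in which each distinct k-mer appears once. With a smaller step the k-mers overlap, and a real reassembly step would have to merge the overlaps rather than concatenate.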