Method and apparatus for identifying tandem repeats in a nucleotide sequence
Main Authors | , |
---|---|
Format | Patent |
Language | English |
Published | 11.06.2019 |
Subjects | |
Summary: | A method and a corresponding apparatus for identifying tandem repeats in a nucleotide sequence are described. Tandem repeats can be identified by detecting one or more lines present in a self-alignment plot of the nucleotide sequence. The disclosed method includes identifying one or more square-shaped subregions (SSS), each representing a tandem repeat and each associated with a plurality of identified candidate alignments, by: i) estimating a defining point of an individual square for each of the candidate alignments, each candidate alignment having a start point and an end point positioned along adjacent sides of the individual square; ii) selecting one or more seed alignments from the one or more candidate alignments; and iii) associating the one or more candidate alignments with the one or more seed alignments. Based on the associating, a final SSS representing a tandem repeat is determined and its presence is reported. |
---|---|
Bibliography: | Application Number: US201615196085 |
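
The summary's starting observation, that tandem repeats appear as lines in a self-alignment plot, can be illustrated with a minimal Python sketch. It is not the patented method: it uses exact character matching rather than scored alignments, and it does not implement the SSS, defining-point, or seed-alignment steps. The function name `diagonal_runs` and the parameter `min_len` are hypothetical names chosen for this illustration.

```python
def diagonal_runs(seq, min_len=4):
    """Find lines parallel to the main diagonal of an exact-match self dot plot.

    Returns a list of (offset, start, length) tuples, one for each maximal run
    of positions i where seq[i] == seq[i + offset] and length >= min_len.
    Illustrative assumption: exact matches only, no alignment scoring.
    """
    n = len(seq)
    runs = []
    for offset in range(1, n):            # offset 0 is the trivial main diagonal
        run_start = None
        # Iterate one step past the last valid position so a trailing run is flushed.
        for i in range(n - offset + 1):
            match = i < n - offset and seq[i] == seq[i + offset]
            if match and run_start is None:
                run_start = i             # a new run begins here
            elif not match and run_start is not None:
                length = i - run_start
                if length >= min_len:
                    runs.append((offset, run_start, length))
                run_start = None
    return runs


if __name__ == "__main__":
    # Four tandem copies of the unit "ACG".
    for offset, start, length in diagonal_runs("ACGACGACGACG"):
        print(offset, start, length)
```

For a sequence made of tandem copies of a unit of length p, the runs appear at offsets that are multiples of p, so the smallest reported offset is a candidate for the repeat period. Each run corresponds to one line in the plot, running from the point (start, start + offset) to the point (start + length - 1, start + offset + length - 1).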