Method and apparatus for identifying tandem repeats in a nucleotide sequence

A method and corresponding apparatus for identifying tandem repeats in a nucleotide sequence is described. Tandem repeats can be identified by identifying one or more lines present in a self-alignment plot of the nucleotide sequence. The disclosed method includes identifying one or more square-shape...

Full description

Saved in:
Bibliographic Details
Main Authors Li, Yilong, Komar, Peter
Format Patent
LanguageEnglish
Published 11.06.2019
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A method and corresponding apparatus for identifying tandem repeats in a nucleotide sequence is described. Tandem repeats can be identified by identifying one or more lines present in a self-alignment plot of the nucleotide sequence. The disclosed method includes identifying one or more square-shaped subregions (SSS) representing a tandem repeat and each associated with a plurality of identified candidate alignments by: i) estimating a defining point of an individual square for each of the candidate alignments, each candidate alignment having a start point and an end point, the start point and the end point positioned along adjacent sides of the individual square; ii) selecting one or more seed alignments from the one or more candidate alignments; and iii) associating the one or more candidate alignments with the one or more seed alignments. Based on the associating, a final SSS representing a tandem repeat is determined and its presence is reported.
Bibliography:Application Number: US201615196085