RepeatProfiler: A pipeline for visualization and comparative analysis of repetitive DNA profiles
Study of repetitive DNA elements in model organisms highlights the role of repetitive elements (REs) in many processes that drive genome evolution and phenotypic change. Because REs are much more dynamic than single‐copy DNA, repetitive sequences can reveal signals of evolutionary history over short...
Saved in:
Published in | Molecular ecology resources Vol. 21; no. 3; pp. 969 - 981 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
England
Wiley Subscription Services, Inc
01.04.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Study of repetitive DNA elements in model organisms highlights the role of repetitive elements (REs) in many processes that drive genome evolution and phenotypic change. Because REs are much more dynamic than single‐copy DNA, repetitive sequences can reveal signals of evolutionary history over short time scales that may not be evident in sequences from slower‐evolving genomic regions. Many tools for studying REs are directed toward organisms with existing genomic resources, including genome assemblies and repeat libraries. However, signals in repeat variation may prove especially valuable in disentangling evolutionary histories in diverse non‐model groups, for which genomic resources are limited. Here, we introduce RepeatProfiler, a tool for generating, visualizing, and comparing repetitive element DNA profiles from low‐coverage, short‐read sequence data. RepeatProfiler automates the generation and visualization of RE coverage depth profiles (RE profiles) and allows for statistical comparison of profile shape across samples. In addition, RepeatProfiler facilitates comparison of profiles by extracting signal from sequence variants across profiles which can then be analysed as molecular morphological characters using phylogenetic analysis. We validate RepeatProfiler with data sets from ground beetles (Bembidion), flies (Drosophila), and tomatoes (Solanum). We highlight the potential of RE profiles as a high‐resolution data source for studies in species delimitation, comparative genomics, and repeat biology. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 JSS originally conceived the pipeline and its core features. JSS, AML, and SN improved conceptual design and features. SN, JSS, and AG wrote the code. SN, JSS, and AG conducted validation experiments. SN wrote the first draft of the manuscript and all authors contributed to subsequent drafts. Author Contributions |
ISSN: | 1755-098X 1755-0998 |
DOI: | 10.1111/1755-0998.13305 |