RepeatProfiler: A pipeline for visualization and comparative analysis of repetitive DNA profiles

Study of repetitive DNA elements in model organisms highlights the role of repetitive elements (REs) in many processes that drive genome evolution and phenotypic change. Because REs are much more dynamic than single‐copy DNA, repetitive sequences can reveal signals of evolutionary history over short...

Full description

Saved in:
Bibliographic Details
Published inMolecular ecology resources Vol. 21; no. 3; pp. 969 - 981
Main Authors Negm, Sherif, Greenberg, Anya, Larracuente, Amanda M., Sproul, John S.
Format Journal Article
LanguageEnglish
Published England Wiley Subscription Services, Inc 01.04.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Study of repetitive DNA elements in model organisms highlights the role of repetitive elements (REs) in many processes that drive genome evolution and phenotypic change. Because REs are much more dynamic than single‐copy DNA, repetitive sequences can reveal signals of evolutionary history over short time scales that may not be evident in sequences from slower‐evolving genomic regions. Many tools for studying REs are directed toward organisms with existing genomic resources, including genome assemblies and repeat libraries. However, signals in repeat variation may prove especially valuable in disentangling evolutionary histories in diverse non‐model groups, for which genomic resources are limited. Here, we introduce RepeatProfiler, a tool for generating, visualizing, and comparing repetitive element DNA profiles from low‐coverage, short‐read sequence data. RepeatProfiler automates the generation and visualization of RE coverage depth profiles (RE profiles) and allows for statistical comparison of profile shape across samples. In addition, RepeatProfiler facilitates comparison of profiles by extracting signal from sequence variants across profiles which can then be analysed as molecular morphological characters using phylogenetic analysis. We validate RepeatProfiler with data sets from ground beetles (Bembidion), flies (Drosophila), and tomatoes (Solanum). We highlight the potential of RE profiles as a high‐resolution data source for studies in species delimitation, comparative genomics, and repeat biology.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
JSS originally conceived the pipeline and its core features. JSS, AML, and SN improved conceptual design and features. SN, JSS, and AG wrote the code. SN, JSS, and AG conducted validation experiments. SN wrote the first draft of the manuscript and all authors contributed to subsequent drafts.
Author Contributions
ISSN:1755-098X
1755-0998
DOI:10.1111/1755-0998.13305