Phylogenomics using low‐depth whole genome sequencing: A case study with the olive tribe

Species trees have traditionally been inferred from a few selected markers, and genome‐wide investigations remain largely restricted to model organisms or small groups of species for which sampling of fresh material is available, leaving out most of the existing and historical species diversity. The...

Full description

Saved in:
Bibliographic Details
Published inMolecular ecology resources Vol. 19; no. 4; pp. 877 - 892
Main Authors Olofsson, Jill K., Cantera, Isabel, Van de Paer, Céline, Hong‐Wa, Cynthia, Zedane, Loubab, Dunning, Luke T., Alberti, Adriana, Christin, Pascal‐Antoine, Besnard, Guillaume
Format Journal Article
LanguageEnglish
Published England Wiley Subscription Services, Inc 01.07.2019
Wiley/Blackwell
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Species trees have traditionally been inferred from a few selected markers, and genome‐wide investigations remain largely restricted to model organisms or small groups of species for which sampling of fresh material is available, leaving out most of the existing and historical species diversity. The genomes of an increasing number of species, including specimens extracted from natural history collections, are being sequenced at low depth. While these data sets are widely used to analyse organelle genomes, the nuclear fraction is generally ignored. Here we evaluate different reference‐based methods to infer phylogenies of large taxonomic groups from such data sets. Using the example of the Oleeae tribe, a worldwide‐distributed group, we build phylogenies based on single nucleotide polymorphisms (SNPs) obtained using two reference genomes (the olive and ash trees). The inferred phylogenies are overall congruent, yet present differences that might reflect the effect of distance to the reference on the amount of missing data. To limit this issue, genome complexity was reduced by using pairs of orthologous coding sequences as the reference, thus allowing us to combine SNPs obtained using two distinct references. Concatenated and coalescence trees based on these combined SNPs suggest events of incomplete lineage sorting and/or hybridization during the diversification of this large phylogenetic group. Our results show that genome‐wide phylogenetic trees can be inferred from low‐depth sequence data sets for eukaryote groups with complex genomes, and histories of reticulate evolution. This opens new avenues for large‐scale phylogenomics and biogeographical analyses covering both the extant and the historical diversity stored in museum collections.
Bibliography:Data Availability Statement
All raw reads are available in the short sequence archive under accession nos. PRJNA506987 and PRJEB30497. In the NCBI nucleotide database, all newly assembled chloroplast genomes and ribosomal DNA clusters are available under accession numbers specified in Table S1. All scripts are available on Github
https://github.com/jill-olofsson/low-depth-sequencing_analyses
.
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1755-098X
1755-0998
DOI:10.1111/1755-0998.13016