OptM: estimating the optimal number of migration edges on population trees using Treemix

Abstract The software Treemix has become extensively used to estimate the number of migration events, or edges (m), on population trees from genome-wide allele frequency data. However, the appropriate number of edges to include remains unclear. Here, I show that an optimal value of m can be inferred...

Full description

Saved in:
Bibliographic Details
Published inBiology methods and protocols Vol. 6; no. 1; p. bpab017
Main Author Fitak, Robert R
Format Journal Article
LanguageEnglish
Published Oxford Oxford University Press 2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Abstract The software Treemix has become extensively used to estimate the number of migration events, or edges (m), on population trees from genome-wide allele frequency data. However, the appropriate number of edges to include remains unclear. Here, I show that an optimal value of m can be inferred from the second-order rate of change in likelihood (Δm) across incremental values of m. Repurposed from its original use to estimate the number of population clusters in the software Structure (ΔK), I show using simulated populations that Δm performs equally as well as current recommendations for Treemix. A demonstration of an empirical dataset from domestic dogs indicates that this method may be preferable in large, complex population histories and can prioritize migration events for subsequent investigation. The method has been implemented in a freely available R package called “OptM” and as a web application (https://rfitak.shinyapps.io/OptM/) to interface directly with the output files of Treemix.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:2396-8923
2396-8923
DOI:10.1093/biomethods/bpab017