Poor Fit to the Multispecies Coalescent is Widely Detectable in Empirical Data

Model checking is a critical part of Bayesian data analysis, yet it remains largely unused in systematic studies. Phylogeny estimation has recently moved into an era of increasingly complex models that simultaneously account for multiple evolutionary processes, the statistical fit of these models to...

Full description

Saved in:
Bibliographic Details
Published inSystematic biology Vol. 63; no. 3; pp. 322 - 333
Main Authors Reid, Noah M., Hird, Sarah M., Brown, Jeremy M., Pelletier, Tara A., McVay, John D., Satler, Jordan D., Carstens, Bryan C.
Format Journal Article
LanguageEnglish
Published England Oxford University Press 01.05.2014
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Model checking is a critical part of Bayesian data analysis, yet it remains largely unused in systematic studies. Phylogeny estimation has recently moved into an era of increasingly complex models that simultaneously account for multiple evolutionary processes, the statistical fit of these models to the data has rarely been tested. Here we develop a posterior predictive simulation-based model check for a commonly used multispecies coalescent model, implemented in *BEAST, and apply it to 25 published data sets. We show that poor model fit is detectable in the majority of data sets; that this poor fit can mislead phylogenetic estimation; and that in some cases it stems from processes of inherent interest to systematists. We suggest that as systematists scale up to phylogenomic data sets, which will be subject to a heterogeneous array of evolutionary processes, critically evaluating the fit of models to data is an analytical step that can no longer be ignored.
Bibliography:SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
ISSN:1063-5157
1076-836X
1076-836X
DOI:10.1093/sysbio/syt057