Bayesian and frequentist testing for differences between two groups with parametric and nonparametric two‐sample tests

Testing for differences between two groups is one of the scenarios most often faced by scientists across all domains and is particularly important in the medical sciences and psychology. The traditional solution to this problem is rooted inside the Neyman–Pearson theory of null hypothesis significan...

Full description

Saved in:
Bibliographic Details
Published inWiley interdisciplinary reviews. Computational statistics Vol. 13; no. 6
Main Author Kelter, Riko
Format Journal Article
LanguageEnglish
Published Hoboken, USA John Wiley & Sons, Inc 01.11.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Testing for differences between two groups is one of the scenarios most often faced by scientists across all domains and is particularly important in the medical sciences and psychology. The traditional solution to this problem is rooted inside the Neyman–Pearson theory of null hypothesis significance testing and uniformly most powerful tests. In the last decade, a lot of progress has been made in developing Bayesian versions of the most common parametric and nonparametric two‐sample tests, including Student's t‐test and the Mann–Whitney U test. In this article, we review the underlying assumptions, models and implications for research practice of these Bayesian two‐sample tests and contrast them with the existing frequentist solutions. Also, we show that in general Bayesian and frequentist two‐sample tests have different behavior regarding the type I and II error control, which needs to be carefully balanced in practical research. This article is categorized under: Statistical and Graphical Methods of Data Analysis > Bayesian Methods and Theory Statistical and Graphical Methods of Data Analysis > Monte Carlo Methods Statistical and Graphical Methods of Data Analysis > Markov Chain Monte Carlo Visual representation of a Bayesian two‐sample t‐test. Bayesian two‐sample tests have recently becomes increasingly popular through the on‐going discussion about statistical significance, and we review and contrast them with existing frequentist tests, also investigating their type I and II error control.
ISSN:1939-5108
1939-0068
DOI:10.1002/wics.1523