Bayesian Kernel Two-Sample Testing

In modern data analysis, nonparametric measures of discrepancies between random variables are particularly important. The subject is well-studied in the frequentist literature, while the development in the Bayesian setting is limited where applications are often restricted to univariate cases. Here,...

Full description

Saved in:

Bibliographic Details
Published in	Journal of computational and graphical statistics Vol. 31; no. 4; pp. 1164 - 1176
Main Authors	Zhang, Qinyi, Wild, Veit, Filippi, Sarah, Flaxman, Seth, Sejdinovic, Dino
Format	Journal Article
Language	English
Published	Alexandria Taylor & Francis 02.10.2022 Taylor & Francis Ltd
Subjects	Bayes factor: Hypothesis testing Bayesian analysis Data analysis Heterogeneity Hilbert space Kernel mean embeddings Kernels Random variables Test procedures
Online Access	Get full text

Cover

Loading…

More Information
Summary:	In modern data analysis, nonparametric measures of discrepancies between random variables are particularly important. The subject is well-studied in the frequentist literature, while the development in the Bayesian setting is limited where applications are often restricted to univariate cases. Here, we propose a Bayesian kernel two-sample testing procedure based on modeling the difference between kernel mean embeddings in the reproducing kernel Hilbert space using the framework established by Flaxman et al. The use of kernel methods enables its application to random variables in generic domains beyond the multivariate Euclidean spaces. The proposed procedure results in a posterior inference scheme that allows an automatic selection of the kernel parameters relevant to the problem at hand. In a series of synthetic experiments and two real data experiments (i.e., testing network heterogeneity from high-dimensional data and six-membered monocyclic ring conformation comparison), we illustrate the advantages of our approach. Supplementary materials for this article are available online.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1061-8600 1537-2715
DOI:	10.1080/10618600.2022.2067547