Power and sample-size estimation for microbiome studies using pairwise distances and PERMANOVA

Motivation: The variation in community composition between microbiome samples, termed beta diversity, can be measured by pairwise distance based on either presence–absence or quantitative species abundance data. PERMANOVA, a permutation-based extension of multivariate analysis of variance to a matri...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 31; no. 15; pp. 2461 - 2468
Main Authors Kelly, Brendan J., Gross, Robert, Bittinger, Kyle, Sherrill-Mix, Scott, Lewis, James D., Collman, Ronald G., Bushman, Frederic D., Li, Hongzhe
Format Journal Article
LanguageEnglish
Published England Oxford University Press 01.08.2015
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Motivation: The variation in community composition between microbiome samples, termed beta diversity, can be measured by pairwise distance based on either presence–absence or quantitative species abundance data. PERMANOVA, a permutation-based extension of multivariate analysis of variance to a matrix of pairwise distances, partitions within-group and between-group distances to permit assessment of the effect of an exposure or intervention (grouping factor) upon the sampled microbiome. Within-group distance and exposure/intervention effect size must be accurately modeled to estimate statistical power for a microbiome study that will be analyzed with pairwise distances and PERMANOVA. Results: We present a framework for PERMANOVA power estimation tailored to marker-gene microbiome studies that will be analyzed by pairwise distances, which includes: (i) a novel method for distance matrix simulation that permits modeling of within-group pairwise distances according to pre-specified population parameters; (ii) a method to incorporate effects of different sizes within the simulated distance matrix; (iii) a simulation-based method for estimating PERMANOVA power from simulated distance matrices; and (iv) an R statistical software package that implements the above. Matrices of pairwise distances can be efficiently simulated to satisfy the triangle inequality and incorporate group-level effects, which are quantified by the adjusted coefficient of determination, omega-squared (ω2). From simulated distance matrices, available PERMANOVA power or necessary sample size can be estimated for a planned microbiome study. Availability and implementation:  http://github.com/brendankelly/micropower. Contact:  brendank@mail.med.upenn.edu or hongzhe@upenn.edu
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Associate Editor: John Hancock
ISSN:1367-4803
1367-4811
1460-2059
DOI:10.1093/bioinformatics/btv183