LandScape: a simple method to aggregate p-values and other stochastic variables without a priori grouping

Bibliographic Details
Published in: Statistical applications in genetics and molecular biology, Vol. 15, no. 4, pp. 349-361
Main Authors: Wiuf, Carsten; Schaumburg-Müller Pallesen, Jonatan; Foldager, Leslie; Grove, Jakob
Format: Journal Article
Language: English
Published: Germany, De Gruyter, 01.08.2016
ISSN: 2194-6302, 1544-6115
DOI: 10.1515/sagmb-2015-0085

Summary: In many areas of science it is customary to perform many tests, potentially millions, simultaneously. To gain statistical power it is common to group tests based on a priori criteria, such as predefined regions or sliding windows. However, it is not straightforward to choose grouping criteria, and the results might depend on the chosen criteria. Methods that summarize, or aggregate, test statistics or p-values without relying on a priori criteria are therefore desirable. We present a simple method to aggregate a sequence of stochastic variables, such as test statistics or p-values, into fewer variables without assuming a priori defined groups. We provide different ways to evaluate the significance of the aggregated variables based on theoretical considerations and resampling techniques, and show that under certain assumptions the family-wise error rate (FWER) is controlled in the strong sense. Validity of the method was demonstrated using simulations and real data analyses. Our method may be a useful supplement to standard procedures that rely on evaluating test statistics individually. Moreover, by being agnostic and not relying on predefined regions, it might be a practical alternative to conventionally used methods for aggregating p-values over regions. The method is implemented in Python and freely available online (through GitHub, see the Supplementary information).
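
For orientation, the conventional approach that the summary contrasts with (aggregating p-values over sliding windows of fixed width) might be sketched as follows. This is not the LandScape method, whose details are not given in this record. Fisher's combination method, the assumption of independent p-values in (0, 1], the window width, and the function names fisher_combine and sliding_window_pvalues are all illustrative choices made here, not taken from the article.

```python
import math


def fisher_combine(pvalues):
    """Combine independent p-values with Fisher's method (illustrative helper).

    Under the null, -2 * sum(log p) follows a chi-square distribution with
    2k degrees of freedom, where k is the number of p-values combined.
    """
    k = len(pvalues)
    # half_stat is half of Fisher's statistic, i.e. -sum(log p).
    half_stat = -sum(math.log(p) for p in pvalues)
    # Closed-form chi-square survival function for even degrees of freedom:
    # exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!
    term, total = 1.0, 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return math.exp(-half_stat) * total


def sliding_window_pvalues(pvalues, width):
    """Aggregate a sequence of p-values over every window of a fixed width."""
    return [
        fisher_combine(pvalues[start:start + width])
        for start in range(len(pvalues) - width + 1)
    ]


if __name__ == "__main__":
    example = [0.01, 0.02, 0.5, 0.9]  # made-up p-values for illustration
    print(sliding_window_pvalues(example, width=2))
```

The aggregated values depend on the chosen width, which is the kind of dependence on a priori grouping criteria that the summary describes as a drawback.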