Simultaneous critical values for $t$-tests in very high dimensions

Hongyuan Cao; Michael R. Kosorok

doi:10.3150/10-BEJ272

February 2011 Simultaneous critical values for $t$-tests in very high dimensions

Hongyuan Cao, Michael R. Kosorok

Bernoulli 17(1): 347-394 (February 2011). DOI: 10.3150/10-BEJ272

Abstract

This article considers the problem of multiple hypothesis testing using $t$-tests. The observed data are assumed to be independently generated conditional on an underlying and unknown two-state hidden model. We propose an asymptotically valid data-driven procedure to find critical values for rejection regions controlling the $k$-familywise error rate ($k$-FWER), false discovery rate (FDR) and the tail probability of false discovery proportion (FDTP) by using one-sample and two-sample $t$-statistics. We only require a finite fourth moment plus some very general conditions on the mean and variance of the population by virtue of the moderate deviations properties of $t$-statistics. A new consistent estimator for the proportion of alternative hypotheses is developed. Simulation studies support our theoretical results and demonstrate that the power of a multiple testing procedure can be substantially improved by using critical values directly, as opposed to the conventional $p$-value approach. Our method is applied in an analysis of the microarray data from a leukemia cancer study that involves testing a large number of hypotheses simultaneously.

Citation

Download Citation

Hongyuan Cao. Michael R. Kosorok. "Simultaneous critical values for $t$-tests in very high dimensions." Bernoulli 17 (1) 347 - 394, February 2011. https://doi.org/10.3150/10-BEJ272

Information

Published: February 2011

First available in Project Euclid: 8 February 2011

zbMATH: 1284.62469

MathSciNet: MR2797995

Digital Object Identifier: 10.3150/10-BEJ272

Keywords: Empirical processes , FDR , high dimension , microarrays , multiple hypothesis testing , one-sample $t$-statistics , self-normalized moderate deviation , two-sample $t$-statistics