Institute of Mathematical Statistics Collections
Posterior consistency of Dirichlet mixtures of beta densities in estimating positive false discovery rates
In recent years, multiple hypothesis testing has come to the forefront of statistical research, ostensibly in relation to applications in genomics and some other emerging fields. The false discovery rate (FDR) and its variants provide very important notions of errors in this context comparable to the role of error probabilities in classical testing problems. Accurate estimation of positive FDR (pFDR), a variant of the FDR, is essential in assessing and controlling this measure. In a recent paper, the authors proposed a model-based nonparametric Bayesian method of estimation of the pFDR function. In particular, the density of p-values was modeled as a mixture of decreasing beta densities and an appropriate Dirichlet process was considered as a prior on the mixing measure. The resulting procedure was shown to work well in simulations. In this paper, we provide some theoretical results in support of the beta mixture model for the density of p-values, and show that, under appropriate conditions, the resulting posterior is consistent as the number of hypotheses grows to infinity.
First available in Project Euclid: 1 April 2008
Permanent link to this document
Digital Object Identifier
Copyright © 2008, Institute of Mathematical Statistics
Ghosal, Subhashis; Roy, Anindya; Tang, Yongqiang. Posterior consistency of Dirichlet mixtures of beta densities in estimating positive false discovery rates. Beyond Parametrics in Interdisciplinary Research: Festschrift in Honor of Professor Pranab K. Sen, 105--115, Institute of Mathematical Statistics, Beachwood, Ohio, USA, 2008. doi:10.1214/193940307000000077. https://projecteuclid.org/euclid.imsc/1207058267
-  Bayarri, M. J. and Berger, J. O. (2000). p-values for composite null models. J. Amer. Statist. Assoc. 95 1127–1142.
-  Benjamini, Y. and Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. Roy. Statist. Soc. Ser. B 57 289–300.
-  Efron, B. and Tibshirani, R. (2002). Empirical Bayes methods and false discovery rates for microarrays. Genetic Epidemiology 23 70–86.
-  Feller, W. (1971). An Introduction to Probability Theory and Its Applications. II. Wiley, New York.
-  Ferguson, T. S. (1973). A Bayesian analysis of some nonparametric problems. Ann. Statist. 1 209–230.
-  Ghosal, S. and van der Vaart, A. W. (2001). Entropies and rates of convergence for maximum likelihood and Bayes estimation for mixtures of normal densities. Ann. Statist. 29 1233–1263.
-  Ghosal, S. and van der Vaart, A. W. (2007). Posterior convergence rates of Dirichlet mixtures at smooth densities. Ann. Statist. 35 697–723.
-  Ghosh, J. K. and Ramamoorthi, R. V. (2003). Bayesian Nonparametrics. Springer, New York.
-  Robins, J. M., van der Vaart, A. W. and Ventura, V. (2000). Asymptotic distribution of p-values in composite null models. J. Amer. Statist. Assoc. 95 1143–1167.
-  Sarkar, S. K. (2002). Some results on false discovery rate in multiple testing procedures. Ann. Statist. 30 239–257.
-  Storey, J. D. (2002). A direct approach to false discovery rates. J. Roy. Statist. Soc. Ser. B 64 479–498.
-  Storey, J. D. (2003). The positive false discovery rate: A Bayesian interpretation and the q-value. Ann. Statist. 31 2013–2035.
-  Tang, Y., Ghosal, S. and Roy, A. (2007). Nonparametric Bayesian estimation of positive false discovery rates. Biometrics 63 1126–1134.
-  Tsai, C., Hsueh, H. and Chen, J. (2003). Estimation of false discovery rates testing: Application to gene microarray data. Biometrics 59 1071–1081.
-  Wong, W. H. and Shen, X. (1995). Probability inequalities for likelihood ratios and convergence rates of sieved MLEs. Ann. Statist. 23 339–362.