Institute of Mathematical Statistics Collections

Posterior consistency of Dirichlet mixtures of beta densities in estimating positive false discovery rates

Subhashis Ghosal, Anindya Roy, Yongqiang Tang

Source: N. Balakrishnan, Edsel A. Peña and Mervyn J. Silvapulle, eds., Beyond Parametrics in Interdisciplinary Research: Festschrift in Honor of Professor Pranab K. Sen (Beachwood, Ohio, USA: Institute of Mathematical Statistics, 2008), 105-115.

Abstract

In recent years, multiple hypothesis testing has come to the forefront of statistical research, ostensibly in relation to applications in genomics and some other emerging fields. The false discovery rate (FDR) and its variants provide very important notions of errors in this context comparable to the role of error probabilities in classical testing problems. Accurate estimation of positive FDR (pFDR), a variant of the FDR, is essential in assessing and controlling this measure. In a recent paper, the authors proposed a model-based nonparametric Bayesian method of estimation of the pFDR function. In particular, the density of p-values was modeled as a mixture of decreasing beta densities and an appropriate Dirichlet process was considered as a prior on the mixing measure. The resulting procedure was shown to work well in simulations. In this paper, we provide some theoretical results in support of the beta mixture model for the density of p-values, and show that, under appropriate conditions, the resulting posterior is consistent as the number of hypotheses grows to infinity.

Primary Subjects: 62G05, 62G20
Secondary Subjects: 62G10
Keywords: Dirichlet process; Dirichlet mixture; multiple testing; positive false discovery rate; posterior consistency

Full-text: Open access

Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.imsc/1207058267
Digital Object Identifier: doi:10.1214/193940307000000077

References

[1] Bayarri, M. J. and Berger, J. O. (2000). p-values for composite null models. J. Amer. Statist. Assoc. 95 1127–1142.

[2] Benjamini, Y. and Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. Roy. Statist. Soc. Ser. B 57 289–300.

[3] Efron, B. and Tibshirani, R. (2002). Empirical Bayes methods and false discovery rates for microarrays. Genetic Epidemiology 23 70–86.

[4] Feller, W. (1971). An Introduction to Probability Theory and Its Applications. II. Wiley, New York.

[5] Ferguson, T. S. (1973). A Bayesian analysis of some nonparametric problems. Ann. Statist. 1 209–230.

[6] Ghosal, S. and van der Vaart, A. W. (2001). Entropies and rates of convergence for maximum likelihood and Bayes estimation for mixtures of normal densities. Ann. Statist. 29 1233–1263.

[7] Ghosal, S. and van der Vaart, A. W. (2007). Posterior convergence rates of Dirichlet mixtures at smooth densities. Ann. Statist. 35 697–723.

[8] Ghosh, J. K. and Ramamoorthi, R. V. (2003). Bayesian Nonparametrics. Springer, New York.

[9] Robins, J. M., van der Vaart, A. W. and Ventura, V. (2000). Asymptotic distribution of p-values in composite null models. J. Amer. Statist. Assoc. 95 1143–1167.

[10] Sarkar, S. K. (2002). Some results on false discovery rate in multiple testing procedures. Ann. Statist. 30 239–257.

[11] Storey, J. D. (2002). A direct approach to false discovery rates. J. Roy. Statist. Soc. Ser. B 64 479–498.

[12] Storey, J. D. (2003). The positive false discovery rate: A Bayesian interpretation and the q-value. Ann. Statist. 31 2013–2035.

[13] Tang, Y., Ghosal, S. and Roy, A. (2007). Nonparametric Bayesian estimation of positive false discovery rates. Biometrics 63 1126–1134.

[14] Tsai, C., Hsueh, H. and Chen, J. (2003). Estimation of false discovery rates testing: Application to gene microarray data. Biometrics 59 1071–1081.

[15] Wong, W. H. and Shen, X. (1995). Probability inequalities for likelihood ratios and convergence rates of sieved MLEs. Ann. Statist. 23 339–362.

2009 © Institute of Mathematical Statistics

Institute of Mathematical Statistics Collections

Institute of Mathematical Statistics Collections