Bayesian Analysis

Local-Mass Preserving Prior Distributions for Nonparametric Bayesian Models

Juhee Lee, Steven N. MacEachern, Yiling Lu, and Gordon B. Mills

Full-text: Open access


We address the problem of prior specification for models involving the two-parameter Poisson-Dirichlet process. These models are sometimes partially subjectively specified and are always partially (or fully) specified by a rule. We develop prior distributions based on local mass preservation. The robustness of posterior inference to an arbitrary choice of overdispersion under the proposed and current priors is investigated. Two examples are provided to demonstrate the properties of the proposed priors. We focus on the three major types of inference: clustering of the parameters of interest, estimation and prediction. The new priors are found to provide more stable inference about clustering than traditional priors while showing few drawbacks. Furthermore, it is shown that more stable clustering results in more stable inference for estimation and prediction. We recommend the local-mass preserving priors as a replacement for the traditional priors.

Article information

Bayesian Anal., Volume 9, Number 2 (2014), 307-330.

First available in Project Euclid: 26 May 2014

Permanent link to this document

Digital Object Identifier

Mathematical Reviews number (MathSciNet)

Zentralblatt MATH identifier

nonparametric Bayes Dirichlet process two-parameter Poisson-Dirichlet process local mass prior misspecification clustering


Lee, Juhee; MacEachern, Steven N.; Lu, Yiling; Mills, Gordon B. Local-Mass Preserving Prior Distributions for Nonparametric Bayesian Models. Bayesian Anal. 9 (2014), no. 2, 307--330. doi:10.1214/13-BA857.

Export citation


  • Antoniak, C. E. (1974). “Mixtures of Dirichlet Processes with Applications to Bayesian Nonparametric Problems.” The Annals of Statistics, 2: 1152–1174.
  • Berger, J. O. (1993). Statistical Decision Theory and Bayesian Analysis. Springer-Verlag Inc.
  • Berger, J. O. and Bernardo, J. M. (1992). “On the development of the reference prior method.” In Bernardo, J. M. e., Berger, J. O. e., Dawid, A. P. e., and Smith, A. F. M. e. (eds.), Bayesian Statistics 4. Proceedings of the Fourth Valencia International Meeting, 859. Clarendon Press [Oxford University Press].
  • Berger, J. O., Bernardo, J. M., and Sun, D. (2009). “The Formal Definition of Reference Priors.” The Annals of Statistics, 37(2): 905–938.
  • Bernardo, J. M. (1979). “Reference Posterior Distributions for Bayesian Inference (C/R P128-147).” Journal of the Royal Statistical Society, Series B: Methodological, 41: 113–128.
  • Blackwell, D. and MacQueen, J. B. (1973). “Ferguson Distributions Via Pólya Urn Schemes.” The Annals of Statistics, 1: 353–355.
  • Bush, C. A., Lee, J., and MacEachern, S. N. (2010). “Minimally informative prior distributions for non-parametric Bayesian analysis.” Journal of the Royal Statistical Society, Series B: Methodological, 72: 253–268.
  • De Finetti, B. (1975). Theory of Probability: A Critical Introductory Treatment, Vol. 2. John Wiley & Sons.
  • Escobar, M. D. (1994). “Estimating Normal Means with a Dirichlet Process Prior.” Journal of the American Statistical Association, 89: 268–277.
  • Escobar, M. D. and West, M. (1995). “Bayesian Density Estimation and Inference Using Mixtures.” Journal of the American Statistical Association, 90: 577–588.
  • Ferguson, T. S. (1973). “A Bayesian Analysis of Some Nonparametric Problems.” The Annals of Statistics, 1: 209–230.
  • Ghosal, S. and van der Vaart, A. W. (2001). “Entropies and Rates of Convergence for Maximum Likelihood and Bayes Estimation for Mixtures of Normal Densities.” The Annals of Statistics, 29(5): 1233–1263.
  • Hirano, K. (2002). “Semiparametric Bayesian Inference in Autoregressive Panel Data Models.” Econometrica, 70(2): 781–799.
  • Hjort, N. L., Holmes, C., Müller, P., and Walker, S. G. (eds.) (2010). Bayesian Nonparametrics. Cambridge University Press.
  • Ishwaran, H. and James, L. F. (2002). “Approximate Dirichlet Process Computing in Finite Normal Mixtures: Smoothing and Prior Information.” Journal of Computational and Graphical Statistics, 11(3): 508–532.
  • Jeffreys, H. (1998). Theory of Probability. Oxford University Press.
  • Ji, C., Merl, D., Kepler, T. B., and West, M. (2009). “Spatial Mixture Modelling for Unobserved Point Processes: Examples in Immunofluorescence Histology.” Bayesian Analysis, 4(2): 297–316.
  • Kleijn, B. J. K. and van der Vaart, A. W. (2006). “Misspecification in Infinite-dimensional Bayesian Statistics.” The Annals of Statistics, 34(2): 837–877.
  • Kottas, A., Müller, P., and Quintana, F. (2005). “Nonparametric Bayesian modeling for multivariate ordinal data.” Journal of Computational and Graphical Statistics, 14(3): 610–625.
  • Lindley, D. (1965). Introduction to Probability and Statistics. Cambridge University.
  • Liu, J. S. (1996). “Nonparametric hierarchical Bayes via sequential imputations.” The Annals of Statistics, 24(3): 911–930.
  • MacEachern, S. N. and Guha, S. (2011). “Parametric and Semiparametric Hypotheses in the Linear Model.” The Canadian Journal of Statistics / La Revue Canadienne de Statistique, 39(1): 165–180.
  • Navarrete, C., Quintana, F., and Müller, P. (2008). “Some Issues on Nonparametric Bayesian Modeling Using Species Sampling Models.” Statistical Modelling International Journal, 8(1): 3–21.
  • Nieto-Barajas, L., Müller, P., Ji, Y., Lu, Y., and Mills, G. (2012). “A Time-Series DDP for Functional Proteomics Profiles.” Biometrics, 68(3): 859–868.
  • Peters, R. H. (1983). The Ecological Implications of Body Size. Cambridge: Cambridge University Press.
  • Pitman, J. (1996). “Some developments of the Blackwell-MacQueen urn scheme.” Lecture Notes-Monograph Series, 245–267.
  • Pitman, J. and Yor, M. (1997). “The two-parameter Poisson-Dirichlet distribution derived from a stable subordinator.” The Annals of Probability, 25(2): 855–900.
  • Quintana, F. A. (2006). “A Predictive View of Bayesian Clustering.” Journal of Statistical Planning and Inference, 136(8): 2407–2429.
  • Quintana, F. A. and Iglesias, P. L. (2003). “Bayesian Clustering and Product Partition Models.” Journal of the Royal Statistical Society, Series B: Statistical Methodology, 65(2): 557–574.
  • Rousseau, J. (2010). “Rates of Convergence for the Posterior Distributions of Mixtures of Betas and Adaptive Nonparametric Estimation of the Density.” The Annals of Statistics, 38(1): 146–180.
  • Salinetti, G. (2003). “New Tools for Consistency in Bayesian Nonparametrics.” In Bayesian Statistics 7, 369–384. Oxford University Press.
  • Savage, L. J. (1972). The Foundations of Statistics. Dover Publications, Inc.
  • Tibes, R., Qiu, Y., Lu, Y., Hennessy, B., Andreeff, M., Mills, G. B., and Kornblau, S. M. (2006). “Reverse phase protein array: validation of a novel proteomic technology and utility for analysis of primary leukemia specimens and hematopoietic stem cells.” Molecular cancer therapeutics, 5(10): 2512–2521.
  • Tusher, V. G., Tibshirani, R., and Chu, G. (2001). “Significance analysis of microarrays applied to the ionizing radiation response.” In Proceedings of the National Academy of Sciences of the United States of America, National Academy of Sciences, volume 98, 5116–5121. Washington, D.C.
  • Weisberg, S. (1985). Applied Linear Regression. John Wiley & Sons.