Bayesian Analysis

Informative g-Priors for Logistic Regression

Timothy E. Hanson, Adam J. Branscum, and Wesley O. Johnson

Full-text: Open access

Abstract

Eliciting information from experts for use in constructing prior distributions for logistic regression coefficients can be challenging. The task is especially difficult when the model contains many predictor variables, because the expert is asked to provide summary information about the probability of “success” for many subgroups of the population. Often, however, experts are confident only in their assessment of the population as a whole. This paper is about incorporating such overall information easily into a logistic regression data analysis using g-priors. We present a version of the g-prior such that the prior distribution on the overall population logistic regression probabilities of success can be set to match a beta distribution. A simple data augmentation formulation allows implementation in standard statistical software packages.

Article information

Source
Bayesian Anal., Volume 9, Number 3 (2014), 597-612.

Dates
First available in Project Euclid: 5 September 2014

Permanent link to this document
https://projecteuclid.org/euclid.ba/1409921107

Digital Object Identifier
doi:10.1214/14-BA868

Mathematical Reviews number (MathSciNet)
MR3256057

Zentralblatt MATH identifier
1327.62395

Keywords
Binomial regression Generalized linear model Prior elicitation

Citation

Hanson, Timothy E.; Branscum, Adam J.; Johnson, Wesley O. Informative $g$ -Priors for Logistic Regression. Bayesian Anal. 9 (2014), no. 3, 597--612. doi:10.1214/14-BA868. https://projecteuclid.org/euclid.ba/1409921107


Export citation

References

  • Agresti, A. (2013). Categorical Data Analysis. Hoboken, New Jersey: John Wiley & Sons, third edition.
  • Aitchison, J. and Shen, S. (1980). “Logistic-normal distributions: Some properties and uses.” Biometrika, 67: 261–272.
  • Bedrick, E., Christensen, R., and Johnson, W. (1996). “A new perspective on priors for generalized linear models.” Journal of the American Statistical Association, 91: 1450–1460.
  • Bové, D. and Held, L. (2011). “Hyper-$g$ priors for generalized linear models.” Bayesian Analysis, 6: 1–24.
  • Chen, M.-H., Ibrahim, J., and Kim, S. (2008). “Properties and implementation of Jeffreys’ prior in binomial regression models.” Journal of the American Statistical Association, 103: 1659–1664.
  • Christensen, R., Johnson, W., Branscum, A., and Hanson, T. (2010). Bayesian Ideas and Data Analysis: An Introduction for Scientists and Statisticians. Boca Raton, Florida: CRC Press.
  • Firth, D. (1993). “Bias reduction of maximum likelihood estimates.” Biometrika, 80: 27–38.
  • Fouskakis, D., Ntzoufras, I., and Draper, D. (2009). “Bayesian variable selection using cost-adjusted BIC, with application to cost-effective measurement of quality of health care.” Annals of Applied Statistics, 3: 663–690.
  • Gelman, A., Jakulin, A., Pittau, M., and Su, Y.-S. (2008). “A weakly informative default prior distribution for logistic and other regression models.” Annals of Applied Statistics, 2: 1360–1383.
  • Gupta, M. and Ibrahim, J. (2009). “An information matrix prior for Bayesian analysis in generalized linear models with high dimensional data.” Statistica Sinica, 19: 1641–1663.
  • Gustafson, P. (2007). “On robustness and model flexibility in survival analysis: transformed hazards models and average effects.” Biometrics, 63: 69–77.
  • Heinze, G. and Ploner, M. (2003). “Fixing the nonconvergence bug in logistic regression with SPLUS and SAS.” Computer Methods and Programs in Biomedicine, 71: 181–187.
  • Ibrahim, J. and Laud, P. (1991). “On Bayesian analysis of generalized linear models using Jeffreys’ prior.” Journal of the American Statistical Association, 86: 981–986.
  • Jara, A. and Hanson, T. (2011). “A class of mixtures of dependent tailfree processes.” Biometrika, 98: 553–566.
  • Jara, A., Hanson, T., Quintana, F., Müller, P., and Rosner, G. (2011). “DPpackage: Bayesian non- and semi-parametric modelling in R.” Journal of Statistical Software, 40: 1–30.
  • Liu, J. and Gustafson, P. (2008). “Average effects and omitted interactions in linear regression models.” International Statistical Review, 76: 419–432.
  • Marin, J.-M. and Robert, C. (2007). Bayesian Core: A Practical Approach to Computational Bayesian Statistics. New York, New York: Springer-Verlag.
  • R Core Team (2013). “R: A language and environment for statistical computing.” Technical report, R Foundation for Statistical Computing, Vienna, Austria. URL http://www.Rproject. org/.
  • Rathbun, S. and Fei, S. (2006). “A spatial zero-inflated Poisson regression model for oak regeneration.” Environmental and Ecological Statistics, 13: 409–426.
  • Seaman, J., Seaman, J., and Stamey, J. (2012). “Hidden dangers of specifying noninformative priors.” The American Statistician, 66: 77–84.
  • Zellner, A. (1983). “Applications of Bayesian analysis in econometrics.” The Statistician, 32: 23–34.