Open Access
September 2022
Quantifying Observed Prior Impact
David E. Jones, Robert N. Trangucci, Yang Chen
Bayesian Anal. 17(3): 737-764 (September 2022). DOI: 10.1214/21-BA1271

Abstract

When summarizing a Bayesian analysis, it is important to quantify the contribution of the prior distribution to the final posterior inference because this informs other researchers whether the prior information needs to be carefully scrutinized, and whether alternative priors are likely to substantially alter the conclusions drawn. One appealing and interpretable way to do this is to report an effective prior sample size (EPSS), which captures how many observations the information in the prior distribution corresponds to. However, typically the most important aspect of the prior distribution is its location relative to the data, and therefore traditional information measures are somewhat deficient for the purpose of quantifying EPSS, because they concentrate on the variance or spread of the prior distribution (in isolation from the data). To partially address this difficulty, Reimherr et al. (2014) introduced a class of EPSS measures based on prior-likelihood discordance. In this paper, we take this idea further by proposing a new measure of EPSS that incorporates not only the general mathematical form of the likelihood (as proposed by Reimherr et al., 2014) but also the specific data at hand. Thus, our measure considers the location of the prior relative to the current observed data, rather than relative to the average of multiple datasets from the working model, the latter being the approach taken by Reimherr et al. (2014). Consequently, our measure can be highly variable, but we demonstrate that this is because the impact of a prior on a Bayesian analysis can intrinsically be highly variable. Our measure is called the (posterior) mean Observed Prior Effective Sample Size (mOPESS), and is a Bayes estimate of a meaningful quantity. The mOPESS communicates well the extent to which inference is determined by the prior, or framed differently, the amount of sampling effort saved due to having relevant prior information.
We illustrate our ideas through a number of examples including Gaussian conjugate and non-conjugate models (continuous observations), a Beta-Binomial model (discrete observations), and a linear regression model (two unknown parameters).
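
To make the limitation of variance-based measures concrete, the following is a minimal sketch of the textbook prior sample size for a Beta-Binomial model, in which a Beta(a, b) prior is counted as a + b pseudo-observations. It is not the mOPESS proposed in the paper, and the function names and the numbers used are illustrative assumptions, not taken from the article. The sketch shows that this classical count is the same whether the observed data agree or conflict with the prior, even though the prior's actual pull on the posterior mean differs.

```python
from fractions import Fraction


def classical_epss(a, b):
    """Textbook prior sample size of a Beta(a, b) prior: a + b pseudo-observations.

    This depends only on the prior, not on the observed data.
    """
    return a + b


def prior_weight(a, b, n):
    """Weight placed on the prior mean in the posterior mean after n trials."""
    return Fraction(a + b, a + b + n)


def posterior_mean(a, b, successes, n):
    """Posterior mean of the success probability under a Beta(a, b) prior."""
    return Fraction(a + successes, a + b + n)


def shift_from_mle(a, b, successes, n):
    """Absolute distance between the posterior mean and the sample proportion."""
    return abs(posterior_mean(a, b, successes, n) - Fraction(successes, n))


if __name__ == "__main__":
    a, b, n = 2, 8, 20  # hypothetical prior Beta(2, 8) with prior mean 0.2
    for successes in (4, 16):  # data agreeing with the prior, then conflicting
        print(
            f"successes={successes}/{n}",
            f"classical EPSS={classical_epss(a, b)}",
            f"prior weight={prior_weight(a, b, n)}",
            f"posterior mean={posterior_mean(a, b, successes, n)}",
            f"shift from MLE={shift_from_mle(a, b, successes, n)}",
        )
```

In both runs the classical count is 10 and the prior weight is 1/3, yet the posterior mean moves away from the sample proportion only when the data conflict with the prior. This is the kind of data-dependent prior impact that the abstract argues a location-aware measure should capture.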

Funding Statement

This work is supported by NSF DMS-1811083 (PI: Yang Chen, 2018–2021).

Acknowledgments

The authors thank Prof. Xiao-Li Meng from Harvard University for helpful discussions and Dr. Vinay Kashyap from the Harvard-Smithsonian Center for Astrophysics (CfA) for collaborating on the astronomical instrument calibration problem.

Citation

David E. Jones, Robert N. Trangucci, Yang Chen. "Quantifying Observed Prior Impact." Bayesian Anal. 17(3): 737-764, September 2022. https://doi.org/10.1214/21-BA1271

Information

Published: September 2022
First available in Project Euclid: 20 May 2021

MathSciNet: MR4483237
Digital Object Identifier: 10.1214/21-BA1271

Keywords: Bayes estimate, effective prior sample size, sensitivity analysis, statistical information, Wasserstein distance
