Open Access
March 2018 Computationally Efficient Multivariate Spatio-Temporal Models for High-Dimensional Count-Valued Data (with Discussion)
Jonathan R. Bradley, Scott H. Holan, Christopher K. Wikle
Bayesian Anal. 13(1): 253-310 (March 2018). DOI: 10.1214/17-BA1069

Abstract

We introduce a computationally efficient Bayesian model for predicting high-dimensional dependent count-valued data. In this setting, the Poisson data model with a latent Gaussian process model has become the de facto model. However, this model can be difficult to use in high-dimensional settings, where the data may be tabulated over different variables, geographic regions, and times. These computational difficulties are further exacerbated by the fact that count-valued data are naturally non-Gaussian. Thus, many current approaches to Bayesian inference require one to carefully calibrate a Markov chain Monte Carlo (MCMC) technique. We avoid MCMC methods that require tuning by developing a new conjugate multivariate distribution. Specifically, we introduce a multivariate log-gamma distribution and provide substantial methodological development of independent interest, including results regarding conditional distributions, marginal distributions, an asymptotic relationship with the multivariate normal distribution, and full-conditional distributions for a Gibbs sampler. To incorporate dependence between variables, regions, and time points, we use a multivariate spatio-temporal mixed effects model (MSTM). To demonstrate our methodology, we use data obtained from the US Census Bureau’s Longitudinal Employer-Household Dynamics (LEHD) program. In particular, our approach is motivated by the LEHD’s Quarterly Workforce Indicators (QWIs), which constitute current estimates of important US economic variables.
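
To illustrate why a conjugate log-gamma prior can remove the need for MCMC tuning, the following is a minimal univariate sketch, not the multivariate distribution developed in the paper. It assumes a single count Z | Y ~ Poisson(exp(Y)) and a log-gamma prior on Y with density proportional to exp(alpha * y - kappa * exp(y)), so that exp(Y) is gamma distributed with shape alpha and rate kappa. Under these assumptions the posterior of Y is again log-gamma, with shape alpha + Z and rate kappa + 1, and can be sampled directly. The function names and the parameterization are illustrative assumptions.

```python
import math
import random


def log_gamma_posterior(alpha, kappa, count):
    """Conjugate update for Y when count | Y ~ Poisson(exp(Y)).

    Assumed prior density: proportional to exp(alpha*y - kappa*exp(y)).
    Multiplying by the Poisson likelihood exp(count*y - exp(y)) gives the
    same form with shape alpha + count and rate kappa + 1.
    """
    return alpha + count, kappa + 1.0


def sample_log_gamma(alpha, kappa, rng):
    """Draw Y = log(G), where G ~ Gamma(shape=alpha, rate=kappa)."""
    # random.gammavariate takes a scale parameter, so pass 1 / rate.
    return math.log(rng.gammavariate(alpha, 1.0 / kappa))


rng = random.Random(0)
post_alpha, post_kappa = log_gamma_posterior(alpha=2.0, kappa=1.0, count=5)

# Direct posterior draws: no proposal distribution or step size to tune.
draws = [sample_log_gamma(post_alpha, post_kappa, rng) for _ in range(20000)]
posterior_mean_rate = sum(math.exp(y) for y in draws) / len(draws)
```

Because the update is available in closed form, each draw is an exact posterior sample. In this sketch the Monte Carlo estimate `posterior_mean_rate` should be close to the analytic posterior mean of the Poisson rate, `post_alpha / post_kappa`.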

Citation


Jonathan R. Bradley, Scott H. Holan, Christopher K. Wikle. "Computationally Efficient Multivariate Spatio-Temporal Models for High-Dimensional Count-Valued Data (with Discussion)." Bayesian Anal. 13(1): 253-310, March 2018. https://doi.org/10.1214/17-BA1069

Information

Published: March 2018
First available in Project Euclid: 11 October 2017

zbMATH: 06873726
MathSciNet: MR3773410
Digital Object Identifier: 10.1214/17-BA1069

Subjects:
Primary: 62H11
Secondary: 62P12

Keywords: Aggregation, American Community Survey, Bayesian hierarchical model, big data, Longitudinal Employer-Household Dynamics (LEHD) program, Markov chain Monte Carlo, non-Gaussian, Quarterly Workforce Indicators
