May 2023 Distributed Bayesian Inference in Massive Spatial Data
Rajarshi Guhaniyogi, Cheng Li, Terrance Savitsky, Sanvesh Srivastava
Author Affiliations +
Statist. Sci. 38(2): 262-284 (May 2023). DOI: 10.1214/22-STS868

Abstract

Gaussian process (GP) regression is computationally expensive in spatial applications involving massive data. Various methods address this limitation, including a small number of Bayesian methods based on distributed computations (or the divide-and-conquer strategy). Focusing on the latter literature, we achieve three main goals. First, we develop an extensible Bayesian framework for distributed spatial GP regression that embeds many popular methods. The proposed framework has three steps that partition the entire data into many subsets, apply a readily available Bayesian spatial process model in parallel on all the subsets, and combine the posterior distributions estimated on all the subsets into a pseudo posterior distribution that conditions on the entire data. The combined pseudo posterior distribution replaces the full data posterior distribution in prediction and inference problems. Demonstrating our framework’s generality, we extend posterior computations for (nondistributed) spatial process models with a stationary full-rank and a nonstationary low-rank GP priors to the distributed setting. Second, we contrast the empirical performance of popular distributed approaches with some widely-used, nondistributed alternatives and highlight their relative advantages and shortcomings. Third, we provide theoretical support for our numerical observations and show that the Bayes L2-risks of the combined posterior distributions obtained from a subclass of the divide-and-conquer methods achieves the near-optimal convergence rate in estimating the true spatial surface with various types of covariance functions. Additionally, we provide upper bounds on the number of subsets to achieve these near-optimal rates.

Funding Statement

Rajarshi Guhaniyogi’s and Sanvesh Srivastava’s research are partially supported by from Office of Naval Research award no. N00014-18-1-2741 and National Science Foundation DMS-2220840/1854667.
Cheng Li’s research is supported by Singapore Ministry of Education Academic Research Funds Tier 1 Grants R155000172133 and R155000201114.

Acknowledgments

The four authors contributed equally to this work.

Citation

Download Citation

Rajarshi Guhaniyogi. Cheng Li. Terrance Savitsky. Sanvesh Srivastava. "Distributed Bayesian Inference in Massive Spatial Data." Statist. Sci. 38 (2) 262 - 284, May 2023. https://doi.org/10.1214/22-STS868

Information

Published: May 2023
First available in Project Euclid: 7 May 2023

MathSciNet: MR4597336
zbMATH: 07708431
Digital Object Identifier: 10.1214/22-STS868

Keywords: Distributed Bayesian inference , Gaussian process , low-rank Gaussian process , massive spatial data , Wasserstein barycenter

Rights: Copyright © 2023 Institute of Mathematical Statistics

Vol.38 • No. 2 • May 2023
Back to Top