Open Access
November 2017 Comparing consensus Monte Carlo strategies for distributed Bayesian computation
Steven L. Scott
Braz. J. Probab. Stat. 31(4): 668-685 (November 2017). DOI: 10.1214/17-BJPS365

Abstract

Consensus Monte Carlo is an algorithm for conducting Monte Carlo based Bayesian inference on large data sets distributed across many worker machines in a data center. The algorithm operates by running a separate Monte Carlo algorithm on each worker machine, which only sees a portion of the full data set. The worker-level posterior samples are then combined to form a Monte Carlo approximation to the full posterior distribution based on the complete data set. We compare several methods of carrying out the combination, including a new method based on approximating worker-level simulations using a mixture of multivariate Gaussian distributions. We find that resampling and kernel density based methods break down after 10 or sometimes fewer dimensions, while the new mixture-based approach works well, but the necessary mixture models take too long to fit.

Citation

Download Citation

Steven L. Scott. "Comparing consensus Monte Carlo strategies for distributed Bayesian computation." Braz. J. Probab. Stat. 31 (4) 668 - 685, November 2017. https://doi.org/10.1214/17-BJPS365

Information

Received: 1 December 2016; Accepted: 1 April 2017; Published: November 2017
First available in Project Euclid: 15 December 2017

zbMATH: 1385.65008
MathSciNet: MR3738171
Digital Object Identifier: 10.1214/17-BJPS365

Keywords: Bayesian modeling , big data , Cloud computing

Rights: Copyright © 2017 Brazilian Statistical Association

Vol.31 • No. 4 • November 2017
Back to Top