Brazilian Journal of Probability and Statistics
- Braz. J. Probab. Stat.
- Volume 33, Number 2 (2019), 267-279.
The coreset variational Bayes (CVB) algorithm for mixture analysis
The pressing need for improved methods for analysing and coping with big data has opened up a new area of research for statisticians. Image analysis is an area where there is typically a very large number of data points to be processed per image, and often multiple images are captured over time. These issues make it challenging to design methodology that is reliable and yet still efficient enough to be of practical use. One promising emerging approach for this problem is to reduce the amount of data that actually has to be processed by extracting what we call coresets from the full dataset; analysis is then based on the coreset rather than the whole dataset. Coresets are representative subsamples of data that are carefully selected via an adaptive sampling approach. We propose a new approach called coreset variational Bayes (CVB) for mixture modelling; this is an algorithm which can perform a variational Bayes analysis of a dataset based on just an extracted coreset of the data. We apply our algorithm to weed image analysis.
Braz. J. Probab. Stat., Volume 33, Number 2 (2019), 267-279.
Received: February 2017
Accepted: November 2017
First available in Project Euclid: 4 March 2019
Permanent link to this document
Digital Object Identifier
Liu, Qianying; McGrory, Clare A.; Baxter, Peter W. J. The coreset variational Bayes (CVB) algorithm for mixture analysis. Braz. J. Probab. Stat. 33 (2019), no. 2, 267--279. doi:10.1214/17-BJPS387. https://projecteuclid.org/euclid.bjps/1551690034