Open Access
2009 Efficient utility-based clustering over high dimensional partition spaces
Paul E. Anderson, Kieron D. Edwards, Silvia Liverani, Andrew J. Millar, Jim Q. Smith
Bayesian Anal. 4(3): 539-571 (2009). DOI: 10.1214/09-BA420

Abstract

Because of the huge number of partitions of even a moderately sized dataset, even when Bayes factors have a closed form, in model-based clustering a comprehensive search for the highest scoring (MAP) partition is usually impossible. However, when each cluster in a partition has a signature and it is known that some signatures are of scientific interest whilst others are not, it is possible, within a Bayesian framework, to develop search algorithms which are guided by these cluster signatures. Such algorithms can be expected to find better partitions more quickly. In this paper we develop a framework within which these ideas can be formalized. We then briefly illustrate the efficacy of the proposed guided search on a microarray time course data set where the clustering objective is to identify clusters of genes with different types of circadian expression profiles.

Citation

Download Citation

Paul E. Anderson. Kieron D. Edwards. Silvia Liverani. Andrew J. Millar. Jim Q. Smith. "Efficient utility-based clustering over high dimensional partition spaces." Bayesian Anal. 4 (3) 539 - 571, 2009. https://doi.org/10.1214/09-BA420

Information

Published: 2009
First available in Project Euclid: 22 June 2012

zbMATH: 1330.62253
MathSciNet: MR2551045
Digital Object Identifier: 10.1214/09-BA420

Rights: Copyright © 2009 International Society for Bayesian Analysis

Vol.4 • No. 3 • 2009
Back to Top