Open Access
June 2006 Model-based subspace clustering
Peter D. Hoff
Bayesian Anal. 1(2): 321-344 (June 2006). DOI: 10.1214/06-BA111

Abstract

We discuss a model-based approach to identifying clusters of objects based on subsets of attributes, so that the attributes that distinguish a cluster from the rest of the population may depend on the cluster being considered. The method is based on a Pólya urn cluster model for multivariate means and variances, resulting in a multivariate Dirichlet process mixture model. This particular model-based approach accommodates outliers and allows for the incorporation of application-specific data features into the clustering scheme. For example, in an analysis of genetic CGH array data we are able to design a clustering method that accounts for spatial dependence of chromosomal abnormalities.

Citation

Download Citation

Peter D. Hoff. "Model-based subspace clustering." Bayesian Anal. 1 (2) 321 - 344, June 2006. https://doi.org/10.1214/06-BA111

Information

Published: June 2006
First available in Project Euclid: 22 June 2012

zbMATH: 1331.62309
MathSciNet: MR2221267
Digital Object Identifier: 10.1214/06-BA111

Keywords: COSA , Dirichlet process , mixture model , nonparametric Bayes , Pólya urn , unsupervised learning , Variable selection

Rights: Copyright © 2006 International Society for Bayesian Analysis

Vol.1 • No. 2 • June 2006
Back to Top