Abstract
A class of procedures based on "impartial trimming" (self-determined by the data) is introduced with the aim of robustifying k-means, hence the associated clustering analysis. We include a detailed study of optimal regions, showing that only nonpathological regions can arise from impartial trimming procedures. The asymptotic results provided in the paper focus on strong consistency of the suggested methods under widely general conditions. A section is devoted to exploring the performance of the procedure to detect anomalous data in simulated data sets.
Citation
J. A. Cuesta-Albertos. A. Gordaliza. C. Matrán. "Trimmed $k$-means: an attempt to robustify quantizers." Ann. Statist. 25 (2) 553 - 576, April 1997. https://doi.org/10.1214/aos/1031833664
Information