We propose a new method for class prediction in DNA microarray studies based on an enhancement of the nearest prototype classifier. Our technique uses "shrunken" centroids as prototypes for each class to identify the subsets of the genes that best characterize each class. The method is general and can be applied to the other high-dimensional classification problems. The method is illustrated on data from two gene expression studies: lymphoma and cancer cell lines.
"Class Prediction by Nearest Shrunken Centroids, with Applications to DNA Microarrays." Statist. Sci. 18 (1) 104 - 117, February 2003. https://doi.org/10.1214/ss/1056397488