Correcting for selection bias via cross-validation in the classification of microarray data

Chevelu, J., University of Rennes; McLachlan, G. J., University of Queensland; Zhu, J., University of Queensland

doi:10.1214/193940307000000284

VOL. 1 | 2008 Correcting for selection bias via cross-validation in the classification of microarray data

J. Chevelu, G. J. McLachlan, J. Zhu

Editor(s) N. Balakrishnan, Edsel A. Peña, Mervyn J. Silvapulle

Inst. Math. Stat. (IMS) Collect., 2008: 364-376 (2008) DOI: 10.1214/193940307000000284

Abstract

There is increasing interest in the use of diagnostic rules based on microarray data. These rules are formed by considering the expression levels of thousands of genes in tissue samples taken on patients of known classification with respect to a number of classes, representing, say, disease status or treatment strategy. As the final versions of these rules are usually based on a small subset of the available genes, there is a selection bias that has to be corrected for in the estimation of the associated error rates. We consider the problem using cross-validation. In particular, we present explicit formulae that are useful in explaining the layers of validation that have to be performed in order to avoid improperly cross-validated estimates.

Information

Published: 1 January 2008

First available in Project Euclid: 1 April 2008

MathSciNet: MR2462219

Digital Object Identifier: 10.1214/193940307000000284

Subjects:

Primary: 62H30

Secondary: 62H12

Keywords: cross-validation , discriminant analysis , error rate estimation , gene expression data , selection bias

CHAPTER
13 PAGES

DOWNLOAD PDF + SAVE TO MY LIBRARY

< Previous Chapter

|

Next Chapter >

Beyond Parametrics in Interdisciplinary Research: Festschrift in Honor of Professor Pranab K. Sen

Vol. 1 • 1 January 2008

Institute of Mathematical Statistics

Abstract

Information

KEYWORDS/PHRASES

PUBLICATION TITLE:

PUBLICATION YEARS