## The Annals of Statistics

- Ann. Statist.
- Volume 24, Number 6 (1996), 2319-2349.

### An overtraining-resistant stochastic modeling method for pattern recognition

#### Abstract

We will introduce a generic approach for solving problems in pattern recognition based on the synthesis of accurate multiclass discriminators from large numbers of very inaccurate "weak" models through the use of discrete stochastic processes. Contrary to the standard expectation held for the many statistical and heuristic techniques normally associated with the field, a significant feature of this method of "stochastic modeling" is its resistance to so-called "overtraining." The drop in performance of any stochastic model in going from training to test data remains comparable to that of the component weak models from which it is synthesized; and since these component models are very simple, their performance drop is small, resulting in a stochastic model whose performance drop is also small despite its high level of accuracy.

#### Article information

**Source**

Ann. Statist., Volume 24, Number 6 (1996), 2319-2349.

**Dates**

First available in Project Euclid: 16 September 2002

**Permanent link to this document**

https://projecteuclid.org/euclid.aos/1032181157

**Digital Object Identifier**

doi:10.1214/aos/1032181157

**Mathematical Reviews number (MathSciNet)**

MR1425956

**Zentralblatt MATH identifier**

0877.68102

**Subjects**

Primary: 68T10: Pattern recognition, speech recognition {For cluster analysis, see 62H30} 68T05: Learning and adaptive systems [See also 68Q32, 91E40]

**Keywords**

Pattern recognition machine learning

#### Citation

Kleinberg, E. M. An overtraining-resistant stochastic modeling method for pattern recognition. Ann. Statist. 24 (1996), no. 6, 2319--2349. doi:10.1214/aos/1032181157. https://projecteuclid.org/euclid.aos/1032181157