The Annals of Statistics
- Ann. Statist.
- Volume 32, Number 1 (2004), 13-29.
Process consistency for AdaBoost
Recent experiments and theoretical studies show that AdaBoost can overfit in the limit of large time. If running the algorithm forever is suboptimal, a natural question is how low can the prediction error be during the process of AdaBoost? We show under general regularity conditions that during the process of AdaBoost a consistent prediction is generated, which has the prediction error approximating the optimal Bayes error as the sample size increases. This result suggests that, while running the algorithm forever can be suboptimal, it is reasonable to expect that some regularization method via truncation of the process may lead to a near-optimal performance for sufficiently large sample size.
Ann. Statist. Volume 32, Number 1 (2004), 13-29.
First available in Project Euclid: 12 March 2004
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Jiang, Wenxin. Process consistency for AdaBoost. Ann. Statist. 32 (2004), no. 1, 13--29. doi:10.1214/aos/1079120128. http://projecteuclid.org/euclid.aos/1079120128.