The Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 3, Number 4 (2009), 1710-1737.
Improving the precision of classification trees
Besides serving as prediction models, classification trees are useful for finding important predictor variables and identifying interesting subgroups in the data. These functions can be compromised by weak split selection algorithms that have variable selection biases or that fail to search beyond local main effects at each node of the tree. The resulting models may include many irrelevant variables or select too few of the important ones. Either eventuality can lead to erroneous conclusions. Four techniques to improve the precision of the models are proposed and their effectiveness compared with that of other algorithms, including tree ensembles, on real and simulated data sets.
First available in Project Euclid: 1 March 2010
Loh, Wei-Yin. Improving the precision of classification trees. Ann. Appl. Stat. 3 (2009), no. 4, 1710-1737. doi:10.1214/09-AOAS260. https://projecteuclid.org/euclid.aoas/1267453961