Bernoulli

  • Bernoulli
  • Volume 8, Number 5 (2002), 561-576.

Nonparametric estimation of conditional quantiles using quantile regression trees

Probal Chaudhuri and Wei-Yin Loh

Full-text: Open access

Abstract

A nonparametric regression method that blends key features of piecewise polynomial quantile regression and tree-structured regression based on adaptive recursive partitioning of the covariate space is investigated. Unlike least-squares regression trees, which concentrate on modelling the relationship between the response and the covariates at the centre of the response distribution, our quantile regression trees can provide insight into the nature of that relationship at the centre as well as the tails of the response distribution. Our nonparametric regression quantiles have piecewise polynomial forms, where each piece is obtained by fitting a polynomial quantile regression model to the data in a terminal node of a binary decision tree. The decision tree is constructed by recursively partitioning the data based on repeated analyses of the residuals obtained after model fitting with quantile regression. One advantage of the tree structure is that it provides a simple summary of the interactions among the covariates. The asymptotic behaviour of piecewise polynomial quantile regression estimates and the associated derivative estimates are studied under appropriate regularity conditions. The methodology is illustrated with an example on the incidence rates of mumps in the United States.

Article information

Source
Bernoulli, Volume 8, Number 5 (2002), 561-576.

Dates
First available in Project Euclid: 4 March 2004

Permanent link to this document
https://projecteuclid.org/euclid.bj/1078435218

Mathematical Reviews number (MathSciNet)
MR2003i:62060

Zentralblatt MATH identifier
1009.62031

Keywords
derivative estimate GUIDE algorithm piecewise polynomial estimates recursive partitioning tree-structured regression uniform asymptotic consistency Vapnik-Chervonenkis class

Citation

Chaudhuri, Probal; Loh, Wei-Yin. Nonparametric estimation of conditional quantiles using quantile regression trees. Bernoulli 8 (2002), no. 5, 561--576. https://projecteuclid.org/euclid.bj/1078435218


Export citation

References

  • [1] Breiman, L., Friedman, J.H., Olshen, R.A. and Stone, C.J. (1984) Classification and Regression Trees. Belmont, CA: Wadsworth.
  • [2] Buchinsky, M. (1994) Changes in the U.S. wage structure 1963-1987: Application of quantile regression. Econometrica, 62, 405-458.
  • [3] Chaudhuri, P. (1991a) Global nonparametric estimation of conditional quantile functions and their derivatives. J. Multivariate Anal., 39, 246-269. Abstract can also be found in the ISI/STMA publication
  • [4] Chaudhuri, P. (1991b) Nonparametric estimates of regression quantiles and their local Bahadur representation. Ann. Statist., 19, 760-777. Abstract can also be found in the ISI/STMA publication
  • [5] Chaudhuri, P. (2000) Asymptotic consistency of median regression trees. J. Statist. Plann. Inference, 91, 229-238. Abstract can also be found in the ISI/STMA publication
  • [6] Chaudhuri, P., Huang, M.C., Loh, W.-Y. and Yao, R. (1994) Piecewise polynomial regression trees. Statist. Sinica, 4, 143-167. Abstract can also be found in the ISI/STMA publication
  • [7] Chaudhuri, P., Lo, W.-D., Loh, W.-Y. and Yang, C.-C. (1995) Generalized regression trees. Statist. Sinica, 5, 641-666. Abstract can also be found in the ISI/STMA publication
  • [8] Chaudhuri, P., Doksum, K. and Samarov, A. (1997) On average derivative quantile regression. Ann Statist., 25, 715-744. Abstract can also be found in the ISI/STMA publication
  • [9] Cheng, K.F. (1983) Nonparametric estimators for percentile regression functions. Comm. Statist. Theory Methods, 12, 681-692.
  • [10] Cheng, K.F. (1984) Nonparametric estimation of regression function using linear combinations of sample quantile regression function. Sankhya Ser. A, 46, 287-302.
  • [11] Dabrowska, D. (1992) Nonparametric quantile regression with censored data. Sankhya Ser. A, 54, 252-259. Abstract can also be found in the ISI/STMA publication
  • [12] Doyle, P. (1973) The use of Automatic Interaction Detector and similar search procedures. Oper. Res. Quart., 24, 465-467.
  • [13] Dudley, R.M. (1978) Central limit theorems for empirical measures. Ann. Probab., 6, 899-929.
  • [14] Correction (1979): 7, 909-911.
  • [15] Fan, J. and Gijbels, I. (1996) Local Polynomial Modeling and Its Applications. London: Chapman & Hall.
  • [16] Fan, J., Hu, T.C. and Truong, Y.K. (1994) Robust nonparamentric function estimation. Scand. J. Statist., 21, 433-446.
  • [17] Han, A. (1987) A nonparametric analysis of transformations. J. Econometrics, 35, 191-209.
  • [18] Härdle, W. and Stoker, T. (1989) Investigating smooth multiple regression by the method of average derivatives. J. Amer. Statist. Assoc., 84, 986-995.
  • [19] Hendricks, W. and Koenker, R. (1992) Hierarchical spline model for conditional quantiles and the demand for electricity. J. Amer. Statist. Assoc., 87, 58-68. Abstract can also be found in the ISI/STMA publication
  • [20] Hogg, R.V. (1975) Estimates of percentile regression line using salary data. J. Amer. Statist. Assoc., 70, 56-59.
  • [21] Janssen, P. and Veraverbeke, N. (1987) On nonparametric regression estimators based on regression quantiles. Comm. Statist. Theory Methods, 16, 383-396.
  • [22] Koenker, R. and Bassett, G. (1978) Regression quantiles. Econometrica, 46, 33-50.
  • [23] Koenker, R. and D´Orey, V. (1987) Computing regression quantiles. Appl. Statist., 36, 383-393.
  • [24] Koenker, R., Ng, P. and Portnoy, S. (1994) Quantile smoothing splines. Biometrika, 81, 673-680. Abstract can also be found in the ISI/STMA publication
  • [25] Lejeune, M.G. and Sarda, P. (1988) Quantile regression: a nonparametric approach. Comput. Statist. Data Anal., 6, 229-239. Abstract can also be found in the ISI/STMA publication
  • [26] Loh, W.-Y. (2002) Regression trees with unbiased variable selection and interaction detection. Statist. Sinica, 12, 361-386.
  • [27] Loh, W.-Y. and Shih, Y.-S. (1997) Split selection methods for classification trees. Statist. Sinica, 7, 815-840. Abstract can also be found in the ISI/STMA publication
  • [28] Newey, W.K. and Stoker, T.M. (1993) Efficiency of weighted average derivative estimators and index models. Econometrica, 61, 1199-1223. Abstract can also be found in the ISI/STMA publication
  • [29] Powell, J., Stock, J. and Stoker, T. (1989) Semiparametric estimation of index coefficients. Econometrica, 57, 1403-1430. Abstract can also be found in the ISI/STMA publication
  • [30] Samarov, A. (1993) Exploring regression structure using nonparametric functional estimation. J. Amer. Statist. Assoc., 88, 836-849. Abstract can also be found in the ISI/STMA publication
  • [31] Sherman, R. (1993) The limiting distribution of the maximum rank correlation estimator. Econometrica, 61, 123-137. Abstract can also be found in the ISI/STMA publication
  • [32] Shorack, G.R. and Wellner, J.A. (1986) Empirical Processes with Applications to Statistics. New York: Wiley.
  • [33] Truong, Y.K.N. (1989) Asymptotic properties of kernel estimators based on local medians. Ann. Statist., 17, 606-617. Abstract can also be found in the ISI/STMA publication
  • [34] Vapnik, V.N. and Chervonenkis, A.Ya. (1971) On the uniform convergence of relative frequencies of events to their probabilities. Theory Probab. Appl., 16, 264-280.
  • [35] Wand, M.P. and Jones, M.C. (1995) Kernel Smoothing. London: Chapman & Hall.
  • [36] Welsh, A.H. (1996) Robust estimation of smooth regression and spread functions and their derivatives. Statist. Sinica, 6, 347-366. Abstract can also be found in the ISI/STMA publication