The Annals of Statistics

Accelerated convergence for nonparametric regression with coarsened predictors

Aurore Delaigle, Peter Hall, and Hans-Georg Müller
Source: Ann. Statist. Volume 35, Number 6 (2007), 2639-2653.

Abstract

We consider nonparametric estimation of a regression function for a situation where precisely measured predictors are used to estimate the regression curve for coarsened, that is, less precise or contaminated predictors. Specifically, while one has available a sample (W1, Y1), …, (Wn, Yn) of independent and identically distributed data, representing observations with precisely measured predictors, where E(Yi|Wi)=g(Wi), instead of the smooth regression function g, the target of interest is another smooth regression function m that pertains to predictors Xi that are noisy versions of the Wi. Our target is then the regression function m(x)=E(Y|X=x), where X is a contaminated version of W, that is, X=W+δ. It is assumed that either the density of the errors is known, or replicated data are available resembling, but not necessarily the same as, the variables X. In either case, and under suitable conditions, we obtain $\sqrt{n}$-rates of convergence of the proposed estimator and its derivatives, and establish a functional limit theorem. Weak convergence to a Gaussian limit process implies pointwise and uniform confidence intervals and $\sqrt{n}$-consistent estimators of extrema and zeros of m. It is shown that these results are preserved under more general models in which X is determined by an explanatory variable. Finite sample performance is investigated in simulations and illustrated by a real data example.

First Page: Show Hide
Primary Subjects: 62G08, 62G05
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/1201012975
Digital Object Identifier: doi:10.1214/009053607000000497
Zentralblatt MATH identifier: 1129.62032
Mathematical Reviews number (MathSciNet): MR2382661

References

Bickel, P. J., Klaassen, C. A. J., Ritov, Y. and Wellner, J. A. (1993). Efficient and Adaptive Estimation for Semiparametric Models. Johns Hopkins Univ. Press, Baltimore, MD.
Mathematical Reviews (MathSciNet): MR1245941
Zentralblatt MATH: 0786.62001
Carroll, R. J. and Hall, P. (2004). Low-order approximations in deconvolution and regression with errors in variables. J. R. Stat. Soc. Ser. B Stat. Methodol. 66 31--46.
Mathematical Reviews (MathSciNet): MR2035757
Digital Object Identifier: doi:10.1111/j.1467-9868.2004.00430.x
Zentralblatt MATH: 1062.62066
Carroll, R. J., Maca, J. D. and Ruppert, D. (1999). Nonparametric regression in the presence of measurement error. Biometrika 86 541--554.
Mathematical Reviews (MathSciNet): MR1723777
Zentralblatt MATH: 0938.62039
Digital Object Identifier: doi:10.1093/biomet/86.3.541
Carroll, R. J., Ruppert, D. and Stefanski, L. (1995). Measurement Error in Nonlinear Models. Chapman and Hall, London.
Mathematical Reviews (MathSciNet): MR1630517
Zentralblatt MATH: 0853.62048
Devanarayan, V. and Stefanski, L. A. (2002). Empirical simulation extrapolation for measurement error models with replicate measurements. Statist. Probab. Lett. 59 219--225.
Mathematical Reviews (MathSciNet): MR1932865
Fan, J. (1991). On the optimal rates of convergence for nonparametric deconvolution problems. Ann. Statist. 19 1257--1272.
Mathematical Reviews (MathSciNet): MR1126324
Digital Object Identifier: doi:10.1214/aos/1176348248
Project Euclid: euclid.aos/1176348248
Zentralblatt MATH: 0729.62033
Fan, J. and Masry, E. (1992). Multivariate regression estimation with errors-in-variables: Asymptotic normality for mixing processes. J. Multivariate Anal. 43 237--271.
Mathematical Reviews (MathSciNet): MR1193614
Digital Object Identifier: doi:10.1016/0047-259X(92)90036-F
Zentralblatt MATH: 0769.62028
Fan, J. and Truong, Y. K. (1993). Nonparametric regression with errors in variables. Ann. Statist. 21 1900--1925.
Mathematical Reviews (MathSciNet): MR1245773
Digital Object Identifier: doi:10.1214/aos/1176349402
Project Euclid: euclid.aos/1176349402
Zentralblatt MATH: 0791.62042
Fan, J., Truong, Y. K. and Wang, Y. (1991). Nonparametric function estimation involving errors-in-variables. In Nonparametric Functional Estimation and Related Topics (G. Roussas, ed.) 613--627. Kluwer, Dordrecht.
Hastie, T., Tibshirani, R. and Friedman, J. (2001). The Elements of Statistical Learning. Data Mining, Inference and Prediction. Springer, New York.
Mathematical Reviews (MathSciNet): MR1851606
Zentralblatt MATH: 0973.62007
Hobert, J. P. and Wand, M. P. (2000). Automatic generalized nonparametric regression via maximum likelihood. Technical report, Dept. Biostatistics, Harvard School of Public Health.
Ioannides, D. A. and Matzner-Løber, E. (2002). Nonparametric estimation of the conditional mode with errors-in-variables: Strong consistency for mixing processes. J. Nonparametr. Statist. 14 341--352.
Mathematical Reviews (MathSciNet): MR1905756
Digital Object Identifier: doi:10.1080/10485250212375
Zentralblatt MATH: 1014.62102
Li, T. and Vuong, Q. (1998). Nonparametric estimation of the measurement error model using multiple indicators. J. Multivariate Anal. 65 139--165.
Mathematical Reviews (MathSciNet): MR1625869
Digital Object Identifier: doi:10.1006/jmva.1998.1741
Zentralblatt MATH: 1127.62323
Linton, O. and Whang, Y.-J. (2002). Nonparametric estimation with aggregated data. Econometric Theory 18 420--468.
Mathematical Reviews (MathSciNet): MR1891830
Digital Object Identifier: doi:10.1017/S0266466602182089
Zentralblatt MATH: 1109.62312
Stefanski, L. and Carroll, R. J. (1990). Deconvoluting kernel density estimators. Statistics 21 169--184.
Mathematical Reviews (MathSciNet): MR1054861
Digital Object Identifier: doi:10.1080/02331889008802238
Zentralblatt MATH: 0697.62035
Stefanski, L. A. and Cook, J. R. (1995). Simulation-extrapolation: The measurement error jackknife. J. Amer. Statist. Assoc. 90 1247--1256.
Mathematical Reviews (MathSciNet): MR1379467
Digital Object Identifier: doi:10.2307/2291515
Zentralblatt MATH: 0868.62062
Taupin, M. L. (2001). Semi-parametric estimation in the nonlinear structural errors-in-variables model. Ann. Statist. 29 66--93.
Mathematical Reviews (MathSciNet): MR1833959
Digital Object Identifier: doi:10.1214/aos/996986502
Project Euclid: euclid.aos/996986502
Zentralblatt MATH: 1029.62039

2012 © Institute of Mathematical Statistics

The Annals of Statistics

The Annals of Statistics