The Annals of Statistics

Testing the suitability of polynomial models in errors-in-variables problems

Peter Hall and Yanyuan Ma
Source: Ann. Statist. Volume 35, Number 6 (2007), 2620-2638.

Abstract

A low-degree polynomial model for a response curve is used commonly in practice. It generally incorporates a linear or quadratic function of the covariate. In this paper we suggest methods for testing the goodness of fit of a general polynomial model when there are errors in the covariates. There, the true covariates are not directly observed, and conventional bootstrap methods for testing are not applicable. We develop a new approach, in which deconvolution methods are used to estimate the distribution of the covariates under the null hypothesis, and a “wild” or moment-matching bootstrap argument is employed to estimate the distribution of the experimental errors (distinct from the distribution of the errors in covariates). Most of our attention is directed at the case where the distribution of the errors in covariates is known, although we also discuss methods for estimation and testing when the covariate error distribution is estimated. No assumptions are made about the distribution of experimental error, and, in particular, we depart substantially from conventional parametric models for errors-in-variables problems.

First Page: Show Hide
Primary Subjects: 62G08, 62G09, 62G10, 62G20
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/1201012974
Digital Object Identifier: doi:10.1214/009053607000000361
Mathematical Reviews number (MathSciNet): MR2382660
Zentralblatt MATH identifier: 1129.62042

References

Bickel, P. J. and Ritov, Y. (1987). Efficient estimation in the errors-in-variables model. Ann. Statist. 15 513--540.
Mathematical Reviews (MathSciNet): MR0888423
Digital Object Identifier: doi:10.1214/aos/1176350358
Project Euclid: euclid.aos/1176350358
Zentralblatt MATH: 0643.62029
Carroll, R. J. and Hall, P. (1988). Optimal rates of convergence for deconvolving a density. J. Amer. Statist. Assoc. 83 1184--1186.
Mathematical Reviews (MathSciNet): MR0997599
Digital Object Identifier: doi:10.2307/2290153
Zentralblatt MATH: 0673.62033
Chan, L. K. and Mak, T. K. (1985). On the polynomial functional relationship. J. Roy. Statist. Soc. Ser. B 47 510--518.
Mathematical Reviews (MathSciNet): MR0844482
Cheng, C.-L. and Kukush, A. G. (2004). A goodness-of-fit test for a polynomial errors-in-variables model. Ukranian Math. J. 56 641--661.
Mathematical Reviews (MathSciNet): MR2105906
Digital Object Identifier: doi:10.1007/s11253-005-0095-9
Cheng, C.-L. and Schneeweiss, H. (1998). Polynomial regression with errors in the variables. J. R. Stat. Soc. Ser. B Stat. Methodol. 60 189--199.
Mathematical Reviews (MathSciNet): MR1625632
Digital Object Identifier: doi:10.1111/1467-9868.00118
Cheng, C.-L., Schneeweiss, H. and Thamerus, M. (2000). A small sample estimator for a polynomial regression with errors in the variables. J. R. Stat. Soc. Ser. B Stat. Methodol. 62 699--709.
Mathematical Reviews (MathSciNet): MR1796286
Digital Object Identifier: doi:10.1111/1467-9868.00258
Zentralblatt MATH: 0957.62052
Delaigle, A. and Gijbels, I. (2002). Estimation of integrated squared density derivatives from a contaminated sample. J. R. Stat. Soc. Ser. B Stat. Methodol. 64 869--886.
Mathematical Reviews (MathSciNet): MR1979392
Digital Object Identifier: doi:10.1111/1467-9868.00366
Zentralblatt MATH: 1067.62034
Delaigle, A. and Gijbels, I. (2004). Practical bandwidth selection in deconvolution kernel density estimation. Comput. Statist. Data Anal. 45 249--267.
Mathematical Reviews (MathSciNet): MR2045631
Domínguez, M. A. and Lobato, I. N. (2003). Testing the martingale difference hypothesis. Econometric Rev. 22 351--377.
Mathematical Reviews (MathSciNet): MR2018885
Digital Object Identifier: doi:10.1081/ETC-120025895
Zentralblatt MATH: 1030.62066
Efromovich, S. (1994). Nonparametric curve estimation from indirect observations. In Computing Science and Statistics. Proc. 26th Symp. on the Interface 196--200. Interface Foundation of North America, Fairfax Station, VA.
Efromovich, S. (1999). Nonparametric Curve Estimation: Methods, Theory and Applications. Springer, New York.
Mathematical Reviews (MathSciNet): MR1705298
Zentralblatt MATH: 0935.62039
Fan, J. (1991). On the optimal rates of convergence for nonparametric deconvolution problems. Ann. Statist. 19 1257--1272.
Mathematical Reviews (MathSciNet): MR1126324
Digital Object Identifier: doi:10.1214/aos/1176348248
Project Euclid: euclid.aos/1176348248
Zentralblatt MATH: 0729.62033
Fan, J. and Truong, Y. K. (1993). Nonparametric regression with errors in variables. Ann. Statist. 21 1900--1925.
Mathematical Reviews (MathSciNet): MR1245773
Digital Object Identifier: doi:10.1214/aos/1176349402
Project Euclid: euclid.aos/1176349402
Zentralblatt MATH: 0791.62042
Fan, J., Zhang, C. M. and Zhang, J. (2001). Generalized likelihood ratio statistics and Wilks phenomenon. Ann. Statist. 29 153--193.
Mathematical Reviews (MathSciNet): MR1833962
Digital Object Identifier: doi:10.1214/aos/996986505
Project Euclid: euclid.aos/996986505
Zentralblatt MATH: 1029.62042
Fan, Y. and Li, Q. (2002). A consistent model specification test based on the kernel sum of squares of residuals. Econometric Rev. 21 337--352.
Mathematical Reviews (MathSciNet): MR1944979
Digital Object Identifier: doi:10.1081/ETC-120015787
Zentralblatt MATH: 1033.62038
Flachaire, E. (2002). Bootstrapping heteroskedasticity consistent covariance matrix estimator. Comput. Statist. 17 501--506.
Mathematical Reviews (MathSciNet): MR1952694
Digital Object Identifier: doi:10.1007/s001800200122
Zentralblatt MATH: 1037.62023
Fuller, W. A. (1987). Measurement Error Models. Wiley, New York.
Mathematical Reviews (MathSciNet): MR0898653
Zentralblatt MATH: 0800.62413
González Manteiga, W., Martínez Miranda, M. D. and Pérez González, A. (2004). The choice of smoothing parameter in nonparametric regression through wild bootstrap. Comput. Statist. Data Anal. 47 487--515.
Mathematical Reviews (MathSciNet): MR2100562
Härdle, W. and Mammen, E. (1993). Comparing nonparametric versus parametric regression fits. Ann. Statist. 21 1926--1947.
Mathematical Reviews (MathSciNet): MR1245774
Digital Object Identifier: doi:10.1214/aos/1176349403
Project Euclid: euclid.aos/1176349403
Zentralblatt MATH: 0795.62036
Kauermann, G. and Opsomer, J. D. (2003). Local likelihood estimation in generalized additive models. Scand. J. Statist. 30 317--337.
Mathematical Reviews (MathSciNet): MR1983128
Digital Object Identifier: doi:10.1111/1467-9469.00333
Zentralblatt MATH: 1053.62084
Kukush, A., Schneeweiss, H. and Wolf, R. (2005). Relative efficiency of three estimators in a polynomial regression with measurement errors. J. Statist. Plann. Inference 127 179--203.
Mathematical Reviews (MathSciNet): MR2103032
Digital Object Identifier: doi:10.1016/j.jspi.2003.09.016
Zentralblatt MATH: 1083.62023
Li, Q., Hsiao, C. and Zinn, J. (2003). Consistent specification tests for semiparametric/nonparametric models based on series estimation methods. J. Econometrics 112 295--325.
Mathematical Reviews (MathSciNet): MR1951146
Digital Object Identifier: doi:10.1016/S0304-4076(02)00198-7
Zentralblatt MATH: 1027.62027
Liang, H., Härdle, W. and Carroll, R. J. (1999). Estimation in a semiparametric partially linear errors-in-variables model. Ann. Statist. 27 1519--1535.
Mathematical Reviews (MathSciNet): MR1742498
Digital Object Identifier: doi:10.1214/aos/1017939140
Project Euclid: euclid.aos/1017939140
Zentralblatt MATH: 0977.62036
Liu, R. Y. (1988). Bootstrap procedures under some non-i.i.d. models. Ann. Statist. 16 1696--1708.
Mathematical Reviews (MathSciNet): MR0964947
Digital Object Identifier: doi:10.1214/aos/1176351062
Project Euclid: euclid.aos/1176351062
Zentralblatt MATH: 0655.62031
Mammen, E. (1993). Bootstrap and wild bootstrap for high-dimensional linear models. Ann. Statist. 21 255--285.
Mathematical Reviews (MathSciNet): MR1212176
Digital Object Identifier: doi:10.1214/aos/1176349025
Project Euclid: euclid.aos/1176349025
Zentralblatt MATH: 0771.62032
Prášková, Z. (2003). Wild bootstrap in RCA(1) model. Kybernetika (Prague) 39 1--12.
Mathematical Reviews (MathSciNet): MR1980120
Stefanski, L. A. and Carroll, R. J. (1987). Conditional scores and optimal scores for generalized linear measurement-error models. Biometrika 74 703--716.
Mathematical Reviews (MathSciNet): MR0919838
Zentralblatt MATH: 0632.62052
Stefanski, L. A. and Carroll, R. J. (1990). Deconvoluting kernel density estimators. Statistics 21 169--184.
Mathematical Reviews (MathSciNet): MR1054861
Digital Object Identifier: doi:10.1080/02331889008802238
Zentralblatt MATH: 0697.62035
Taupin, M. (2001). Semi-parametric estimation in the nonlinear structural errors-in-variables model. Ann. Statist. 29 66--93.
Mathematical Reviews (MathSciNet): MR1833959
Digital Object Identifier: doi:10.1214/aos/996986502
Project Euclid: euclid.aos/996986502
Zentralblatt MATH: 1029.62039
Tsiatis, A. A. and Ma, Y. (2004). Locally efficient semiparametric estimators for functional measurement error models. Biometrika 91 835--848.
Mathematical Reviews (MathSciNet): MR2126036
Zentralblatt MATH: 1064.62034
Digital Object Identifier: doi:10.1093/biomet/91.4.835

2012 © Institute of Mathematical Statistics

The Annals of Statistics

The Annals of Statistics