The Annals of Statistics

Generalized Likelihood Ratio Statistics and Wilks Phenomenon

Jianqing Fan, Chunming Zhang, and Jian Zhang
Source: Ann. Statist. Volume 29, Number 1 (2001), 153-193.

Abstract

Likelihood ratio theory has had tremendous success in parametric inference, due to the fundamental theory of Wilks. Yet, there is no generally applicable approach for nonparametric inferences based on function estimation. Maximum likelihood ratio test statistics in general may not exist in the nonparametric function estimation setting. Even if they exist, they are hard to find and cannot be optimal, as shown in this paper. We introduce the generalized likelihood statistics to overcome the drawbacks of nonparametric maximum likelihood ratio statistics. A new Wilks phenomenon is unveiled. We demonstrate that a class of the generalized likelihood statistics based on some appropriate nonparametric estimators are asymptotically distribution free and follow χ²-distributions under null hypotheses for a number of useful hypotheses and a variety of useful models including Gaussian white noise models, nonparametric regression models, varying coefficient models and generalized varying coefficient models. We further demonstrate that generalized likelihood ratio statistics are asymptotically optimal in the sense that they achieve optimal rates of convergence given by Ingster. They can even be adaptively optimal in the sense of Spokoiny by using a simple choice of adaptive smoothing parameter. Our work indicates that the generalized likelihood ratio statistics are indeed general and powerful for nonparametric testing problems based on function estimation.

Primary Subjects: 62G07
Secondary Subjects: 62G10, 62J12
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/996986505
Digital Object Identifier: doi:10.1214/aos/996986505
Mathematical Reviews number (MathSciNet): MR1833962
Zentralblatt MATH identifier: 1029.62042

References

Aerts, M., Claeskens, G. and Hart, J. D. (1999). Testing the fit of a parametric function. J. Amer. Statist. Assoc. 94 869-879.
Mathematical Reviews (MathSciNet): MR2000g:62173
Zentralblatt MATH: 0996.62044
Digital Object Identifier: doi:10.2307/2670002
Azzalini, A. and Bowman, A. N. (1993). On the use of nonparametric regression for checking linear relationships. J. Roy. Statist. Soc. Ser. B 55 549-557.
Mathematical Reviews (MathSciNet): MR94a:62073
Azzalini, A., Bowman, A. N. and Härdle, W. (1989). On the use of nonparametric regression for model checking. Biometrika 76 1-11.
Mathematical Reviews (MathSciNet): MR90h:62081
Zentralblatt MATH: 0663.62096
Digital Object Identifier: doi:10.1093/biomet/76.1.1
Bickel, P. J. and Ritov, Y. (1992). Testing for goodness of fit: a new approach. In Nonparametric Statistics and Related Topics (A. K. Md. E. Saleh, ed.) 51-57. North-Holland, New York.
Mathematical Reviews (MathSciNet): MR1226715
Bickel, P. J. and Rosenblatt, M. (1973). On some global measures of the deviation of density function estimates. Ann. Statist. 1 1071-1095.
Mathematical Reviews (MathSciNet): MR50:1400
Zentralblatt MATH: 0275.62033
Digital Object Identifier: doi:10.1214/aos/1176342558
Project Euclid: euclid.aos/1176342558
Brown, L. D. and Low, M. G. (1996). Asymptotic equivalence of nonparametric regression and white noise. Ann. Statist. 24 2384-2398.
Zentralblatt MATH: 0867.62022
Mathematical Reviews (MathSciNet): MR1425958
Digital Object Identifier: doi:10.1214/aos/1032181159
Project Euclid: euclid.aos/1032181159
Cai, Z., Fan, J. and Li, R. (2000). Efficient estimation and inferences for varying-coefficient models. J. Amer. Statist. Assoc. To appear.
Mathematical Reviews (MathSciNet): MR1804446
Zentralblatt MATH: 0999.62052
Digital Object Identifier: doi:10.2307/2669472
Carroll, R. J., Fan, J., Gijbels, I. and Wand, M. P. (1997). Generalized partially linear single-index models. J. Amer. Statist. Assoc. 92 477-489.
Mathematical Reviews (MathSciNet): MR98f:62215
Zentralblatt MATH: 0890.62053
Digital Object Identifier: doi:10.2307/2965697
Chen, J. H. and Qin, J. (1993). Empirical likelihood estimation for finite populations and the effective usage of auxiliary information. Biometrika 80 107-116.
Zentralblatt MATH: 0769.62006
Mathematical Reviews (MathSciNet): MR1225218
Digital Object Identifier: doi:10.1093/biomet/80.1.107
Cleveland, W. S. and Devlin, S. J. (1988). Locally-weighted regression: an approach to regression analysis by local fitting. J. Amer. Statist. Assoc. 83 597-610.
Cleveland, W. S., Grosse, E. and Shyu, W. M. (1992). Local regression models. In Statistical Models in S (J. M. Chambers and T. J. Hastie, eds.) 309-376. Wadsworth and Brooks Cole, Pacific Grove, CA.
de Jong, P. (1987). A central limit theorem for generalized quadratic forms. Probab. Theory Related Fields 75 261-277.
Mathematical Reviews (MathSciNet): MR88d:60070
Zentralblatt MATH: 0596.60022
Digital Object Identifier: doi:10.1007/BF00354037
Eubank, R. L. and Hart, J. D. (1992). Testing goodness-of-fit in regression via order selection criteria. Ann. Statist. 20 1412-1425.
Mathematical Reviews (MathSciNet): MR93k:62107
Zentralblatt MATH: 0776.62045
Digital Object Identifier: doi:10.1214/aos/1176348775
Project Euclid: euclid.aos/1176348775
Eubank, R. L. and LaRiccia, V. M. (1992). Asymptotic comparison of Cramér-von Mises and nonparametric function estimation techniques for testing goodness-of-fit. Ann. Statist. 20 2071-2086.
Mathematical Reviews (MathSciNet): MR93k:62108
Digital Object Identifier: doi:10.1214/aos/1176348903
Project Euclid: euclid.aos/1176348903
Fan, J. (1993). Local linear regression smoothers and their minimax efficiency. Ann. Statist. 21 196-216.
Mathematical Reviews (MathSciNet): MR1212173
Zentralblatt MATH: 0773.62029
Digital Object Identifier: doi:10.1214/aos/1176349022
Project Euclid: euclid.aos/1176349022
Fan, J. (1996). Test of significance based on wavelet thresholding and Neyman's truncation. J. Amer. Statist. Assoc. 91 674-688.
Zentralblatt MATH: 0869.62032
Mathematical Reviews (MathSciNet): MR1395735
Digital Object Identifier: doi:10.2307/2291663
Fan, J. and Gijbels, I. (1996). Local Polynomial Modeling and Its Applications. Chapman and Hall, London.
Mathematical Reviews (MathSciNet): MR1383587
Zentralblatt MATH: 0873.62037
Fan, J. and Huang, L. (1998). Goodness-of-fit test for parametric regression models. Technical Report, Dept. Statistics, Univ. California, Los Angeles.
Fan, J. and Zhang, J. (1999). Sieve empirical likelihood ratios for nonparametric functions. Unpublished manuscript.
Hall, P. and Owen, A. B. (1993). Empirical likelihood confidence bands in density estimation. J. Comput. Graph. Statist. 2 273-289.
Mathematical Reviews (MathSciNet): MR1272395
Digital Object Identifier: doi:10.2307/1390646
Härdle, W. and Mammen, E. (1993). Comparing nonparametric versus parametric regression fits. Ann. Statist. 21 1926-1947.
Mathematical Reviews (MathSciNet): MR94k:62057
Zentralblatt MATH: 0795.62036
Digital Object Identifier: doi:10.1214/aos/1176349403
Project Euclid: euclid.aos/1176349403
Hart, J. D. (1997). Nonparametric Smoothing and Lack-of-Fit Tests. Springer, New York.
Mathematical Reviews (MathSciNet): MR99h:62056
Zentralblatt MATH: 0886.62043
Hastie, T. J. and Tibshirani, R. J. (1990). Generalized Additive Models. Chapman and Hall, London.
Mathematical Reviews (MathSciNet): MR92e:62117
Hastie, T. J. and Tibshirani, R. J. (1993). Varying-coefficient models (with discussion). J. Roy. Statist. Soc. Ser. B 55 757-796.
Mathematical Reviews (MathSciNet): MR94b:62055
Huber, P. J. (1973). Robust regression: asymptotics, conjectures and Monte Carlo. Ann. Statist. 1 799-821.
Mathematical Reviews (MathSciNet): MR50:8843
Zentralblatt MATH: 0289.62033
Digital Object Identifier: doi:10.1214/aos/1176342503
Project Euclid: euclid.aos/1176342503
Inglot, T., Kallenberg, W. C. M. and Ledwina, T. (1994). Power approximations to and power comparison of smooth goodness-of-fit tests. Scand. J. Statist. 21 131-145.
Mathematical Reviews (MathSciNet): MR95h:62081
Inglot, T. and Ledwina, T. (1996). Asymptotic optimality of data-driven Neyman's tests for uniformity. Ann. Statist. 24 1982-2019.
Mathematical Reviews (MathSciNet): MR99b:62079
Zentralblatt MATH: 0905.62044
Digital Object Identifier: doi:10.1214/aos/1069362306
Project Euclid: euclid.aos/1069362306
Ingster, Yu. I. (1993). Asymptotic minimax hypothesis testing for nonparametric alternatives I-III. Math. Methods Statist. 2 85-114; 3 171-189; 4 249-268.
Kallenberg, W. C. M. and Ledwina, T. (1997). Data-driven smooth tests when the hypothesis is composite. J. Amer. Statist. Assoc. 92 1094-1104.
Mathematical Reviews (MathSciNet): MR98f:62136
Zentralblatt MATH: 1067.62534
Digital Object Identifier: doi:10.2307/2965574
Koroljuk, V. S. and Borovskich, Yu. V. (1994). Theory of U-Statistics. Kluwer, Amsterdam.
Kuchibhatla, M. and Hart, J. D. (1996). Smoothing-based lack-of-fit tests: variations on a theme. J. Nonparametr. Statist. 7 1-22.
Mathematical Reviews (MathSciNet): MR99a:62068
Zentralblatt MATH: 0877.62041
Digital Object Identifier: doi:10.1080/10485259608832685
Lepski, O. V. and Spokoiny, V. G. (1999). Minimax nonparametric hypothesis testing: the case of an inhomogeneous alternative. Bernoulli 5 333-358.
Mathematical Reviews (MathSciNet): MR2000h:62047
Digital Object Identifier: doi:10.2307/3318439
Project Euclid: euclid.bj/1173147910
Li, G., Hollander, M., McKeague, I. W. and Yang, J. (1996). Nonparametric likelihood ratio confidence bands for quantile functions from incomplete survival data. Ann. Statist. 24 628-640.
Zentralblatt MATH: 0859.62047
Mathematical Reviews (MathSciNet): MR1394978
Digital Object Identifier: doi:10.1214/aos/1032894455
Project Euclid: euclid.aos/1032894455
Murphy, S. A. (1993). Testing for a time dependent coefficient in Cox's regression model. Scand. J. Statist. 20 35-50.
Mathematical Reviews (MathSciNet): MR94h:62082
Neyman, J. (1937). Smooth test for goodness of fit. Skand. Aktuar. J. 20 149-199.
Zentralblatt MATH: 0018.03403
Nussbaum, M. (1996). Asymptotic equivalence of density estimation and Gaussian white noise. Ann. Statist. 24 2399-2430.
Mathematical Reviews (MathSciNet): MR98k:62065
Digital Object Identifier: doi:10.1214/aos/1032181160
Project Euclid: euclid.aos/1032181160
Owen, A. B. (1988). Empirical likelihood ratio confidence intervals for a single functional. Biometrika 75 237-249.
Mathematical Reviews (MathSciNet): MR90b:62047
Zentralblatt MATH: 0641.62032
Digital Object Identifier: doi:10.1093/biomet/75.2.237
Owen, A. B. (1990). Empirical likelihood ratio confidence regions. Ann. Statist. 18 90-120.
Mathematical Reviews (MathSciNet): MR91g:62037
Zentralblatt MATH: 0712.62040
Digital Object Identifier: doi:10.1214/aos/1176347494
Project Euclid: euclid.aos/1176347494
Portnoy, S. (1988). Asymptotic behavior of likelihood methods for exponential families when the number of parameters tends to infinity. Ann. Statist. 16 356-366.
Mathematical Reviews (MathSciNet): MR89a:62056
Zentralblatt MATH: 0637.62026
Digital Object Identifier: doi:10.1214/aos/1176350710
Project Euclid: euclid.aos/1176350710
Randles, R. H. and Wolfe, D. A. (1979). Introduction to the Theory of Nonparametric Statistics. Wiley, New York.
Mathematical Reviews (MathSciNet): MR547836
Zentralblatt MATH: 0529.62035
Seber, G. A. F. (1977). Linear Regression Analysis. Wiley, New York.
Mathematical Reviews (MathSciNet): MR55:9428
Shen, X., Shi, J. and Wong, W. H. (1999). Random sieve likelihood and general regression models. J. Amer. Statist. Assoc. 94 835-846.
Mathematical Reviews (MathSciNet): MR1723339
Zentralblatt MATH: 0994.62032
Digital Object Identifier: doi:10.2307/2669998
Severini, T. A. and Wong, W. H. (1992). Generalized profile likelihood and conditional parametric models. Ann. Statist. 20 1768-1802.
Mathematical Reviews (MathSciNet): MR94a:62063
Zentralblatt MATH: 0768.62015
Digital Object Identifier: doi:10.1214/aos/1176348889
Project Euclid: euclid.aos/1176348889
Silverman, B. W. (1984). Spline smoothing: the equivalent variable kernel method. Ann. Statist. 12 898-916.
Mathematical Reviews (MathSciNet): MR86e:62084
Zentralblatt MATH: 0547.62024
Digital Object Identifier: doi:10.1214/aos/1176346710
Project Euclid: euclid.aos/1176346710
Spokoiny, V. G. (1996). Adaptive hypothesis testing using wavelets. Ann. Statist. 24 2477-2498.
Mathematical Reviews (MathSciNet): MR98f:62141
Zentralblatt MATH: 0898.62056
Digital Object Identifier: doi:10.1214/aos/1032181163
Project Euclid: euclid.aos/1032181163
Wilks, S. S. (1938). The large-sample distribution of the likelihood ratio for testing composite hypotheses. Ann. Math. Statist. 9 60-62.
Zentralblatt MATH: 0018.32003
Zhang, J. and Gijbels, I. (1999). Sieve empirical likelihood and extensions of generalized least squares. Discussion paper, Institute of Statistics, Univ. catholique de Louvain.
5612 AZ, Eindhoven The Netherlands E-mail: jzhang@euridice.tue.nl

2012 © Institute of Mathematical Statistics
