The Annals of Statistics

Generalized Likelihood Ratio Statistics and Wilks Phenomenon

Jianqing Fan, Chunming Zhang, and Jian Zhang

Source: Ann. Statist. Volume 29, Number 1 (2001), 153-193.

Abstract

Likelihood ratio theory has had tremendous success in parametric inference, due to the fundamental theory of Wilks. Yet, there is no general applicable approach for nonparametric inferences based on function estimation. Maximum likelihood ratio test statistics in general may not exist in nonparametric function estimation setting. Even if they exist, they are hard to find and can not be optimal as shown in this paper. We introduce the generalized likelihood statistics to overcome the drawbacks of nonparametric maximum likelihood ratio statistics. A new Wilks phenomenon is unveiled. We demonstrate that a class of the generalized likelihood statistics based on some appropriate nonparametric estimators are asymptotically distribution free and follow χ2-distributions under null hypotheses for a number of useful hypotheses and a variety of useful models including Gaussian white noise models, nonparametric regression models, varying coefficient models and generalized varying coefficient models. We further demonstrate that generalized likelihood ratio statistics are asymptotically optimal in the sense that they achieve optimal rates of convergence given by Ingster. They can even be adaptively optimal in the sense of Spokoiny by using a simple choice of adaptive smoothing parameter. Our work indicates that the generalized likelihood ratio statistics are indeed general and powerful for nonparametric testing problems based on function estimation.

Primary Subjects: 62G07
Secondary Subjects: 62G10, 62J12
Keywords: asymptotic null distribution; Gaussian white noise models; nonparametric test; optimal rates; power function; generalized likelihood; Wilks theorem

Full-text: Open access

Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/996986505
Digital Object Identifier: doi:10.1214/aos/996986505
Mathematical Reviews number (MathSciNet): MR1833962
Zentralblatt MATH identifier: 1029.62042

References

Aerts, M., Claeskens, G. and Hart, J. D. (1999). Testing the fit of a parametric function. J. Amer. Statist. Assoc. 94 869-879.
Mathematical Reviews (MathSciNet): MR2000g:62173
Azzalini, A. and Bowman, A. N. (1993). On the use of nonparametric regression for checking linear relationships. J. Roy. Statist. Soc. Ser. B 55 549-557.
Mathematical Reviews (MathSciNet): MR94a:62073
Azzalini, A., Bowman, A. N. and H¨ardle, W. (1989). On the use of nonparametric regression for model checking. Biometrika 76 1-11.
Mathematical Reviews (MathSciNet): MR90h:62081
Zentralblatt MATH: 0663.62096
Bickel, P. J. and Ritov, Y. (1992). Testing for goodness of fit: a new approach. In Nonparametric Statistics and Related Topics (A. K. Md. E. Saleh, ed.) 51-57. North-Holland, New York.
Mathematical Reviews (MathSciNet): MR1226715
Bickel, P. J. and Rosenblatt, M. (1973). On some global measures of the deviation of density function estimates. Ann. Statist. 1 1071-1095.
Mathematical Reviews (MathSciNet): MR50:1400
Zentralblatt MATH: 0275.62033
Brown, L. D. and Low, M. G. (1996). Asymptotic equivalence of nonparametric regression and white noise. Ann. Statist. 24 2384-2398.
Zentralblatt MATH: 0867.62022
Cai, Z., Fan, J. and Li, R. (2000). Efficient estimation and inferences for varying-coefficient models. J. Amer. Statist. Assoc. To appear.
Carroll, R. J., Fan, J., Gijbels, I. and Wand, M. P. (1997). Generalized partially linear singleindex models. J. Amer. Statist. Assoc. 92 477-489.
Mathematical Reviews (MathSciNet): MR98f:62215
Chen, J. H. and Qin, J. (1993). Empirical likelihood estimation for finite populations and the effective usage of auxiliary information. Biometrika 80 107-116.
Zentralblatt MATH: 0769.62006
Cleveland, W. S. and Devlin, S. J. (1988). Locally-weighted regression: an approach to regression analysis by local fitting. J. Amer. Statist. Assoc. 83 597-610.
Cleveland, W. S., Grosse, E. and Shyu, W. M. (1992). Local regression models. In Statistical Models in S (J. M. Chambers and T. J. Hastie, eds.) 309-376. Wadsworth and Brooks Cole, Pacific Grove, CA.
de Jong, P. (1987). A central limit theorem for generalized quadratic forms. Probab. Theory Related Fields 75 261-277.
Mathematical Reviews (MathSciNet): MR88d:60070
Eubank, R. L. and Hart, J. D. (1992). Testing goodness-of-fit in regression via order selection criteria. Ann. Statist. 20 1412-1425.
Mathematical Reviews (MathSciNet): MR93k:62107
Eubank, R. L. and LaRiccia, V. M. (1992). Asymptotic comparison of Cram´er-von Mises and nonparametric function estimation techniques for testing goodness-of-fit. Ann. Statist. 20 2071-2086.
Mathematical Reviews (MathSciNet): MR93k:62108
Fan, J. (1993). Local linear regression smoothers and their minimax efficiency. Ann. Statist. 21 196-216.
Fan, J. (1996). Test of significance based on wavelet thresholding and Neyman's truncation. J. Amer. Statist. Assoc. 91 674-688.
Zentralblatt MATH: 0869.62032
Fan, J. and Gijbels, I. (1996). Local Polynomial Modeling and Its Applications. Chapman and Hall, London.
Fan, J. and Huang, L. (1998). Goodness-of-fit test for parametric regression models. Technical Report, Dept. Statistics, Univ. California, Los Angeles.
Fan, J. and Zhang, J. (1999). Sieve empirical likelihood ratios for nonparametric functions. Unpublished manuscript.
Hall, P. and Owen, A. B. (1993). Empirical likelihood confidence bands in density estimation. J. Comput. Graph. Statist. 2 273-289.
H¨ardle, W. and Mammen, E. (1993). Comparing nonparametric versus parametric regression fits. Ann. Statist. 21 1926-1947.
Mathematical Reviews (MathSciNet): MR94k:62057
Hart, J. D. (1997). Nonparametric Smoothing and Lack-of-Fit Tests. Springer, New York.
Mathematical Reviews (MathSciNet): MR99h:62056
Hastie, T. J. and Tibshirani, R. J. (1990). Generalized Additive Models. Chapman and Hall, London.
Mathematical Reviews (MathSciNet): MR92e:62117
Hastie, T. J. and Tibshirani, R. J. (1993). Varying-coefficient models (with discussion). J. Royal Statist. Soc. Ser. B 55 757-796.
Mathematical Reviews (MathSciNet): MR94b:62055
Huber, P. J. (1973). Robust regression: asymptotics, conjectures and Monte Carlo. Ann. Statist. 1 799-821.
Mathematical Reviews (MathSciNet): MR50:8843
Inglot, T., Kallenberg, W. C. M. and Ledwina, T. (1994). Power approximations to and power comparison of smooth goodness-of-fit tests. Scand. J. Statist. 21 131-145.
Mathematical Reviews (MathSciNet): MR95h:62081
Inglot, T. and Ledwina, T. (1996). Asymptotic optimality of data-driven Neyman's tests for uniformity. Ann. Statist. 24 1982-2019.
Mathematical Reviews (MathSciNet): MR99b:62079
Ingster, Yu. I. (1993). Asymptotic minimax hypothesis testing for nonparametric alternatives I-III. Math. Methods Statist. 2 85-114; 3 171-189; 4 249-268.
Kallenberg, W. C. M. and Ledwina, T. (1997). Data-driven smooth tests when the hypothesis is composite. J. Amer. Statist. Assoc. 92 1094-1104.
Mathematical Reviews (MathSciNet): MR98f:62136
Koroljuk, V. S. and Borovskich, Yu. V. (1994). Theory of UStatistics. Kluwer, Amsterdam.
Kuchibhatla, M. and Hart, J. D. (1996). Smoothing-based lack-of-fit tests: variations on a theme. J. Nonparameter. Statist. 7 1-22.
Mathematical Reviews (MathSciNet): MR99a:62068
Lepski, O. V. and Spokoiny, V. G. (1999). Minimax nonparametric hypothesis testing: the case of an inhomogeneous alternative. Bernoulli 5 333-358.
Mathematical Reviews (MathSciNet): MR2000h:62047
Li, G., Hollander, M., McKeague, I. W. and Yang, J. (1996). Nonparametric likelihood ratio confidence bands for quantile functions from incomplete survival data. Ann. Statist. 24 628-640.
Zentralblatt MATH: 0859.62047
Murphy, S. A. (1993). Testing for a time dependent coefficient in Cox's regression model. Scand. J. Statist. 20 35-50.
Mathematical Reviews (MathSciNet): MR94h:62082
Neyman, J. (1937). Smooth test for goodness of fit. Skand. Aktuar. J. 20 149-199.
Zentralblatt MATH: 0018.03403
Nussbaum, M. (1996). Asymptotic equivalence of density estimation and Gaussian white noise. Ann. Statist. 24 2399-2430.
Mathematical Reviews (MathSciNet): MR98k:62065
Owen, A. B. (1988). Empirical likelihood ratio confidence intervals for a single functional. Biometrika 75 237-249.
Mathematical Reviews (MathSciNet): MR90b:62047
Zentralblatt MATH: 0641.62032
Owen, A. B. (1990). Empirical likelihood ratio confidence regions. Ann. Statist. 18 90-120.
Mathematical Reviews (MathSciNet): MR91g:62037
Portnoy, S. (1988). Asymptotic behavior of likelihood methods for exponential families when the number of parameters tends to infinity. Ann. Statist. 16 356-366.
Mathematical Reviews (MathSciNet): MR89a:62056
Randle, D. H. and Wolfe, D. A. (1979). Introduction to the Theory of Nonparametric Statistics. Wiley, New York.
Seber, G. A. F. (1977). Linear Regression Analysis. Wiley, New York.
Mathematical Reviews (MathSciNet): MR55:9428
Shen, X., Shi, J. and Wong, W. H. (1999). Random sieve likelihood and general regression models. J. Amer. Statist. Assoc. 94 835-846.
Severini, T. A. and Wong, W. H. (1992). Generalized profile likelihood and conditional parametric models. Ann. Statist. 20 1768-1802.
Mathematical Reviews (MathSciNet): MR94a:62063
Silverman, B. W. (1984). Spline smoothing: the equivalent variable kernel method. Ann. Statist. 12 898-916.
Mathematical Reviews (MathSciNet): MR86e:62084
Spokoiny, V. G. (1996). Adaptive hypothesis testing using wavelets. Ann. Statist. 24 2477-2498.
Mathematical Reviews (MathSciNet): MR98f:62141
Wilks, S. S. (1938). The large-sample distribution of the likelihood ratio for testing composite hypotheses. Ann. Math. Statist. 9 60-62.
Zentralblatt MATH: 0018.32003
Zhang, J. and Gijbels, I. (1999). Sieve empirical likelihood and extensions of generalized least squares. Discussion paper, Institute of Statistics, Univ. catholique de Louvain.
5612 AZ, Eindhoven The Netherlands E-mail: jzhang@euridice.tue.nl

2009 © Institute of Mathematical Statistics