The Annals of Statistics

Bayesian-motivated tests of function fit and their asymptotic frequentist properties

Marc Aerts, Gerda Claeskens, and Jeffrey D. Hart
Source: Ann. Statist. Volume 32, Number 6 (2004), 2580-2615.

Abstract

We propose and analyze nonparametric tests of the null hypothesis that a function belongs to a specified parametric family. The tests are based on BIC approximations, πBIC, to the posterior probability of the null model, and may be carried out in either Bayesian or frequentist fashion. We obtain results on the asymptotic distribution of πBIC under both the null hypothesis and local alternatives. One version of πBIC, call it πBIC*, uses a class of models that are orthogonal to each other and growing in number without bound as sample size, n, tends to infinity. We show that $\sqrt{n}$(1−πBIC*) converges in distribution to a stable law under the null hypothesis. We also show that πBIC* can detect local alternatives converging to the null at the rate $\sqrt{\log n/n}$. A particularly interesting finding is that the power of the πBIC*-based test is asymptotically equal to that of a test based on the maximum of alternative log-likelihoods.

Simulation results and an example involving variable star data illustrate desirable features of the proposed tests.

First Page: Show Hide
Primary Subjects: 62G10, 62C10
Secondary Subjects: 62G20
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/1107794880
Digital Object Identifier: doi:10.1214/009053604000000805
Mathematical Reviews number (MathSciNet): MR2153996
Zentralblatt MATH identifier: 1068.62053

References

Aerts, M., Claeskens, G. and Hart, J. D. (1999). Testing the fit of a parametric function. J. Amer. Statist. Assoc. 94 869--879.
Mathematical Reviews (MathSciNet): MR1723323
Digital Object Identifier: doi:10.2307/2670002
Zentralblatt MATH: 0996.62044
Aerts, M., Claeskens, G. and Hart, J. D. (2000). Testing lack of fit in multiple regression. Biometrika 87 405--424.
Mathematical Reviews (MathSciNet): MR1782487
Zentralblatt MATH: 0948.62028
Digital Object Identifier: doi:10.1093/biomet/87.2.405
Akaike, H. (1974). A new look at the statistical model identification. IEEE Trans. Automatic Control 19 716--723.
Mathematical Reviews (MathSciNet): MR423716
Digital Object Identifier: doi:10.1109/TAC.1974.1100705
Zentralblatt MATH: 0314.62039
Berger, J. O. and Pericchi, L. R. (1996). The intrinsic Bayes factor for model selection and prediction. J. Amer. Statist. Assoc. 91 109--122.
Mathematical Reviews (MathSciNet): MR1394065
Digital Object Identifier: doi:10.2307/2291387
Zentralblatt MATH: 0870.62021
Claeskens, G. and Hjort, N. L. (2004). Goodness of fit via nonparametric likelihood ratios. Scand. J. Statist. 31 487--514.
Mathematical Reviews (MathSciNet): MR2101536
Digital Object Identifier: doi:10.1111/j.1467-9469.2004.00403.x
Zentralblatt MATH: 1065.62056
Drmač, Z., Omladič, M. and Veselić, K. (1994). On the perturbation of the Cholesky factorization. SIAM J. Matrix Anal. Appl. 15 1319--1332.
Mathematical Reviews (MathSciNet): MR1293920
Digital Object Identifier: doi:10.1137/S0895479893244717
Zentralblatt MATH: 0809.15005
Eubank, R. L. and Hart, J. D. (1992). Testing goodness-of-fit in regression via order selection criteria. Ann. Statist. 20 1412--1425.
Mathematical Reviews (MathSciNet): MR1186256
Digital Object Identifier: doi:10.1214/aos/1176348775
Project Euclid: euclid.aos/1176348775
Zentralblatt MATH: 0776.62045
Fan, J. (1996). Test of significance based on wavelet thresholding and Neyman's truncation. J. Amer. Statist. Assoc. 91 674--688.
Mathematical Reviews (MathSciNet): MR1395735
Digital Object Identifier: doi:10.2307/2291663
Zentralblatt MATH: 0869.62032
Fan, J. and Huang, L. (2001). Goodness-of-fit tests for parametric regression models. J. Amer. Statist. Assoc. 96 640--652.
Mathematical Reviews (MathSciNet): MR1946431
Digital Object Identifier: doi:10.1198/016214501753168316
Zentralblatt MATH: 1017.62014
Golub, G. H. and Van Loan, C. F. (1996). Matrix Computations, 3rd ed. Johns Hopkins Univ. Press.
Mathematical Reviews (MathSciNet): MR1417720
Zentralblatt MATH: 0865.65009
Golubov, B., Efimov, A. and Skvortsov, V. (1991). Walsh Series and Transforms: Theory and Applications. Kluwer, Dordrecht.
Mathematical Reviews (MathSciNet): MR1155844
Zentralblatt MATH: 0785.42010
Good, I. J. (1957). Saddle-point methods for the multinomial distribution. Ann. Math. Statist. 28 861--881.
Mathematical Reviews (MathSciNet): MR93866
Digital Object Identifier: doi:10.1214/aoms/1177706790
Project Euclid: euclid.aoms/1177706790
Good, I. J. (1992). The Bayes/non-Bayes compromise: A brief review. J. Amer. Statist. Assoc. 87 597--606.
Mathematical Reviews (MathSciNet): MR1185188
Digital Object Identifier: doi:10.2307/2290192
Götze, F. (1991). On the rate of convergence in the multivariate central limit theorem. Ann. Probab. 19 724--739.
Mathematical Reviews (MathSciNet): MR1106283
Digital Object Identifier: doi:10.1214/aop/1176990448
Project Euclid: euclid.aop/1176990448
Zentralblatt MATH: 0729.62051
Hart, J. D. (1997). Nonparametric Smoothing and Lack-of-Fit Tests. Springer, New York.
Mathematical Reviews (MathSciNet): MR1461272
Zentralblatt MATH: 0886.62043
Hart, J. D., Koen, C. and Lombard, F. (2004). An analysis of pulsation periods of long-period variable stars. Submitted for publication.
Haughton, D. M. A. (1988). On the choice of a model to fit data from an exponential family. Ann. Statist. 16 342--355.
Mathematical Reviews (MathSciNet): MR924875
Digital Object Identifier: doi:10.1214/aos/1176350709
Project Euclid: euclid.aos/1176350709
Zentralblatt MATH: 0657.62037
Jeffreys, H. (1961). Theory of Probability, 3rd ed. Oxford Univ. Press.
Mathematical Reviews (MathSciNet): MR187257
Zentralblatt MATH: 0116.34904
Kass, R. E. and Wasserman, L. (1995). A reference Bayesian test for nested hypotheses and its relationship to the Schwarz criterion. J. Amer. Statist. Assoc. 90 928--934.
Mathematical Reviews (MathSciNet): MR1354008
Digital Object Identifier: doi:10.2307/2291327
Zentralblatt MATH: 0851.62020
Kass, R. E. and Wasserman, L. (1996). The selection of prior distributions by formal rules. J. Amer. Statist. Assoc. 91 1343--1370. [Correction (1998) 93 412.]
Mathematical Reviews (MathSciNet): MR1478684
Digital Object Identifier: doi:10.1214/lnms/1215453065
Kass, R. E. and Raftery, A. E. (1995). Bayes factors. J. Amer. Statist. Assoc. 90 773--795.
Koen, C. and Lombard, F. (2001). The analysis of indexed astronomical time series---VII. Simultaneous use of times of maxima and minima to test for period changes in long-period variables. Monthly Notices of the Royal Astronomical Society 325 1124--1132.
Ledwina, T. (1994). Data-driven version of Neyman's smooth test of fit. J. Amer. Statist. Assoc. 89 1000--1005.
Mathematical Reviews (MathSciNet): MR1294744
Digital Object Identifier: doi:10.2307/2290926
Zentralblatt MATH: 0805.62022
LePage, R., Woodroofe, M. and Zinn, J. (1981). Convergence to a stable distribution via order statistics. Ann. Probab. 9 624--632.
Mathematical Reviews (MathSciNet): MR624688
Digital Object Identifier: doi:10.1214/aop/1176994367
Project Euclid: euclid.aop/1176994367
Zentralblatt MATH: 0465.60031
Mattei, J. A. (1997). Introducing Mira variables. J. AAVSO 25 57--62.
McCullagh, P. and Nelder, J. A. (1989). Generalized Linear Models, 2nd ed. Chapman and Hall, London.
Mathematical Reviews (MathSciNet): MR727836
Zentralblatt MATH: 0588.62104
Neyman, J. (1937). ``Smooth'' test for goodness of fit. Skandinavisk Aktuarietidskrift 20 149--199.
Rayner, J. C. W. and Best, D. J. (1989). Smooth Tests of Goodness of Fit. Oxford Univ. Press.
Mathematical Reviews (MathSciNet): MR1029526
Zentralblatt MATH: 0731.62064
Rayner, J. C. W. and Best, D. J. (1990). Smooth tests of goodness of fit: An overview. Internat. Statist. Rev. 58 9--17.
Mathematical Reviews (MathSciNet): MR1029526
Zentralblatt MATH: 0731.62064
Rissanen, J. (1983). A universal prior for integers and estimation by minimum description length. Ann. Statist. 11 416--431.
Mathematical Reviews (MathSciNet): MR696056
Digital Object Identifier: doi:10.1214/aos/1176346150
Project Euclid: euclid.aos/1176346150
Zentralblatt MATH: 0513.62005
Samorodnitsky, G. and Taqqu, M. S. (1994). Stable Non-Gaussian Random Processes: Stochastic Models with Infinite Variance. Chapman and Hall, New York.
Mathematical Reviews (MathSciNet): MR1280932
Zentralblatt MATH: 0925.60027
Schwarz, G. (1978). Estimating the dimension of a model. Ann. Statist. 6 461--464.
Mathematical Reviews (MathSciNet): MR468014
Digital Object Identifier: doi:10.1214/aos/1176344136
Project Euclid: euclid.aos/1176344136
Zentralblatt MATH: 0379.62005
Szegö, G. (1975). Orthogonal Polynomials, 4th ed. Amer. Math. Soc., Providence, RI.
Mathematical Reviews (MathSciNet): MR372517
Verdinelli, I. and Wasserman, L. (1998). Bayesian goodness-of-fit testing using infinite-dimensional exponential families. Ann. Statist. 26 1215--1241.
Mathematical Reviews (MathSciNet): MR1647645
Digital Object Identifier: doi:10.1214/aos/1024691240
Project Euclid: euclid.aos/1024691240
Zentralblatt MATH: 0930.62027
White, H. (1994). Estimation, Inference and Specification Analysis. Cambridge Univ. Press.
Mathematical Reviews (MathSciNet): MR1292251
Zentralblatt MATH: 0860.62100

2012 © Institute of Mathematical Statistics

The Annals of Statistics

The Annals of Statistics