Institute of Mathematical Statistics Collections

On estimating the change point in generalized linear models

Hongling Zhou, Kung-Yee Liang

Abstract

Statistical models incorporating change points are common in practice, especially in the area of biomedicine. This approach is appealing in that a specific parameter is introduced to account for the abrupt change in the response variable relating to a particular independent variable of interest. The statistical challenge one encounters is that the likelihood function is not differentiable with respect to this change point parameter. Consequently, the conventional asymptotic properties for the maximum likelihood estimators fail to hold in this situation. In this paper, we propose an estimating procedure for estimating the change point along with other regression coefficients under the generalized linear model framework. We show that the proposed estimators enjoy the conventional asymptotic properties including consistency and normality. Simulation work we conducted suggests that it performs well for the situations considered. We applied the proposed method to a case-control study aimed to examine the relationship between the risk of myocardial infarction and alcohol intake.

First Page: Show Hide
Primary Subjects: 62F10, 62F12
Secondary Subjects: 62E20
Keywords: asymptotic normality; change point; consistency; generalized linear model; smoothing function
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.imsc/1207058282
Digital Object Identifier: doi:10.1214/193940307000000239

References

[1] Bacon, D. W. and Watts, D. G. (1971). Estimating the transition between two intersecting straight lines. Biometrika 58 525–534.
[2] Bhattacharya, P. K. (1991). Weak convergence of the log likelihood process in the two-phase linear regression problem. Probab. Statist. Design Experiments 145–156.
[3] Bhattacharya, P. K. (1994). Some aspects of change-point analysis. Change-Point Problems (E. Carlstein, H.-G. Miller and D. Siegmund, eds.) 28–56.
Mathematical Reviews (MathSciNet): MR1477912
Zentralblatt MATH: 1157.62331
Digital Object Identifier: doi:10.1214/lnms/1215463112
[4] Boukai, B. (1993). A nonparametric bootstrapped estimate of the change-point. J. Nonparametr. Statist. 123–134.
Mathematical Reviews (MathSciNet): MR1272593
Zentralblatt MATH: 05143405
Digital Object Identifier: doi:10.1080/10485259308832576
[5] Davies, R. B. (1987). Hypothesis testing when a nuisance parameter is present only under the alternative. Biometrika 74 33–43.
Mathematical Reviews (MathSciNet): MR885917
[6] Fahrmeir, L. and Kaufmann, H. (1985). Consistency and asymptotic normality of the maximum likelihood estimator in generalized linear models. Ann. Statist. 13 342–368.
Mathematical Reviews (MathSciNet): MR773172
Digital Object Identifier: doi:10.1214/aos/1176346597
Project Euclid: euclid.aos/1176346597
[7] Feder, P. I. (1975). On asymptotic distribution theory in segmented regression problems – identified case. Ann. Statist. 3 49–83.
Mathematical Reviews (MathSciNet): MR378267
Zentralblatt MATH: 0324.62014
Digital Object Identifier: doi:10.1214/aos/1176342999
Project Euclid: euclid.aos/1176342999
[8] Gallant, A. R. (1977). Testing a nonlinear regression specification: A nonregular case. J. Amer. Statist. Assoc. 72 523–530.
[9] Gallant, A. R. and Fuller, W. A. (1973). Fitting segmented polynomial regression models whose join points have to be estimated. J. Amer. Statist. Assoc. 68 144–147.
[10] Hinkley, D. V. (1969). Inference about the intersection in two-phase regression. Biometrika 56 495–504.
Mathematical Reviews (MathSciNet): MR254946
Zentralblatt MATH: 0183.48101
Digital Object Identifier: doi:10.1093/biomet/56.3.635
[11] Horowitz, J. L. (1992). A smoothed maximum score estimator for the binary response model. Econometrica 60 505–531.
Mathematical Reviews (MathSciNet): MR1162997
Digital Object Identifier: doi:10.2307/2951582
[12] Kardinaal, A. F. M., van’t Veer, P., Kok, F. J., Rignstad, J., G=mez-Aracena, J., Mazaev, V. P., Kohlmeier, L., Martin, B. C., Aro, A., Huttunen, J. K., Kark, J. D., Delgado-Rodriguez, M., Riemersma, R. A., Martin-Moreno, J. M., Kok, F. J., Huttunenm, J. K., Kohlmeierm, L. and Martin-Morenom, J. M. (1993). Antioxidants in adipose tissue and risk of myocardial infarction: The EURAMIC study. Lancet 342 1379–1384.
[13] MacNeill, I. B. and Mao, Y. (1995). Change-point analysis for mortality and morbidity rate. In Applied Change Point Problems in Statistics (B. Sinha, A. Rukhin and M. Ahsanullah, eds.) 37–55.
Mathematical Reviews (MathSciNet): MR1396788
[14] Manski, C. (1985). Semiparametric analysis of discrete response: Asymptotic properties of the maximum score estimator. J. Econometrics 27 313–334.
Mathematical Reviews (MathSciNet): MR788628
Digital Object Identifier: doi:10.1016/0304-4076(85)90009-0
[15] McCullagh, P. and Nelder, J. A. (1989). Generalized Linear Models. Chapman and Hall/CRC.
Mathematical Reviews (MathSciNet): MR727836
[16] Pastor, R. and Guallar, E. (1998). Use of two-segmented logistic regression to estimate change-points in epidemiologic studies. Amer. J. Epidemiology 148 631–642.
[17] Rao, B. L. S. P. (1983). Nonparametric Functional Estimation. Academic Press.
Mathematical Reviews (MathSciNet): MR740865
[18] Sprent, P. (1961). Some hypotheses concerning two phase regression lines. Biometrics 17 634–645.

2012 © Institute of Mathematical Statistics

Institute of Mathematical Statistics Collections

Institute of Mathematical Statistics Collections