Longitudinal studies are often conducted to explore the cohort and age effects in many scientific areas. The within cluster correlation structure plays a very important role in longitudinal data analysis. This is because not only can an estimator be improved by incorporating the within cluster correlation structure into the estimation procedure, but also the within cluster correlation structure can sometimes provide valuable insights in practical problems. For example, it can reveal the correlation strengths among the impacts of various factors. Motivated by data typified by a set from Bangladesh pertinent to the use of contraceptives, we propose a random effect varying-coefficient model, and an estimation procedure for the within cluster correlation structure of the proposed model. The estimation procedure is optimization-free and the proposed estimators enjoy asymptotic normality under mild conditions. Simulations suggest that the proposed estimation is practicable for finite samples and resistent against mild forms of model misspecification. Finally, we analyze the data mentioned above with the new random effect varying-coefficient model together with the proposed estimation procedure, which reveals some interesting sociological dynamics.
References
Cai, Z., Fan, J. and Li, R. (2000). Efficient estimation and inferences for varying-coefficient models. J. Amer. Statist. Assoc. 95 888--902.
Cleland, J., Phillips, J. F., Amin, S. and Kamal, G. M. (1994). The Determinants of Reproductive Change in Bangladesh: Success in a Challenging Environment. The World Bank, Washington.
Commenges, D. and Jacqmin-Gadda, H. (1997). Generalized score test of homogeneity based on correlated random effects models. J. Roy. Statist. Soc. Ser. B 59 157--171.
Cox, D. R. (1972). Regression models and life tables (with discussion). J. Roy. Statist. Soc. Ser. B 34 187--220.
Diggle, P. J., Heagerty, P., Liang, K.-Y. and Zeger, S. L. (2002). Analysis of Longitudinal Data, 2nd ed. Oxford Univ. Press.
Fan, J. and Gijbels, I. (1994). Censored regression: Local linear approximations and their applications. J. Amer. Statist. Assoc. 89 560--570.
Fan, J., Huang, T. and Li, R. (2007). Analysis of longitudinal data with semiparametric estimation of covariance function. J. Amer. Statist. Assoc. 102 632--641.
Fan, J. and Li, R. (2004). New estimation and model selection procedures for semiparametric modeling in longitudinal data analysis. J. Amer. Statist. Assoc. 99 710--723.
Fan, J., Yao, Q. and Cai, Z. (2003). Adaptive varying-coefficient linear models. J. R. Stat. Soc. Ser. B Stat. Methodol. 65 57--80.
Fan, J. and Zhang, J.-T. (2000). Two-step estimation of functional linear models with applications to longitudinal data. J. R. Stat. Soc. Ser. B Stat. Methodol. 62 303--322.
Fan, J. and Zhang, W. (1999). Statistical estimation in varying coefficient models. Ann. Statist. 27 1491--1518.
Hastie, T. J. and Tibshirani, R. J. (1993). Varying-coefficient models (with discussion). J. Roy. Statist. Soc. Ser. B 55 757--796.
He, X., Fung, W. K. and Zhu, Z. Y. (2005). Robust estimation in generalized partial linear models for clustered data. J. Amer. Statist. Assoc. 100 1176--1184.
Hedeker, D. and Gibbons, R. (1994). A random-effects ordinal regression model for multilevel analysis. Biometrics 50 933--944.
Hoover, D. R., Rice, J. A., Wu, C. O. and Yang, L.-P. (1998). Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data. Biometrika 85 809--822.
Jacoby, S. L. S., Kowalik, J. S. and Pizzo, J. T. (1972). Iterative Methods for Nonlinear Optimization Problems. Prentice Hall, Englewood Cliffs, NJ.
Jennrich, R. I. and Schluchter, M. D. (1986). Unbalanced repeated-measures models with structured covariance matrices. Biometrics 42 805--820.
Laird, N. M. and Ware, J. H. (1982). Random-effects models for longitudinal data. Biometrics 38 963--974.
Lin, X. and Carroll, R. J. (2000). Nonparametric function estimation for clustered data when the predictor is measured without/with error. J. Amer. Statist. Assoc. 95 520--534.
Lindstrom, M. J. and Bates, D. M. (1990). Nonlinear mixed effects models for repeated measures data. Biometrics 46 673--687.
Longford, N. T. (1987). A fast scoring algorithm for maximum likelihood estimation in unbalanced mixed models with nested random effects. Biometrika 74 817--827.
Mack, Y. P. and Silverman, B. W. (1982). Weak and strong uniform consistency of kernel regression estimates. Z. Wahrsch. Verw. Gebiete 61 405--415.
Mitra, S. N., Al-Sabir, A., Cross, A. R. and Jamil, K. (1997). Bangladesh Demographic and Health Survey 1996--1997. National Institute of Population Research and Training (NIPORT), Mitra and Associates, Dhaka, Bangladesh and DHS Macro International, Inc., Calverton, MD.
Qu, A. and Li, R. (2006). Quadratic inference functions for varying-coefficient models with longitudinal data. Biometrics 62 379--391.
Wang, N. (2003). Marginal nonparametric kernel regression accounting for within-subject correlation. Biometrika 90 43--52.
Welsh, A. H., Lin, X. and Carroll, R. J. (2002). Marginal longitudinal nonparametric regression: Locality and efficiency of spline and kernel methods. J. Amer. Statist. Assoc. 97 482--493.
Wu, H. and Liang, H. (2004). Backfitting random varying-coefficient models with time-dependent smoothing covariates. Scand. J. Statist. 31 3--19.
Wu, H. and Zhang, J.-T. (2002). Local polynomial mixed-effects models for longitudinal data. J. Amer. Statist. Assoc. 97 883--897.
Xia, Y. and Li, W. K. (1999). On the estimation and testing of functional-coefficient linear models. Statist. Sinica 9 735--757.
Zeger, S. L. and Diggle, P. J. (1994). Semiparametric models for longitudinal data with application to CD4 cell numbers in HIV seroconverters. Biometrics 50 689--699.
Zeger, S. L., Liang, K.-Y. and Albert, P. S. (1988). Models for longitudinal data: A generalized estimating equation approach. Biometrics 44 1049--1060.
Zhang, W., Lee, S.-Y. and Song, X. Y. (2002). Local polynomial fitting in semivarying coefficient model. J. Multivariate Anal. 82 166--188.