Statistical Science

An Overview on Regression Models for Discrete Longitudinal Responses

Brajendra C. Sutradhar

Source: Statist. Sci. Volume 18, Issue 3 (2003), 377-393.

Abstract

In the longitudinal regression setup, interest may be focused primarily on the regression parameters for the marginal expectations of the longitudinal responses, the longitudinal correlation parameters being of secondary interest. Second, interest may be focused on both the regression and the longitudinal correlation parameters. Under the first setup, there exists a "working'' correlation matrix based generalized estimating equation (GEE) approach for the estimation of the regression parameters. Under the second setup, there exist two approaches for the joint estimation of the regression and the longitudinal correlations. In one approach, true longitudinal correlations are modeled and the regression and the true correlation parameters are jointly estimated based on a GEE approach. The second approach avoids the specification of the true longitudinal correlation structure and deals with the joint estimation of the regression and a vector of "working'' correlation parameters. In this second approach under the second setup, there again exist two joint estimation methods, one requiring moments up to order 4 and the other somehow using moments up to order 2 for the construction of the estimating equations for the "working'' correlation parameters. In this paper, we first provide an outline of the desirable features and drawbacks of each of these four existing approaches. By using a general autocorrelation structure to model the true longitudinal correlations, we then provide an outline of the advantages of three new approaches. In the first new approach, the true longitudinal correlations are estimated by the method of moments, whereas the regression estimates are obtained based on a generalized quasi-likelihood (GQL) estimation approach. The other two new approaches simultaneously estimate the regression and the true longitudinal correlation parameters. It is shown through a simulation study that, among these three new approaches, the first approach performs the best in estimating both the regression and the true correlation parameters, even though the longitudinal correlations are estimated separately by the method of moments.

Keywords: Binary and count responses; repeated measures; time dependence between the responses; marginal models; consistent and efficient estimators

Full-text: Open access

Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.ss/1076102426
Digital Object Identifier: doi:10.1214/ss/1076102426
Mathematical Reviews number (MathSciNet): MR2056579
Zentralblatt MATH identifier: 02138321

References

Crowder, M. (1995). On the use of a working correlation matrix in using generalised linear models for repeated measures. Biometrika 82 407--410.
Fitzmaurice, G. M. (1995). A caveat concerning independence estimating equations with multivariate binary data. Biometrics 51 309--317.
Fitzmaurice, G. M. and Laird, N. M. (1993). A likelihood-based method for analysing longitudinal binary responses. Biometrika 80 141--151.
Fitzmaurice, G. M., Laird, N. M. and Rotnitzky, A. G. (1993). Regression models for discrete longitudinal responses (with discussion). Statist. Sci. 8 284--309.
Mathematical Reviews (MathSciNet): MR1243595
Hall, D. B. and Severini, T. A. (1998). Extended generalized estimating equations for clustered data. J. Amer. Statist. Assoc. 93 1365--1375.
Mathematical Reviews (MathSciNet): MR1666633
Liang, K.-Y. and Zeger, S. L. (1986). Longitudinal data analysis using generalized linear models. Biometrika 73 13--22.
Mathematical Reviews (MathSciNet): MR836430
Zentralblatt MATH: 0595.62110
Digital Object Identifier: doi:10.2307/2336267
Lipsitz, S. R., Fitzmaurice, G. M., Orav, E. J. and Laird, N. M. (1994). Performance of generalized estimating equations in practical situations. Biometrics 50 270--278.
Mancl, L. A. and Leroux, B. G. (1996). Efficiency of regression estimates for clustered data. Biometrics 52 500--511.
McCullagh, P. (1983). Quasilikelihood functions. Ann. Statist. 11 59--67.
Mathematical Reviews (MathSciNet): MR684863
McKenzie, E. (1988). Some ARMA models for dependent sequences of Poisson counts. Adv. in Appl. Probab. 20 822--835.
Mathematical Reviews (MathSciNet): MR968000
Digital Object Identifier: doi:10.2307/1427362
Neuhaus, J. M. (1993). Estimation efficiency and tests of covariate effects with clustered binary data. Biometrics 49 989--996.
Prentice, R. L. and Zhao, L. P. (1991). Estimating equations for parameters in means and covariances of multivariate discrete and continuous responses. Biometrics 47 825--839.
Mathematical Reviews (MathSciNet): MR1141951
Sutradhar, B. C. and Das, K. (1999). On the efficiency of regression estimators in generalised linear models for longitudinal data. Biometrika 86 459--465.
Mathematical Reviews (MathSciNet): MR1705378
Zentralblatt MATH: 0956.62053
Digital Object Identifier: doi:10.1093/biomet/86.2.459
Sutradhar, B. C. and Das, K. (2000). On the accuracy of efficiency of estimating equation approach. Biometrics 56 622--625.
Sutradhar, B. C. and Kovacevic, M. (2000). Analysing ordinal longitudinal survey data: Generalized estimating equations approach. Biometrika 87 837--848.
Mathematical Reviews (MathSciNet): MR1813978
Zentralblatt MATH: 1028.62009
Digital Object Identifier: doi:10.1093/biomet/87.4.837
Sutradhar, B. C. and Kumar, P. (2001). On the efficiency of extended generalized estimating equation approaches. Statist. Probab. Lett. 55 53--61.
Mathematical Reviews (MathSciNet): MR1860192
Zeger, S. L., Liang, K.-Y. and Albert, P. S. (1988). Models for longitudinal data: A generalized estimating equation approach. Biometrics 44 1049--1060.
Mathematical Reviews (MathSciNet): MR980999
Zhao, L. P. and Prentice, R. L. (1990). Correlated binary regression using a quadratic exponential model. Biometrika 77 642--648.
Mathematical Reviews (MathSciNet): MR1087856
Digital Object Identifier: doi:10.2307/2337004
Zhao, L. P., Prentice, R. L. and Self, S. G. (1992). Multivariate mean parameter estimation by using a partly exponential model. J. Roy. Statist. Soc. Ser. B 54 805--811.

2010 © Institute of Mathematical Statistics