Source: Statist. Sci. Volume 24, Number 2
(2009), 223-237.
Traditional methods for covariate adjustment of treatment means in designed experiments are inherently conditional on the observed covariate values. In order to develop a coherent general methodology for analysis of covariance, we propose a multivariate variance components model for the joint distribution of the response and covariates. It is shown that, if the design is orthogonal with respect to (random) blocking factors, then appropriate adjustments to treatment means can be made using the univariate variance components model obtained by conditioning on the observed covariate values. However, it is revealed that some widely used models are incorrectly specified, leading to biased estimates and incorrect standard errors. The approach clarifies some issues that have been the source of ongoing confusion in the statistics literature.
References
Bartlett, M. S. (1936). A note on the analysis of variance. Journal of Agricultural Science 26 388.
Begg, M. D. and Parides, M. K. (2003). Separation of individual-level and cluster-level covariate effects in regression analysis of correlated data. Stat. Med. 22 2591–2602.
Berlin, J. A., Kimmel, S. E., Ten Have, T. R. and Sammel, M. D. (1999). An empirical comparison of several clustered data approaches under confounding due to cluster effects in the analysis of complications of coronary angioplasty. Biometrics 55 470–476.
Bose, R. C. (1949). Least Squares Aspects of Analysis of Variance. Institute of Statistics Mimeo Series 9. University North Carolina, Chapel Hill.
Cambanis, S., Huang, S. and Simons, G. (1981). On the theory of elliptically contoured distributions. J. Multivariate Anal. 11 368–385.
Mathematical Reviews (MathSciNet):
MR629795
Cochran, W. G. and Cox, G. M. (1957). Experimental Designs. 2nd ed. Wiley, New York.
Mathematical Reviews (MathSciNet):
MR85682
Cox, D. R. and McCullagh, P. (1982). Some aspects of analysis of covariance (with discussion). Biometrics 38 541–561.
Mathematical Reviews (MathSciNet):
MR685170
Cox, D. R. and Wermuth, N. (2004). Causality: A statistical view. Int. Statist. Review 72 285–305.
Federer, W. T. (1955). Experimental Design. MacMillan, New York.
Fisher, R. A. (1934). Statistical Methods for Research Workers, 5th ed. Oliver and Boyd Ltd., Edinburgh.
Gelman, A. (2005). Analysis of variance: Why it is more important than ever (with discussion). Ann. Statist. 33 1–53.
Kempthorne, O. (1952). Design and Analysis of Experiments. Wiley, New York.
Mathematical Reviews (MathSciNet):
MR45368
Khuri, A. I., Mathew, T. and Sinha, B. K. (1998). Statistical Tests for Mixed Linear Models. Wiley, New York.
Kreft, I. G., de Leeuw, J. and Aiken, L. S. (1995). The effect of different forms of centering in hierarchical linear models. Multivariate Behavioral Research 30 1–21.
Milliken, G. A. and Johnson, D. E. (2002). Analyis of Messy Data, Volume III: Analysis of Covariance. Chapman and Hall, New York.
Neuhaus, J. M. and Kalbfleisch, J. D. (1998). Between- and within-cluster covariate effects in the analysis of clustered data. Biometrics 54 638–645.
Neuhaus, J. M. and McCulloch, C. E. (2006). Separating between- and within-cluster covariate effects by using conditional and partitioning methods. J. Roy. Statist. Soc. Ser. B 68 859–872.
Pearce, S. C. (1953). Field Experiments With Fruit Trees and Other Perennial Plants. Commonwealth Bureau of Horticulture and Plantation Crops, Farnham Royal, Slough, England, App. IV.
Pearce, S. C. (1982). Encyclopedia of Statistical Sciences 1 61–69. Wiley, New York.
Penrose, R. A. (1955). A generalized inverse for matrices. Proc. Cambridge Philos. Soc. 51 406–413.
Mathematical Reviews (MathSciNet):
MR69793
Rao, C. R. (1962). A note on a generalized inverse of a matrix with applications to problems in mathematical statistics. J. Roy. Statist. Soc. Ser. B 24 152–158.
Mathematical Reviews (MathSciNet):
MR138149
Raudenbush, S. W. and Bryk, A. S. (2002). Hierarchical Linear Models: Applications and Data Analysis Method, 2nd ed. Sage Publications, Newbury Park, CA.
Searle, S. R. (1956). Matrix methods in variance and covariance component analysis. Ann. Math. Statist. 27 737–748.
Mathematical Reviews (MathSciNet):
MR81051
Searle, S. R., Casella, G. and McCulloch, C. E. (1992). Variance Components. Wiley, New York.
Snedecor, G. W. and Cochran, W. G. (1967). Statistical Methods, 6th ed. Iowa State Univ. Press, Ames, IA.
Mathematical Reviews (MathSciNet):
MR381046
Wells, M. T. (2009). A conversation with Shayle R. Searle. Statist. Sci. 24 244–254.
Wichsell, S. D. (1930). Remarks on regression. Ann. Math. Statist. 1 3–13.
Zelen, M. (1957). The analysis of covariance for incomplete block designs. Biometrics 13 309–332.
Mathematical Reviews (MathSciNet):
MR90208