Open Access
December 2016
Improving covariate balance in $2^{K}$ factorial designs via rerandomization with an application to a New York City Department of Education High School Study
Zach Branson, Tirthankar Dasgupta, Donald B. Rubin
Ann. Appl. Stat. 10(4): 1958-1976 (December 2016). DOI: 10.1214/16-AOAS959
Abstract

A few years ago, the New York Department of Education (NYDE) was planning to conduct an experiment involving five new intervention programs for a selected set of New York City high schools. The goal was to estimate the causal effects of these programs and their interactions on the schools’ performance. For each of the schools, about 50 premeasured covariates were available. The schools could be randomly assigned to the 32 treatment combinations of this $2^{5}$ factorial experiment, but such an allocation could have resulted in a huge covariate imbalance across treatment groups. Standard methods used to prevent confounding of treatment effects with covariate effects (e.g., blocking) were not intuitive due to the large number of covariates. In this paper, we explore how the recently proposed and studied method of rerandomization can be applied to this problem and other factorial experiments. We propose how to implement rerandomization in factorial experiments, extend the theoretical properties of rerandomization from single-factor experiments to $2^{K}$ factorial designs, and demonstrate, using the NYDE data, how such a designed experiment can improve precision of estimated factorial effects.
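To make the idea of rerandomization in a balanced $2^{K}$ factorial design concrete, the following is a minimal sketch rather than the authors' procedure. It assumes a simplified acceptance rule: for every factor and every covariate, the standardized difference in covariate means between the high and low levels of that factor must not exceed a chosen threshold. That rule is swapped in here to keep the example dependency-free and is not taken from the paper. The function names, the threshold value, and the simulated covariates are all hypothetical.

```python
import itertools
import random
import statistics


def factorial_combinations(k):
    """Return all 2^k treatment combinations, coded as tuples of -1/+1."""
    return list(itertools.product((-1, 1), repeat=k))


def max_standardized_imbalance(assignment, covariates):
    """Largest |standardized mean difference| over all factors and covariates.

    Assumes a balanced design, so each factor level holds n/2 units and the
    difference in means has standard error sd * sqrt(4 / n).
    """
    n = len(assignment)
    k = len(assignment[0])
    p = len(covariates[0])
    worst = 0.0
    for j in range(p):
        column = [row[j] for row in covariates]
        sd = statistics.stdev(column)
        if sd == 0:
            continue  # a constant covariate cannot be imbalanced
        se = sd * (4.0 / n) ** 0.5
        for f in range(k):
            high = [x for x, a in zip(column, assignment) if a[f] == 1]
            low = [x for x, a in zip(column, assignment) if a[f] == -1]
            diff = statistics.fmean(high) - statistics.fmean(low)
            worst = max(worst, abs(diff) / se)
    return worst


def rerandomize(covariates, k, threshold, rng, max_tries=10000):
    """Redraw balanced random allocations until the imbalance is acceptable.

    Returns (assignment, imbalance score, number of draws used).
    """
    combos = factorial_combinations(k)
    n = len(covariates)
    if n % len(combos) != 0:
        raise ValueError("number of units must be a multiple of 2^k")
    reps = n // len(combos)
    base = [c for c in combos for _ in range(reps)]
    for tries in range(1, max_tries + 1):
        assignment = base[:]
        rng.shuffle(assignment)  # one complete randomization
        score = max_standardized_imbalance(assignment, covariates)
        if score <= threshold:
            return assignment, score, tries
    raise RuntimeError("no acceptable allocation found; loosen the threshold")


if __name__ == "__main__":
    # Hypothetical data: 32 units, 3 simulated covariates, a 2^2 design.
    rng = random.Random(2016)
    covs = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(32)]
    assignment, score, tries = rerandomize(covs, k=2, threshold=1.0, rng=rng)
    print(f"accepted after {tries} draws, max imbalance {score:.3f}")
```

A stricter threshold gives better balance but requires more draws on average, so the threshold is a trade-off between balance and computation.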

Copyright © 2016 Institute of Mathematical Statistics
Zach Branson, Tirthankar Dasgupta, and Donald B. Rubin "Improving covariate balance in $2^{K}$ factorial designs via rerandomization with an application to a New York City Department of Education High School Study," The Annals of Applied Statistics 10(4), 1958-1976, (December 2016). https://doi.org/10.1214/16-AOAS959
Received: 1 April 2016; Published: December 2016
Vol. 10 • No. 4 • December 2016