Hypothesis testing for high-dimensional multinomials: A selective review

Sivaraman Balakrishnan; Larry Wasserman

doi:10.1214/18-AOAS1155SF

June 2018 Hypothesis testing for high-dimensional multinomials: A selective review

Sivaraman Balakrishnan, Larry Wasserman

Ann. Appl. Stat. 12(2): 727-749 (June 2018). DOI: 10.1214/18-AOAS1155SF

Abstract

The statistical analysis of discrete data has been the subject of extensive statistical research dating back to the work of Pearson. In this survey we review some recently developed methods for testing hypotheses about high-dimensional multinomials. Traditional tests like the $\chi^{2}$-test and the likelihood ratio test can have poor power in the high-dimensional setting. Much of the research in this area has focused on finding tests with asymptotically normal limits and developing (stringent) conditions under which tests have normal limits. We argue that this perspective suffers from a significant deficiency: it can exclude many high-dimensional cases when—despite having non-normal null distributions—carefully designed tests can have high power. Finally, we illustrate that taking a minimax perspective and considering refinements of this perspective can lead naturally to powerful and practical tests.

Citation

Download Citation

Sivaraman Balakrishnan. Larry Wasserman. "Hypothesis testing for high-dimensional multinomials: A selective review." Ann. Appl. Stat. 12 (2) 727 - 749, June 2018. https://doi.org/10.1214/18-AOAS1155SF