Global testing under sparse alternatives: ANOVA, multiple comparisons and the higher criticism

Ery Arias-Castro; Emmanuel J. Candès; Yaniv Plan

doi:10.1214/11-AOS910

Ann. Statist. 39(5): 2533-2556 (October 2011). DOI: 10.1214/11-AOS910

Abstract

Testing for the significance of a subset of regression coefficients in a linear model, a staple of statistical analysis, goes back at least to the work of Fisher who introduced the analysis of variance (ANOVA). We study this problem under the assumption that the coefficient vector is sparse, a common situation in modern high-dimensional settings. Suppose we have p covariates and that under the alternative, the response only depends upon the order of p^(1−α) of those, 0 ≤ α ≤ 1. Under moderate sparsity levels, that is, 0 ≤ α ≤ 1/2, we show that ANOVA is essentially optimal under some conditions on the design. This is no longer the case under strong sparsity constraints, that is, α > 1/2. In such settings, a multiple comparison procedure is often preferred and we establish its optimality when α ≥ 3/4. However, these two very popular methods are suboptimal, and sometimes powerless, under moderately strong sparsity where 1/2 < α < 3/4. We suggest a method based on the higher criticism that is powerful in the whole range α > 1/2. This optimality property is true for a variety of designs, including the classical (balanced) multi-way designs and more modern “p > n” designs arising in genetics and signal processing. In addition to the standard fixed effects model, we establish similar results for a random effects model where the nonzero coefficients of the regression vector are normally distributed.
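
The three procedures compared in the abstract can be illustrated in the simplest possible setting. The sketch below assumes an orthogonal design with unit noise variance, so that the problem reduces to p independent z-scores; this is a simplifying assumption, not the general setting of the paper. The function names, the `frac` cutoff and the simulation parameters are hypothetical, and the higher criticism is written in the standard Donoho-Jin p-value form, which may differ in detail from the statistic analysed by the authors.

```python
import math
import random


def two_sided_pvalues(z):
    """Two-sided normal p-values P(|N(0,1)| >= |z_j|) for each z-score."""
    return [math.erfc(abs(v) / math.sqrt(2.0)) for v in z]


def anova_statistic(z):
    """Sum of squares; chi-square with len(z) degrees of freedom under the null."""
    return sum(v * v for v in z)


def max_statistic(z):
    """Largest absolute z-score, the basis of a Bonferroni-type max test."""
    return max(abs(v) for v in z)


def higher_criticism(z, frac=0.5):
    """Donoho-Jin higher criticism over the smallest frac * p p-values.

    `frac` is an illustrative cutoff (an assumption, not taken from the paper).
    Returns -inf if no p-value lies strictly between 0 and 1.
    """
    pvals = sorted(two_sided_pvalues(z))
    p = len(pvals)
    best = -math.inf
    for i, pv in enumerate(pvals[: max(1, int(frac * p))], start=1):
        if 0.0 < pv < 1.0:
            hc_i = math.sqrt(p) * (i / p - pv) / math.sqrt(pv * (1.0 - pv))
            best = max(best, hc_i)
    return best


def simulate(p=10000, alpha=0.6, amplitude=3.0, seed=0):
    """Draw p z-scores with about p^(1 - alpha) shifted by `amplitude`.

    All default values are hypothetical choices for illustration only.
    """
    rng = random.Random(seed)
    k = max(1, round(p ** (1.0 - alpha)))  # number of nonzero coefficients
    support = set(rng.sample(range(p), k))
    return [
        rng.gauss(0.0, 1.0) + (amplitude if j in support else 0.0)
        for j in range(p)
    ]


if __name__ == "__main__":
    z = simulate()
    print("ANOVA (sum of squares):", anova_statistic(z))
    print("max |z|:", max_statistic(z))
    print("higher criticism:", higher_criticism(z))
```

Calibrating each statistic, for example by recomputing it on simulated null data with `amplitude=0.0`, gives a rough empirical comparison of the three tests at a chosen sparsity level α.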

Citation

Ery Arias-Castro, Emmanuel J. Candès, Yaniv Plan. "Global testing under sparse alternatives: ANOVA, multiple comparisons and the higher criticism." Ann. Statist. 39 (5) 2533-2556, October 2011. https://doi.org/10.1214/11-AOS910

Information

Published: October 2011

First available in Project Euclid: 30 November 2011

zbMATH: 1231.62136

MathSciNet: MR2906877

Digital Object Identifier: 10.1214/11-AOS910

Subjects:

Primary: 62G10, 94A13

Secondary: 62G20

Keywords: Analysis of variance, compressive sensing, detecting a sparse signal, higher criticism, incoherence, minimax detection, random matrices, suprema of Gaussian processes

JOURNAL ARTICLE
24 PAGES