Open Access
June 2018 Test for high-dimensional regression coefficients using refitted cross-validation variance estimation
Hengjian Cui, Wenwen Guo, Wei Zhong
Ann. Statist. 46(3): 958-988 (June 2018). DOI: 10.1214/17-AOS1573

Abstract

Testing a hypothesis for high-dimensional regression coefficients is of fundamental importance in the statistical theory and applications. In this paper, we develop a new test for the overall significance of coefficients in high-dimensional linear regression models based on an estimated U-statistics of order two. With the aid of the martingale central limit theorem, we prove that the asymptotic distributions of the proposed test are normal under two different distribution assumptions. Refitted cross-validation (RCV) variance estimation is utilized to avoid the overestimation of the variance and enhance the empirical power. We examine the finite-sample performances of the proposed test via Monte Carlo simulations, which show that the new test based on the RCV estimator achieves higher powers, especially for the sparse cases. We also demonstrate an application by an empirical analysis of a microarray data set on Yorkshire gilts.

Citation

Download Citation

Hengjian Cui. Wenwen Guo. Wei Zhong. "Test for high-dimensional regression coefficients using refitted cross-validation variance estimation." Ann. Statist. 46 (3) 958 - 988, June 2018. https://doi.org/10.1214/17-AOS1573

Information

Received: 1 February 2016; Revised: 1 April 2017; Published: June 2018
First available in Project Euclid: 3 May 2018

zbMATH: 1392.62159
MathSciNet: MR3797993
Digital Object Identifier: 10.1214/17-AOS1573

Subjects:
Primary: 62F03 , 62H15

Keywords: high-dimensional regression , Hypothesis testing , martingale central limit theorem , refitted cross-validation variance estimation , U-statistics

Rights: Copyright © 2018 Institute of Mathematical Statistics

Vol.46 • No. 3 • June 2018
Back to Top