Open Access
August 2018 Accuracy assessment for high-dimensional linear regression
T. Tony Cai, Zijian Guo
Ann. Statist. 46(4): 1807-1836 (August 2018). DOI: 10.1214/17-AOS1604

Abstract

This paper considers point and interval estimation of the $\ell_{q}$ loss of an estimator in high-dimensional linear regression with random design. We establish the minimax rate for estimating the $\ell_{q}$ loss and the minimax expected length of confidence intervals for the $\ell_{q}$ loss of rate-optimal estimators of the regression vector, including commonly used estimators such as Lasso, scaled Lasso, square-root Lasso and Dantzig Selector. Adaptivity of confidence intervals for the $\ell_{q}$ loss is also studied. Both the setting of the known identity design covariance matrix and known noise level and the setting of unknown design covariance matrix and unknown noise level are studied. The results reveal interesting and significant differences between estimating the $\ell_{2}$ loss and $\ell_{q}$ loss with $1\le q<2$ as well as between the two settings.

New technical tools are developed to establish rate sharp lower bounds for the minimax estimation error and the expected length of minimax and adaptive confidence intervals for the $\ell_{q}$ loss. A significant difference between loss estimation and the traditional parameter estimation is that for loss estimation the constraint is on the performance of the estimator of the regression vector, but the lower bounds are on the difficulty of estimating its $\ell_{q}$ loss. The technical tools developed in this paper can also be of independent interest.

Citation

Download Citation

T. Tony Cai. Zijian Guo. "Accuracy assessment for high-dimensional linear regression." Ann. Statist. 46 (4) 1807 - 1836, August 2018. https://doi.org/10.1214/17-AOS1604

Information

Received: 1 March 2016; Revised: 1 March 2017; Published: August 2018
First available in Project Euclid: 27 June 2018

zbMATH: 06936479
MathSciNet: MR3819118
Digital Object Identifier: 10.1214/17-AOS1604

Subjects:
Primary: 62G15
Secondary: 62C20 , 62H35

Keywords: Accuracy assessment , Adaptivity , Confidence interval , high-dimensional linear regression , loss estimation , minimax lower bound , minimaxity , Sparsity

Rights: Copyright © 2018 Institute of Mathematical Statistics

Vol.46 • No. 4 • August 2018
Back to Top