- Volume 24, Number 4B (2018), 3683-3710.
Adaptive estimation of high-dimensional signal-to-noise ratios
We consider the equivalent problems of estimating the residual variance, the proportion of explained variance $\eta$ and the signal strength in a high-dimensional linear regression model with Gaussian random design. Our aim is to understand the impact of not knowing the sparsity of the vector of regression coefficients and not knowing the distribution of the design on minimax estimation rates of $\eta$. Depending on the sparsity $k$ of the vector regression coefficients, optimal estimators of $\eta$ either rely on estimating the vector of regression coefficients or are based on $U$-type statistics. In the important situation where $k$ is unknown, we build an adaptive procedure whose convergence rate simultaneously achieves the minimax risk over all $k$ up to a logarithmic loss which we prove to be non avoidable. Finally, the knowledge of the design distribution is shown to play a critical role. When the distribution of the design is unknown, consistent estimation of explained variance is indeed possible in much narrower regimes than for known design distribution.
Bernoulli, Volume 24, Number 4B (2018), 3683-3710.
Received: March 2017
Revised: July 2017
First available in Project Euclid: 18 April 2018
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Verzelen, Nicolas; Gassiat, Elisabeth. Adaptive estimation of high-dimensional signal-to-noise ratios. Bernoulli 24 (2018), no. 4B, 3683--3710. doi:10.3150/17-BEJ975. https://projecteuclid.org/euclid.bj/1524038767
- Supplement to “Adaptive estimation of high-dimensional signal-to-noise ratios”. This supplement contains the remaining proofs.