Open Access
June 2018 Distributed testing and estimation under sparse high dimensional models
Heather Battey, Jianqing Fan, Han Liu, Junwei Lu, Ziwei Zhu
Ann. Statist. 46(3): 1352-1382 (June 2018). DOI: 10.1214/17-AOS1587

Abstract

This paper studies hypothesis testing and parameter estimation in the context of the divide-and-conquer algorithm. In a unified likelihood-based framework, we propose new test statistics and point estimators obtained by aggregating various statistics from $k$ subsamples of size $n/k$, where $n$ is the sample size. In both low dimensional and sparse high dimensional settings, we address the important question of how large $k$ can be, as $n$ grows large, such that the loss of efficiency due to the divide-and-conquer algorithm is negligible. In other words, the resulting estimators have the same inferential efficiencies and estimation rates as an oracle with access to the full sample. Thorough numerical results are provided to back up the theory.

Citation

Download Citation

Heather Battey. Jianqing Fan. Han Liu. Junwei Lu. Ziwei Zhu. "Distributed testing and estimation under sparse high dimensional models." Ann. Statist. 46 (3) 1352 - 1382, June 2018. https://doi.org/10.1214/17-AOS1587

Information

Received: 1 September 2015; Revised: 1 December 2016; Published: June 2018
First available in Project Euclid: 3 May 2018

zbMATH: 1392.62060
MathSciNet: MR3798006
Digital Object Identifier: 10.1214/17-AOS1587

Subjects:
Primary: 62F05 , 62F10
Secondary: 62F12

Keywords: debiasing , Divide and conquer , massive data , thresholding

Rights: Copyright © 2018 Institute of Mathematical Statistics

Vol.46 • No. 3 • June 2018
Back to Top