Open Access
2017 Divide and conquer local average regression
Xiangyu Chang, Shao-Bo Lin, Yao Wang
Electron. J. Statist. 11(1): 1326-1350 (2017). DOI: 10.1214/17-EJS1265

Abstract

The divide and conquer strategy, which breaks a massive data set into a series of manageable data blocks, and combines the independent results of data blocks to obtain a final decision, has been recognized as a state-of-the-art method to overcome challenges of massive data analysis. In this paper, we equip the classical local average regression with some divide and conquer strategies to infer the regressive relationship of input-output pairs from a massive data set. When the average mixture, a widely used divide and conquer approach, is adopted, we prove that the optimal learning rate can be achieved under some restrictive conditions on the number of data blocks. We then propose two variants to relax (or remove) these conditions and derive the same optimal learning rates as that for the average mixture local average regression. Our theoretical assertions are verified by a series of experimental studies.

Citation

Download Citation

Xiangyu Chang. Shao-Bo Lin. Yao Wang. "Divide and conquer local average regression." Electron. J. Statist. 11 (1) 1326 - 1350, 2017. https://doi.org/10.1214/17-EJS1265

Information

Received: 1 March 2016; Published: 2017
First available in Project Euclid: 14 April 2017

zbMATH: 1362.62085
MathSciNet: MR3635915
Digital Object Identifier: 10.1214/17-EJS1265

Subjects:
Primary: 62G08

Keywords: $k$ nearest neighbor estimate , Divide and conquer strategy , local average regression , Nadaraya-Watson estimate

Vol.11 • No. 1 • 2017
Back to Top