Abstract
The divide and conquer strategy, which breaks a massive data set into a series of manageable data blocks, and combines the independent results of data blocks to obtain a final decision, has been recognized as a state-of-the-art method to overcome challenges of massive data analysis. In this paper, we equip the classical local average regression with some divide and conquer strategies to infer the regressive relationship of input-output pairs from a massive data set. When the average mixture, a widely used divide and conquer approach, is adopted, we prove that the optimal learning rate can be achieved under some restrictive conditions on the number of data blocks. We then propose two variants to relax (or remove) these conditions and derive the same optimal learning rates as that for the average mixture local average regression. Our theoretical assertions are verified by a series of experimental studies.
Citation
Xiangyu Chang. Shao-Bo Lin. Yao Wang. "Divide and conquer local average regression." Electron. J. Statist. 11 (1) 1326 - 1350, 2017. https://doi.org/10.1214/17-EJS1265
Information