Open Access
2024 Estimation and inference in sparse multivariate regression and conditional Gaussian graphical models under an unbalanced distributed setting
Ensiyeh Nezakati, Eugen Pircalabelu
Author Affiliations +
Electron. J. Statist. 18(1): 599-652 (2024). DOI: 10.1214/23-EJS2193

Abstract

This paper proposes a distributed estimation and inferential framework for sparse multivariate regression and conditional Gaussian graphical models under the unbalanced splitting setting. This type of data splitting arises when the datasets from different sources cannot be aggregated on one single machine or when the available machines are of different powers. In this paper, the number of covariates, responses and machines grow with the sample size, while sparsity is imposed. Debiased estimators of the coefficient matrix and of the precision matrix are proposed on every single machine and theoretical guarantees are provided. Moreover, new aggregated estimators that pool information across the machines using a pseudo log-likelihood function are proposed. It is shown that they enjoy efficiency and asymptotic normality as the number of machines grows with the sample size. The performance of these estimators is investigated via a simulation study and a real data example. It is shown empirically that the performances of these estimators are close to those of the non-distributed estimators which use the entire dataset.

Funding Statement

Computational resources have been provided by the Consortium des Équipements de Calcul Intensif (CÉCI), funded by the Fonds de la Recherche Scientifique de Belgique (F.R.S.-FNRS) under Grant No. 2.5020.11 and by the Walloon Region.

Citation

Download Citation

Ensiyeh Nezakati. Eugen Pircalabelu. "Estimation and inference in sparse multivariate regression and conditional Gaussian graphical models under an unbalanced distributed setting." Electron. J. Statist. 18 (1) 599 - 652, 2024. https://doi.org/10.1214/23-EJS2193

Information

Received: 1 March 2023; Published: 2024
First available in Project Euclid: 22 February 2024

Digital Object Identifier: 10.1214/23-EJS2193

Subjects:
Primary: 62H12 , 62H22
Secondary: 62J07

Keywords: conditional Gaussian graphical models , debiased estimation , multivariate regression models , precision matrix , Sparsity , unbalanced distributed setting

Vol.18 • No. 1 • 2024
Back to Top