Open Access
2024 Probability-weighted clustered coefficient regression models in complex survey sampling
Mingjun Gang, Xin Wang, Zhonglei Wang, Wei Zhong
Author Affiliations +
Electron. J. Statist. 18(2): 4198-4234 (2024). DOI: 10.1214/24-EJS2295

Abstract

Regression analysis is commonly conducted in survey sampling. However, existing methods fail when regression models vary across different clusters of domains. In this paper, we propose a unified framework to study the cluster-wise covariate effect under complex survey sampling based on pairwise penalties, and the associated objective function is solved by the alternating direction method of multipliers. Theoretical properties of the proposed method are investigated under regularity conditions. Numerical experiments demonstrate that the proposed method outperforms its alternatives in terms of identifying the cluster structure and estimation efficiency for both linear regression and logistic regression models. American Community Survey is used as an example to illustrate the advantages of the proposed approach.

Funding Statement

This research is partially supported by National Key R&D Program of China 2022YFA1003800, NSF SES 2316353, Open Research Fund of Key Laboratory of Analytical Mathematics and Applications (Fujian Normal University), Ministry of Education, P. R. China, Humanities and Social Sciences Foundation of the Ministry of Education of China Grant (No. 23YJA910005), NSSFC (No. 23CMZ005), NSFC (No.: 72033002, 12231011, 71988101, 12471265). Wang, Z. and Zhong, W. also thank the supports of Fujian Key Lab of Statistics, Fujian Key lab of Digital Finance.

Acknowledgments

We would like to thank the editor, the AE, and two anonymous reviewers for constructive comments and suggestions.

Citation

Download Citation

Mingjun Gang. Xin Wang. Zhonglei Wang. Wei Zhong. "Probability-weighted clustered coefficient regression models in complex survey sampling." Electron. J. Statist. 18 (2) 4198 - 4234, 2024. https://doi.org/10.1214/24-EJS2295

Information

Received: 1 November 2023; Published: 2024
First available in Project Euclid: 12 November 2024

arXiv: 2210.09339
Digital Object Identifier: 10.1214/24-EJS2295

Subjects:
Primary: 62D05 , 62J07
Secondary: 62J05 , 62J12

Keywords: Clustered coefficient , generalized linear regression models , pairwise penalties , survey sampling

Vol.18 • No. 2 • 2024
Back to Top