June 2024 Identification of influencing factors on self-reported count data with multiple potential inflated values
Yang Li, Mingcong Wu, Mengyun Wu, Shuangge Ma
Author Affiliations +
Ann. Appl. Stat. 18(2): 991-1009 (June 2024). DOI: 10.1214/23-AOAS1819

Abstract

The online chauffeured service demand (OCSD) research is an exploratory market study of designated driver services in China. Researchers are interested in the influencing factors of chauffeured service adoption and usage and have collected relevant data using a self-reported questionnaire. As self-reported count measure data is typically inflated, there exist challenges to its validity, which may bias estimation and increase error in empirical research. Motivated by the analysis of self-reported data with multiple inflated values, we propose a novel approach to simultaneously achieve data-driven inflated value selection and identification of important influencing factors. In particular, the regularization technique is applied to the mixing proportions of inflated values and the regression parameters to obtain shrinkage estimates. We analyze the OCSD data with the proposed approach, deriving insights into the determinants impacting service demand. The proper interpretations and implications contribute to service promotion and related policy optimization. Extensive simulation studies and consistent asymptotic properties further establish the effectiveness of the proposed approach.

Funding Statement

The first author was supported by National Natural Science Foundation of China (72271237) and MOE Project of Key Research Institute of Humanities and Social Sciences (22JJD910001), and partly supported by Institute for Data Science in Health, Renmin University of China, Beijing, China.
The third author was supported by National Natural Science Foundation of China 12071273.

Acknowledgments

The authors would like to thank the anonymous referees, an Associate Editor, and the Editor for their constructive comments which led to a significant improvement of this article.

Mengyun Wu is the corresponding author.

Citation

Download Citation

Yang Li. Mingcong Wu. Mengyun Wu. Shuangge Ma. "Identification of influencing factors on self-reported count data with multiple potential inflated values." Ann. Appl. Stat. 18 (2) 991 - 1009, June 2024. https://doi.org/10.1214/23-AOAS1819

Information

Received: 1 June 2023; Published: June 2024
First available in Project Euclid: 5 April 2024

Digital Object Identifier: 10.1214/23-AOAS1819

Keywords: Inflated value selection , investigation , negative binomial data modeling , Penalization

Rights: Copyright © 2024 Institute of Mathematical Statistics

Vol.18 • No. 2 • June 2024
Back to Top