The Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 9, Number 4 (2015), 2237-2265.
Assessing nonresponse bias in a business survey: Proxy pattern-mixture analysis for skewed data
The Service Annual Survey (SAS) is a business survey conducted annually by the U.S. Census Bureau that collects aggregate and detailed revenues and expenses data. Typical of many business surveys, the SAS population is highly positively skewed, with large companies comprising a large proportion of the published totals. When alternative data are not available, missing data are handled with ratio imputation models that assume missingness is at random. We propose a proxy pattern-mixture (PPM) model that provides a simple framework for assessing nonresponse bias with respect to different nonresponse mechanisms. PPM models were first introduced in this context by Andridge and Little [Journal of Official Statistics 27 (2011) 153–180], but their model assumed the characteristic of interest and the predicted proxy have a bivariate normal distribution, conditional on the missingness indicator. Although often appropriate for large demographic surveys, the normality assumption is less justifiable for the highly skewed SAS data. We propose an alternative PPM model using a bivariate gamma distribution more appropriate for the SAS data. We compare the two PPM models through application to data from six years of data collection in three industries in the health care and transportation sectors of the SAS. Finally, we illustrate properties of the method through simulation.
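As a rough illustration of the ratio imputation that the abstract describes as the default treatment for missing data, the sketch below fills each missing value with a ratio estimate multiplied by an auxiliary variable. The abstract does not specify the auxiliary variables or imputation cells used in the SAS, so the function name `ratio_impute`, the size measure `x`, and the simulated gamma population are all assumptions made for illustration only, not the authors' procedure.

```python
import numpy as np

def ratio_impute(y, x):
    """Fill missing y (NaN) with R * x, where R = sum(y_obs) / sum(x_obs).

    Hypothetical helper: x is an auxiliary variable known for every unit.
    """
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    observed = ~np.isnan(y)
    # Ratio estimated from respondents only.
    ratio = y[observed].sum() / x[observed].sum()
    return np.where(observed, y, ratio * x)

# Simulated, positively skewed population (assumed, not SAS data).
rng = np.random.default_rng(0)
x = rng.gamma(shape=0.5, scale=100.0, size=1000)
y = 1.2 * x * rng.gamma(shape=20.0, scale=1 / 20.0, size=1000)

# Nonresponse unrelated to y or x, affecting roughly 30% of units.
y_obs = y.copy()
y_obs[rng.random(1000) < 0.3] = np.nan

y_completed = ratio_impute(y_obs, x)
print(y.sum(), y_completed.sum())
```

In this sketch nonresponse is independent of the outcome, so the completed total should land near the true total. If nonresponse instead depended on `y` itself, the same imputation could be biased, which is the kind of departure the PPM sensitivity analysis is meant to probe.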
Received: March 2015
Revised: July 2015
First available in Project Euclid: 28 January 2016
Andridge, Rebecca; Thompson, Katherine Jenny. Assessing nonresponse bias in a business survey: Proxy pattern-mixture analysis for skewed data. Ann. Appl. Stat. 9 (2015), no. 4, 2237-2265. doi:10.1214/15-AOAS878. https://projecteuclid.org/euclid.aoas/1453994199
- Supplement to “Assessing nonresponse bias in a business survey: Proxy pattern-mixture analysis for skewed data”. The supplementary material contains the results of applying multiple imputation using the gamma PPM model and the normal PPM model for $\lambda=0$ (MAR) and $\lambda=\infty$ (MNAR) in the three SAS industries for the expenses model.