Open Access
Translator Disclaimer
May 2002 Inference for Superpopulation Parameters Using Sample Surveys
Barry I. Graubardand, Edward L. Korn
Statist. Sci. 17(1): 73-96 (May 2002). DOI: 10.1214/ss/1023798999


Sample survey inference is historically concerned with finite-population parameters, that is, functions (like means and totals) of the observations for the individuals in the population. In scientific applications, however, interest usually focuses on the “superpopulation” parameters associated with a stochastic mechanismhypothesized to generate the observations in the population rather than the finite-population parameters. Two relevant findings discussed in this paper are that (1) with stratified sampling, it is not sufficient to drop finite-population correction factors from standard design-based variance formulas to obtain appropriate variance formulas for superpopulation inference, and (2) with cluster sampling, standard design-based variance formulas can dramatically underestimate superpopulation variability, even with a small sampling fraction of the final units. A literature review of inference for superpopulation parameters is given, with emphasis on why these findings have not been previously appreciated. Examples are provided for estimating superpopulation means, linear regression coefficients and logistic regression coefficients using U.S. data from the 1987 National Health Interview Survey, the third National Health and Nutrition Examination Survey and the 1986 National Hospital Discharge Survey.


Download Citation

Barry I. Graubardand. Edward L. Korn. "Inference for Superpopulation Parameters Using Sample Surveys." Statist. Sci. 17 (1) 73 - 96, May 2002.


Published: May 2002
First available in Project Euclid: 11 June 2002

zbMATH: 1013.62005
MathSciNet: MR1910075
Digital Object Identifier: 10.1214/ss/1023798999

Keywords: Cluster sampling , complex survey data , design-based inference , model-based inference , random effects , stratified sampling

Rights: Copyright © 2002 Institute of Mathematical Statistics


Vol.17 • No. 1 • May 2002
Back to Top