We establish a mathematical framework that formally validates the two-phase “super-population viewpoint” proposed by Hartley and Sielken [Biometrics 31 (1975) 411–422] by defining a product probability space which includes both the design space and the model space. The methodology we develop combines finite population sampling theory and the classical theory of infinite population sampling to account for the underlying processes that produce the data under a unified approach. Our key results are the following: first, if the sample estimators converge in the design law and the model statistics converge in the model, then, under certain conditions, they are asymptotically independent, and they converge jointly in the product space; second, the sample estimating equation estimator is asymptotically normal around a super-population parameter.
References
Billingsley, P. (1979). Probability and Measure. Wiley, New York.
Billingsley, P. (1995). Probability and Measure, 3rd ed. Wiley, New York.
Binder, D. A. (1983). On the variances of asymptotically normal estimators from complex surveys. Internat. Statist. Rev. 51 279--292.
Binder, D. and Roberts, G. (2003). Design-based and model-based methods for estimating model parameters. In Analysis of Survey Data (R. L. Chambers and C. J. Skinner, eds.) 29--48. Wiley, Chichester.
Chow, Y. S. and Teicher, H. (1997). Probability Theory. Independence, Interchangeability, Martingales, 3rd ed. Springer, New York.
Fuller, W. A. (1975). Regression analysis for sample surveys. Sankhyā Ser. C 37 117--132.
Godambe, V. P. and Thompson, M. E. (1986). Parameters of super-population and survey population: Their relationships and estimation. Internat. Statist. Rev. 54 127--138.
Hájek, J. (1960). Limiting distributions in simple random sampling from a finite population. Publ. Math. Inst. Hungarian Acad. Sci. 5 361--374.
Hájek, J. (1964). Asymptotic theory of rejective sampling with varying probabilities from a finite population. Ann. Math. Statist. 35 1491--1523.
Hájek, J. (1981) (assembled after his death by Václav Dupac). Sampling from a Finite Population. Dekker, New York.
Hartley, H. O. and Sielken, R. L. (1975). A ``superpopulation viewpoint'' for finite population sampling. Biometrics 31 411--422.
Isaki, C. and Fuller, W. (1982). Survey design under the regression superpopulation model. J. Amer. Statist. Assoc. 77 89--96.
Kish, L. (1995). Survey Sampling. Wiley, New York.
Korn, E. L. and Graubard, B. I. (1998). Variance estimation for super-population parameters. Statist. Sinica 8 1131--1151.
Krewski, D. and Rao, J. N. K. (1981). Inference from stratified samples: Properties of linearization, jackknife and balanced repeated replication methods. Ann. Statist. 9 1010--1019.
Molina, E. A., Smith, T. M. F. and Sugden, R. A. (2001). Modelling overdispersion for complex survey data. Internat. Statist. Rev. 69 373--384.
Pfeffermann, D. and Sverchkov, M. (1999). Parametric and semiparametric estimation of regression models fitted to survey data. Sankhyā Ser. B 61 166--186.
Rodríguez, J. E. (2001). A probabilistic framework for inference in finite population sampling. Proc. Section on Survey Research Methods of the American Statistical Association, Aug. 5--9, 2001. Available at www.amstat.org/sections/srms/proceedings/y2001/proceed/00468.pdf.
Rosén, B. (1972). Asymptotic theory for successive sampling with varying probabilities without replacement. I, II. Ann. Math. Statist. 43 373--397, 748--776.
Rosén, B. (1997). On sampling with probability proportional to size. J. Statist. Plann. Inference 62 159--191.
Rubin-Bleuer, S. (1998). Inference for parameters of the super-population, Part I. Research sabbatical report, internal report, Statistics Canada.
Rubin-Bleuer, S. (2000). Some issues in the analysis of complex survey data. Statistics Canada Series, Methodology Branch, Business Survey Methods Division, BSMD-20-001E.
Rubin-Bleuer, S. (2001). A test for survival distributions using data from a complex sample. Proc. Survey Methods Section, SSC Annual Meeting, 2001 103--110.
Rubin-Bleuer, S. (2003). An approximation of the partial likelihood score in a joint design-model space. Proc. Survey Methods Section, SSC Annual Meeting, 2003 CDROM 37--46.
Rubin-Bleuer, S. and Schiopu Kratina, I. (2000). Some issues in the analysis of complex survey data. Proc. Section on Survey Research Methods of the American Statistical Association 734--739. Available at www.amstat.org/sections/srms/proceedings/papers/2000_124.pdf.
Rubin-Bleuer, S. and Schiopu Kratina, I. (2001). A probabilistic set-up for model and design-based inference. Proc. Section on Survey Research Methods of the American Statistical Association, Aug. 5--9, 2001. Available at www.amstat.org/sections/srms/proceedings/y2001/proceed/00469.pdf.
Rubin-Bleuer, S. and Schiopu Kratina, I. (2002). On the two-phase framework for joint model and design-based inference. Technical Report Series, Laboratory for Research in Probability and Statistics, No. 382, Carleton Univ.--Univ. Ottawa.
Särndal, C.-E., Swensson, B. and Wretman, J. (1992). Model Assisted Survey Sampling. Springer, New York.
Scott, A. J. (1977). On the problem of randomization in survey sampling. Sankhyā Ser. C 39 1--9.
Yuan, K.-H. and Jennrich, R. (1998). Asymptotics of estimating equations under natural conditions. J. Multivariate Anal. 65 245--260.
Yung, W. and Rao, J. N. K. (2000). Jackknife variance estimation under imputation for estimators using poststratification information. J. Amer. Statist. Assoc. 95 903--915.