The Annals of Statistics

Weighted polynomial models and weighted sampling schemes for finite population

Sean X. Chen
Source: Ann. Statist. Volume 26, Number 5 (1998), 1894-1915.

Abstract

This paper outlines a theoretical framework for finite population models with unequal sample probabilities, along with sampling schemes for drawing random samples from these models. We first present four exact weighted sampling schemes that can be used for any finite population model to satisfy such requirements as ordered/ unordered samples, with/without replacement, and fixed/nonfixed sample size. We then introduce a new class of finite population models called weighted polynomial models or, in short, WPM. The probability density of a WPM is defined through a symmetric polynomial of the weights of the units in the sample. The WPM is shown to have been applied in many statistical analyses including survey sampling, logistic regression, case-control studies, lottery, DNA sequence alignment and MCMC simulations. We provide general strategies that can help improve the efficiency of the exact weighted sampling schemes for any given WPM. We show that under a mild condition, sampling from any WPM can be implemented within polynomial time. A Metropolis-Hasting-type scheme is proposed for approximate weighted sampling when the exact sampling schemes become intractable for moderate population and sample sizes. We show that under a mild condition, the average acceptance rate of the approximate sampling scheme for any WPM can be expressed in closed form using only the inclusion probabilities.

First Page: Show Hide
Primary Subjects: 62D05, 62E15
Secondary Subjects: 62E25
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/1024691362
Mathematical Reviews number (MathSciNet): MR1673283
Digital Object Identifier: doi:10.1214/aos/1024691362
Zentralblatt MATH identifier: 0930.62006

References

CHEN, S. X. 1992. Metropolis algorithm and the nearly black object. Technical report, Dept. Statistics, Harvard Univ. Z.
CHEN, S. X., DEMPSTER, A. P. and LIU, J. S. 1994. Weighted finite population sampling to maximize entropy. Biometrika 81 457 469. Z.
Mathematical Reviews (MathSciNet): MR96c:62022
Zentralblatt MATH: 0816.62008
CHEN, S. X. and LIU, J. S. 1997. Statistical applications of the Poisson-Binomial and conditional Bernoulli distributions. Statist. Sinica 7 875 892. Z.
Mathematical Reviews (MathSciNet): MR1488647
Zentralblatt MATH: 01089160
HANIF, M. and BREWER, K. R. W. 1980. Sampling with unequal probabilities without replacement: a review. Internat. Statist. Rev. 48 317 335. Z.
Mathematical Reviews (MathSciNet): MR83b:62022
JOE, H. 1990. A winning strategy for lotto games? Canad. J. Statist. 18 233 244. Z.
Mathematical Reviews (MathSciNet): MR1079596
LAHIRI, D. B. 1951. A method for sample selection providing unbiased ratio estimates. Bull. Internat. Statist. Inst. 33 133 140. Z.
LIU, J. S., NEUWALD, A. F. and LAWRENCE C. E. 1995. Bayesian models for multiple local sequence alignment and Gibbs sampling strategies. J. Amer. Statist. Assoc. 90 1156 1170. Z.
SAMPFORD, M. R. 1967. On sampling without replacement with unequal probabilities of selection. Biometrika 54 499 513. Z.
Mathematical Reviews (MathSciNet): MR36:6100
SINGH, P. and SRIVASTAVA, A. K. 1980. Sampling schemes providing unbiased regression estimators. Biometrika 67 205 209. Z.
Mathematical Reviews (MathSciNet): MR82a:62027
Zentralblatt MATH: 0426.62009
SMITH, A. F. M. and ROBERTS, G. O. 1993. Bayesian computation via the Gibbs sampler and related Markov chain Monte Carlo methods. J. Roy. Statist. Soc. B 55 3 23. Z.
Mathematical Reviews (MathSciNet): MR94g:62056
Zentralblatt MATH: 0779.62030
STERN, H. and COVER, T. M. 1989. Maximum entropy and the lottery. J. Amer. Statist. Assoc. 84 980 985. Z.
Mathematical Reviews (MathSciNet): MR1134487
USPENSKY, J. V. 1948. Theory of Equations. McGraw-Hill, New York.
NEW YORK, NEW YORK 10012 E-MAIL: schen3@stern.ny u.edu

2013 © Institute of Mathematical Statistics

The Annals of Statistics

The Annals of Statistics

Turn MathJax Off
What is MathJax?