On Detecting Fake Coin Flip Sequences
Michael A. Kouritzin, Fraser Newton, Sterling Orsten, Daniel C. Wilson
Abstract
Classification of data as true or fabricated has applications in fraud detection and verification of data samples. In this paper, we apply nonlinear filtering to a simplified fraud-detection problem: classifying coin flip sequences as either real or faked. On the way, we propose a method for generating Bernoulli variables with given marginal probabilities and pair-wise covariances. Finally, we present the empirical performance of the classification algorithm.
First Page:
Show
Hide
Full-text: Open access
Links and Identifiers
Permanent link to this document: http://projecteuclid.org/euclid.imsc/1233152938
Digital Object Identifier: doi:10.1214/074921708000000336
References
[1] Csörgő, M. and Révész, P. (1981). Strong Approximations in Probability and Statistics. Academic Press.
Mathematical Reviews (MathSciNet):
MR666546
[2] Emrich, L. J. and Piedmonte, M. R. (1991). A method for generating high dimensional multivariate binary variates The American Statistician 45 (4) 302–304.
[3] Gange, S. J. (1995). Generating multivariate categorical variates using the iterative proportional fitting algorithm. The American Statistician 49 (2).
[4] Hill, T. P. (1999). The difficulty of faking data. Chance Magazine 12 (3) 27–31.
[5] Lee, A. J. (1993). Generating random binary deviates having fixed marginal distributions and specified degrees of association. The American Statistician 47 (3).
[6] Park, C. G., Park, T. and Shin, D. W. (1996). A simple method for generating correlated binary variates. The American Statistician 50 (4).
Mathematical Reviews (MathSciNet):
MR1435310
Digital Object Identifier: doi:10.2307/2684925
JSTOR: links.jstor.org
[7] Prentice, R. L. (1998). Correlated binary regression with covariates specific to each binary observation. Biometrics 44 (4) 1033–1048.
Mathematical Reviews (MathSciNet):
MR980998
Digital Object Identifier: doi:10.2307/2531733
JSTOR: links.jstor.org
Institute of Mathematical Statistics Collections