Source: Electron. J. Statist.
Methods are proposed to construct empirical measures when there are missing terms among the components of a random vector. Furthermore, Vapnik-Chevonenkis type exponential bounds are obtained on the uniform deviations of these estimators, from the true probabilities. These results can then be used to deal with classical problems such as statistical classification, via empirical risk minimization, when there are missing covariates among the data. Another application involves the uniform estimation of a distribution function.
Bennett, G. (1962). Probability inequalities for the sum of independent random variables., J. Amer. Statist Assoc., 57:33–45.
Cheng, P. E. and Chu, C. K. (1996). Kernel estimation of distribution functions and quantiles with missing data., Statist. Sinica, 6:63–78.
Devroye, L. (1982). Bounds on the uniform deviation of empirical meaures., Journal of Multivariate Analysis, 12:72–79.
Mathematical Reviews (MathSciNet): MR650929
Devroye, L., Györfi, L., and Lugosi, G. (1996)., A Probabilistic Theory of Pattern Recognition. Springer-Verlag, New York.
Dudley, R. (1978). Central limit theorems for empirical measures., Ann. Probab., 6:899–929.
Mathematical Reviews (MathSciNet): MR512411
Little, R. J. A. and Rubin, D. B. (2002)., Statistical Analysis With Missing Data. Wiley, New York.
Massart, P. (1990). The tight constant in the Devoretzky-Kiefer-Wolfowitz inequality., Ann. Probab., 18:1269–1283.
Pollard, D. (1984)., Convergence of Stochastic Processes. Springer-Verlag, New York.
Mathematical Reviews (MathSciNet): MR762984
Talagrand, M. (1994). Sharper bounds for gaussian and empirical processes., Ann. Probab., 22:28–76.
van der Vaart, A. W. and Wellner, J. A. (1996)., Weak Convergence and Empirical Processes, with Applications to Statistics. Springer-Verlag, New York.
Vapnik, V. N. and Chervonenkis, A. Y. (1971). On the uniform convergence of relative frequencies of events to their probabilities., Theory Probab. Appl., 16:264–280.