Abstract
Methods are proposed to construct empirical measures when there are missing terms among the components of a random vector. Furthermore, Vapnik-Chevonenkis type exponential bounds are obtained on the uniform deviations of these estimators, from the true probabilities. These results can then be used to deal with classical problems such as statistical classification, via empirical risk minimization, when there are missing covariates among the data. Another application involves the uniform estimation of a distribution function.
Citation
Shojaeddin Chenouri. Majid Mojirsheibani. Zahra Montazeri. "Empirical measures for incomplete data with applications." Electron. J. Statist. 3 1021 - 1038, 2009. https://doi.org/10.1214/09-EJS420
Information