Open Access
Robust PCA and pairs of projections in a Hilbert space
Ilaria Giulini
Electron. J. Statist. 11(2): 3903-3926 (2017). DOI: 10.1214/17-EJS1343

Abstract

This is a study of principal component analysis performed on a statistical sample. We assume that the sample consists of independent copies of a random variable taking values in a separable real Hilbert space. This covers data in function spaces as well as data represented in reproducing kernel Hilbert spaces. Based on some new inequalities about the perturbation of nonnegative self-adjoint operators, we provide new bounds for the statistical fluctuations of the principal component representation as the statistical sample varies.

We suggest two kinds of improvements to decrease these fluctuations: the first is to use a robust estimate of the covariance operator, for which non-asymptotic bounds of the estimation error are available under weak polynomial moment assumptions. The second improvement is to use some modification of the projection on the principal components based on functional calculus applied to the covariance operator. Using this modified projection, we can obtain bounds that do not depend on the spectral gap but on some more favorable factor.
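As a rough finite-dimensional illustration of the second improvement, the sketch below contrasts the usual hard-threshold spectral projection with a smoothed version obtained by applying a piecewise-linear function to the eigenvalues of a covariance matrix. The function names, the thresholds `lo` and `hi`, and the particular piecewise-linear shape are assumptions made for this example and are not taken from the paper. The plain empirical second-moment matrix also stands in for the robust estimator, which is not implemented here.

```python
import numpy as np


def hard_projection(cov, threshold):
    """Orthogonal projection onto eigenvectors with eigenvalue >= threshold."""
    eigvals, eigvecs = np.linalg.eigh(cov)  # symmetric eigendecomposition
    weights = (eigvals >= threshold).astype(float)  # indicator function
    return (eigvecs * weights) @ eigvecs.T


def smoothed_projection(cov, lo, hi):
    """Apply f(cov), where f is 0 below lo, 1 above hi, and linear in between.

    Hypothetical smoothing choice: any Lipschitz f with values in [0, 1]
    could be used in the same way.
    """
    eigvals, eigvecs = np.linalg.eigh(cov)
    weights = np.clip((eigvals - lo) / (hi - lo), 0.0, 1.0)
    return (eigvecs * weights) @ eigvecs.T


# Synthetic data with decaying scales along the coordinate axes.
rng = np.random.default_rng(0)
scales = np.array([3.0, 2.0, 1.0, 0.5, 0.1])
X = rng.standard_normal((200, 5)) * scales

# Plain empirical (uncentered) second-moment matrix; NOT a robust estimator.
cov = X.T @ X / X.shape[0]

P_hard = hard_projection(cov, threshold=1.0)
P_soft = smoothed_projection(cov, lo=0.5, hi=1.5)
```

The hard projection changes abruptly when an estimated eigenvalue crosses the threshold, whereas the smoothed operator varies continuously with the eigenvalues. This is one informal way to see why a bound for the smoothed version need not involve the spectral gap.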

In the appendix, we provide a new approach to the analysis of the relative positions of two orthogonal projections that is useful for our proofs and is of independent interest.

Citation

Ilaria Giulini. "Robust PCA and pairs of projections in a Hilbert space." Electron. J. Statist. 11 (2) 3903 - 3926, 2017. https://doi.org/10.1214/17-EJS1343

Information

Received: 1 May 2016; Published: 2017
First available in Project Euclid: 18 October 2017

zbMATH: 1384.62185
MathSciNet: MR3714302
Digital Object Identifier: 10.1214/17-EJS1343

Subjects:
Primary: 62G05, 62G35, 62H25

Keywords: PAC-Bayesian learning, perturbation of self-adjoint operators, Principal angles, Principal Component Analysis, robust estimation, Spectral projectors

Vol.11 • No. 2 • 2017