Open Access
2017 Random consensus robust PCA
Daniel Pimentel-Alarcón, Robert Nowak
Electron. J. Statist. 11(2): 5232-5253 (2017). DOI: 10.1214/17-EJS1377SI

Abstract

This paper presents R2PCA, a random consensus method for robust principal component analysis. R2PCA takes RANSAC’s principle of using as little data as possible one step further. It iteratively selects small subsets of the data to identify pieces of the principal components, to then stitch them together. We show that if the principal components are in general position and the errors are sufficiently sparse, R2PCA will exactly recover the principal components with probability $1$, in lieu of assumptions on coherence or the distribution of the sparse errors, and even under adversarial settings. R2PCA enjoys many advantages: it works well under noise, its computational complexity scales linearly in the ambient dimension, it is easily parallelizable, and due to its low sample complexity, it can be used in settings where data is so large it cannot even be stored in memory. We complement our theoretical findings with synthetic and real data experiments showing that R2PCA outperforms state-of-the-art methods in a broad range of settings.

Citation

Download Citation

Daniel Pimentel-Alarcón. Robert Nowak. "Random consensus robust PCA." Electron. J. Statist. 11 (2) 5232 - 5253, 2017. https://doi.org/10.1214/17-EJS1377SI

Information

Received: 1 June 2017; Published: 2017
First available in Project Euclid: 15 December 2017

zbMATH: 1384.62188
MathSciNet: MR3738210
Digital Object Identifier: 10.1214/17-EJS1377SI

Vol.11 • No. 2 • 2017
Back to Top