The Annals of Statistics
- Ann. Statist.
- Volume 45, Number 5 (2017), 1920-1950.
Detection and feature selection in sparse mixture models
We consider Gaussian mixture models in high dimensions, focusing on the twin tasks of detection and feature selection. Under sparsity assumptions on the difference in means, we derive minimax rates for the problems of testing and of variable selection. We find these rates to depend crucially on the knowledge of the covariance matrices and on whether the mixture is symmetric or not. We establish the performance of various procedures, including the top sparse eigenvalue of the sample covariance matrix (popular in the context of Sparse PCA), as well as new tests inspired by the normality tests of Malkovich and Afifi [J. Amer. Statist. Assoc. 68 (1973) 176–179].
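As a rough illustration of the "top sparse eigenvalue" statistic named in the abstract, the sketch below takes the largest eigenvalue over all k-by-k principal submatrices of the sample covariance matrix, which is the standard definition of the k-sparse top eigenvalue. It is not the authors' procedure: the function name `top_sparse_eigenvalue`, the brute-force search (feasible only for small dimensions), the simulated symmetric mixture, and all parameter values are assumptions made for illustration, and no test threshold is calibrated here.

```python
from itertools import combinations

import numpy as np


def top_sparse_eigenvalue(X, k):
    """Return the largest eigenvalue over all k-by-k principal submatrices
    of the sample covariance of X, together with the maximizing support.

    Brute force over all size-k subsets, so only usable for small p.
    """
    S = np.cov(X, rowvar=False)  # p-by-p sample covariance (rows are samples)
    p = S.shape[0]
    best_val, best_support = -np.inf, None
    for support in combinations(range(p), k):
        idx = np.array(support)
        sub = S[np.ix_(idx, idx)]          # principal submatrix on this support
        val = np.linalg.eigvalsh(sub)[-1]  # eigvalsh sorts ascending
        if val > best_val:
            best_val, best_support = val, support
    return best_val, best_support


# Hypothetical data: a symmetric two-component mixture with identity
# covariance whose means differ only on the first k coordinates.
rng = np.random.default_rng(0)
n, p, k = 400, 8, 2
delta = np.zeros(p)
delta[:k] = 1.5
signs = rng.choice([-1.0, 1.0], size=n)  # latent component labels
X = rng.standard_normal((n, p)) + signs[:, None] * delta

stat, support = top_sparse_eigenvalue(X, k)
print(stat, support)
```

In this simulated setting the population covariance is the identity plus a rank-one term supported on the coordinates where the means differ, so a large value of the statistic suggests a mixture and the maximizing support is a natural candidate for the selected features.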
Received: May 2014
Revised: December 2015
First available in Project Euclid: 31 October 2017
Verzelen, Nicolas; Arias-Castro, Ery. Detection and feature selection in sparse mixture models. Ann. Statist. 45 (2017), no. 5, 1920–1950. doi:10.1214/16-AOS1513. https://projecteuclid.org/euclid.aos/1509436823
- Supplement to “Detection and feature selection in sparse mixture models”. This supplement contains the proofs of the results.