Open Access
VOL. 7 | 2010 MCD-RoSIS – A robust procedure for variable selection
Charlotte Guddat, Ursula Gather, Sonja Kuhnt

Editor(s) J. Antoch, M. Hušková, P.K. Sen

Inst. Math. Stat. (IMS) Collect., 2010: 75-83 (2010) DOI: 10.1214/10-IMSCOLL708

Abstract

Consider the task of estimating a regression function for describing the relationship between a response and a vector of p predictors. Often only a small subset of all given candidate predictors actually effects the response, while the rest might inhibit the analysis. Procedures for variable selection aim to identify the true predictors. A method for variable selection when the dimension p of the regressor space is much larger than the sample size n is Sure Independence Screening (SIS). The number of predictors is to be reduced to a value less than the number of observations before conducting the regression analysis. As SIS is based on nonrobust estimators, outliers in the data might lead to the elimination of true predictors. Hence, a robustified version of SIS called RoSIS was proposed which is based on robust estimators. Here, we give a modification of RoSIS by using the MCD estimator in the new algorithm. The new procedure MCD-RoSIS leads to better results, especially under collinearity. In a simulation study we compare the performance of SIS, RoSIS and MCD-RoSIS w.r.t. their robustness against different types of data contamination as well as different degrees of collinearity.

Information

Published: 1 January 2010
First available in Project Euclid: 29 November 2010

MathSciNet: MR2808368

Digital Object Identifier: 10.1214/10-IMSCOLL708

Subjects:
Primary: 62G35 , 62J99

Keywords: Dimension reduction , Outliers , regression , robust estimation , Variable selections

Rights: Copyright © 2010, Institute of Mathematical Statistics

Back to Top