The Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 4, Number 4 (2010), 2150-2180.
Sparse modeling of categorial explanatory variables
Shrinking methods in regression analysis are usually designed for metric predictors. In this article, however, shrinkage methods for categorial predictors are proposed. As an application we consider data from the Munich rent standard, where, for example, urban districts are treated as a categorial predictor. If independent variables are categorial, some modifications to usual shrinking procedures are necessary. Two L1-penalty based methods for factor selection and clustering of categories are presented and investigated. The first approach is designed for nominal scale levels, the second one for ordinal predictors. Besides applying them to the Munich rent standard, methods are illustrated and compared in simulation studies.
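As a rough illustration of the kind of L1 penalty the abstract describes, the sketch below computes two unweighted fusion penalties on the dummy coefficients of a single categorial predictor. It rests on assumptions not stated in the abstract: the reference category is fixed at zero, the nominal penalty sums absolute differences over all pairs of categories, and the ordinal penalty sums absolute differences of adjacent categories only. The function names are hypothetical, and the article's exact formulation (for example, any weighting of the terms) may differ.

```python
from itertools import combinations


def nominal_fusion_penalty(coefs):
    """Sum of |b_r - b_s| over all pairs of categories (assumed nominal case).

    `coefs` holds the coefficients of the non-reference categories;
    the reference category is assumed to have coefficient 0.
    """
    full = [0.0] + list(coefs)
    return sum(abs(a - b) for a, b in combinations(full, 2))


def ordinal_fusion_penalty(coefs):
    """Sum of |b_r - b_(r-1)| over adjacent categories (assumed ordinal case).

    The categories are assumed to be listed in their natural order,
    starting after a reference category with coefficient 0.
    """
    full = [0.0] + list(coefs)
    return sum(abs(full[i] - full[i - 1]) for i in range(1, len(full)))


# Categories with equal coefficients contribute nothing to each other,
# which is what encourages clustering of categories; a predictor whose
# coefficients are all zero has zero penalty, which corresponds to
# dropping the whole factor.
example = [0.0, 2.0, 2.0]
print(nominal_fusion_penalty(example))
print(ordinal_fusion_penalty(example))
```

In a penalized regression, such a term would be multiplied by a tuning parameter and added to the loss, so that larger tuning values fuse more categories together.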
First available in Project Euclid: 4 January 2011
Gertheiss, Jan; Tutz, Gerhard. Sparse modeling of categorial explanatory variables. Ann. Appl. Stat. 4 (2010), no. 4, 2150--2180. doi:10.1214/10-AOAS355. https://projecteuclid.org/euclid.aoas/1294167814