Open Access
2017 Variable selection for partially linear models via learning gradients
Lei Yang, Yixin Fang, Junhui Wang, Yongzhao Shao
Electron. J. Statist. 11(2): 2907-2930 (2017). DOI: 10.1214/17-EJS1300

Abstract

Partially linear models (PLMs) are important generalizations of linear models and are very useful for analyzing high-dimensional data. Compared to linear models, the PLMs possess desirable flexibility of non-parametric regression models because they have both linear and non-linear components. Variable selection for PLMs plays an important role in practical applications and has been extensively studied with respect to the linear component. However, for the non-linear component, variable selection has been well developed only for PLMs with extra structural assumptions such as additive PLMs and generalized additive PLMs. There is currently an unmet need for variable selection methods applicable to general PLMs without structural assumptions on the non-linear component. In this paper, we propose a new variable selection method based on learning gradients for general PLMs without any assumption on the structure of the non-linear component. The proposed method utilizes the reproducing-kernel-Hilbert-space tool to learn the gradients and the group-lasso penalty to select variables. In addition, a block-coordinate descent algorithm is suggested and some theoretical properties are established including selection consistency and estimation consistency. The performance of the proposed method is further evaluated via simulation studies and illustrated using real data.

Citation

Download Citation

Lei Yang. Yixin Fang. Junhui Wang. Yongzhao Shao. "Variable selection for partially linear models via learning gradients." Electron. J. Statist. 11 (2) 2907 - 2930, 2017. https://doi.org/10.1214/17-EJS1300

Information

Received: 1 August 2016; Published: 2017
First available in Project Euclid: 8 August 2017

zbMATH: 1379.62034
MathSciNet: MR3694572
Digital Object Identifier: 10.1214/17-EJS1300

Keywords: gradient learning , group lasso , High-dimensional data , PLM , ‎reproducing kernel Hilbert ‎space , Variable selection

Vol.11 • No. 2 • 2017
Back to Top