Abstract
We consider the problem of both prediction and model selection in high dimensional generalized linear models. Predictive performance can be improved by leveraging structure information among predictors. In this paper, a graphic model-based doubly sparse regularized estimator is discussed under the high dimensional generalized linear models, that utilizes the graph structure among the predictors. The graphic information among predictors is incorporated node-by-node using a decomposed representation and the sparsity is encouraged both within and between the decomposed components. We propose an efficient iterative proximal algorithm to solve the optimization problem. Statistical convergence rates and selection consistency for the doubly sparse regularized estimator are established in the ultra-high dimensional setting. Specifically, we allow the dimensionality grows exponentially with the sample size. We compare the estimator with existing methods through numerical analysis on both simulation study and a microbiome data analysis.
Funding Statement
The work of Wei Xu was funded by Natural Sciences and Engineering Research Council of Canada (NSERC Grant RGPIN-2017-06672), Crohn’s and Colitis Canada (CCC Grant CCC-GEMIII), and Helmsley Charitable Trust. The work of Xin Gao was supported by the Natural Sciences and Engineering Research Council of Canada.
Acknowledgments
The authors are grateful to the referees, the associate editor and the editor for their insightful comments and suggestions.
Citation
Yaguang Li. Wei Xu. Xin Gao. "Graphical-model based high dimensional generalized linear models." Electron. J. Statist. 15 (1) 1993 - 2028, 2021. https://doi.org/10.1214/21-EJS1831
Information