Open Access
2023 Sufficient variable screening with high-dimensional controls
Chenlu Ke
Author Affiliations +
Electron. J. Statist. 17(2): 2139-2179 (2023). DOI: 10.1214/23-EJS2150

Abstract

Variable screening for ultrahigh-dimensional data has attracted extensive attention in the past decade. In many applications, researchers learn from previous studies about certain important predictors or control variables related to the response of interest. Such knowledge should be taken into account in the screening procedure. The development of variable screening conditional on prior information, however, has been less fruitful, compared to the vast literature for generic unconditional screening. In this paper, we propose a model-free variable screening paradigm that allows for high-dimensional controls and applies to either continuous or categorical responses. The contribution of each individual predictor is quantified marginally and conditionally in the presence of the control variables as well as the other candidates by reproducing-kernel-based R2 and partial R2 statistics. As a result, the proposed method enjoys the sure screening property and the rank consistency property in the notion of sufficiency, with which its superiority over existing methods is well-established. The advantages of the proposed method are demonstrated by simulation studies encompassing a variety of regression and classification models, and an application to high-throughput gene expression data.

Acknowledgments

We sincerely thank the editor, the associate editor and two anonymous referees for their constructive comments, which led to a significant improvement of this article.

Citation

Download Citation

Chenlu Ke. "Sufficient variable screening with high-dimensional controls." Electron. J. Statist. 17 (2) 2139 - 2179, 2023. https://doi.org/10.1214/23-EJS2150

Information

Received: 1 August 2022; Published: 2023
First available in Project Euclid: 2 October 2023

MathSciNet: MR4649037
Digital Object Identifier: 10.1214/23-EJS2150

Subjects:
Primary: 62B10 , 62B86
Secondary: 62G05 , 62G08 , 62H30

Keywords: Conditional independence , rank consistency , ‎reproducing kernel Hilbert ‎space , sure screening

Vol.17 • No. 2 • 2023
Back to Top