The Annals of Statistics
- Ann. Statist.
- Volume 46, Number 3 (2018), 958-988.
Test for high-dimensional regression coefficients using refitted cross-validation variance estimation
Testing a hypothesis for high-dimensional regression coefficients is of fundamental importance in the statistical theory and applications. In this paper, we develop a new test for the overall significance of coefficients in high-dimensional linear regression models based on an estimated U-statistics of order two. With the aid of the martingale central limit theorem, we prove that the asymptotic distributions of the proposed test are normal under two different distribution assumptions. Refitted cross-validation (RCV) variance estimation is utilized to avoid the overestimation of the variance and enhance the empirical power. We examine the finite-sample performances of the proposed test via Monte Carlo simulations, which show that the new test based on the RCV estimator achieves higher powers, especially for the sparse cases. We also demonstrate an application by an empirical analysis of a microarray data set on Yorkshire gilts.
Ann. Statist., Volume 46, Number 3 (2018), 958-988.
Received: February 2016
Revised: April 2017
First available in Project Euclid: 3 May 2018
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Cui, Hengjian; Guo, Wenwen; Zhong, Wei. Test for high-dimensional regression coefficients using refitted cross-validation variance estimation. Ann. Statist. 46 (2018), no. 3, 958--988. doi:10.1214/17-AOS1573. https://projecteuclid.org/euclid.aos/1525313072
- Supplement to “Test for high-dimensional regression coefficients using refitted cross-validation variance estimation”. This supplemental article contains the proof of Theorem 3.2 and additional figures of empirical powers of different tests.