Open Access
2021 Uncertainty quantification for principal component regression
Suofei Wu, Jan Hannig, Thomas C. M. Lee
Author Affiliations +
Electron. J. Statist. 15(1): 2157-2178 (2021). DOI: 10.1214/21-EJS1837

Abstract

Principal component regression is an effective dimension reduction method for regression problems. To apply it in practice, one typically starts by selecting the number of principal components k, then estimates the corresponding regression parameters using say maximum likelihood, and finally obtains predictions with the fitted results. The success of this approach highly depends on the choice of k, and very often, due to the noisy nature of the data, it could be risky to just use one single value of k. Using the generalized fiducial inference framework, this paper develops a method for constructing a probability function on k, which provides an uncertainty measure on its value. In addition, this paper also constructs novel confidence intervals for the regression parameters and prediction intervals for future observations. The proposed methodology is backed up by theoretical results and is tested by simulation experiments and compared with other methods using real data. To the best of our knowledge, this is the first time that a full treatment for uncertainty quantification is formally considered for principal component regression.

Funding Statement

This work was partially supported by the National Science Foundation under grants DMS-1512893, DMS-1512945, IIS-1633074, DMS-1811405, DMS-1811661, DMS-1916115. DMS-1916125 and CCF-1934568.

Acknowledgments

The authors are most grateful to the reviewers for their most constructive and helpful comments, which led to a much improved version of the paper.

Citation

Download Citation

Suofei Wu. Jan Hannig. Thomas C. M. Lee. "Uncertainty quantification for principal component regression." Electron. J. Statist. 15 (1) 2157 - 2178, 2021. https://doi.org/10.1214/21-EJS1837

Information

Received: 1 September 2019; Published: 2021
First available in Project Euclid: 7 April 2021

Digital Object Identifier: 10.1214/21-EJS1837

Keywords: confidence intervals , Fiducial inference , High-dimensional data , model dimension selection

Vol.15 • No. 1 • 2021
Back to Top