Open Access
February 2023 Double-Estimation-Friendly Inference for High-Dimensional Misspecified Models
Rajen D. Shah, Peter Bühlmann
Author Affiliations +
Statist. Sci. 38(1): 68-91 (February 2023). DOI: 10.1214/22-STS850

Abstract

All models may be wrong—but that is not necessarily a problem for inference. Consider the standard t-test for the significance of a variable X for predicting response Y while controlling for p other covariates Z in a random design linear model. This yields correct asymptotic type I error control for the null hypothesis that X is conditionally independent of Y given Z under an arbitrary regression model of Y on (X,Z), provided that a linear regression model for X on Z holds. An analogous robustness to misspecification, which we term the “double-estimation-friendly” (DEF) property, also holds for Wald tests in generalised linear models, with some small modifications.

In this expository paper, we explore this phenomenon, and propose methodology for high-dimensional regression settings that respects the DEF property. We advocate specifying (sparse) generalised linear regression models for both Y and the covariate of interest X; our framework gives valid inference for the conditional independence null if either of these hold. In the special case where both specifications are linear, our proposal amounts to a small modification of the popular debiased Lasso test. We also investigate constructing confidence intervals for the regression coefficient of X via inverting our tests; these have coverage guarantees even in partially linear models where the contribution of Z to Y can be arbitrary. Numerical experiments demonstrate the effectiveness of the methodology.

Funding Statement

The first author was supported by an EPSRC Programme Grant EP/N031938/1 and an EPSRC First Grant EP/R013381/1.
The second author was supported by the European Research Council under the Grant Agreement No 786461 (CausalStats—ERC-2017-ADG).

Acknowledgments

The authors would like to thank Nicolai Meinshausen for many helpful conversations, and also coining the term “double estimation friendly”.

Citation

Download Citation

Rajen D. Shah. Peter Bühlmann. "Double-Estimation-Friendly Inference for High-Dimensional Misspecified Models." Statist. Sci. 38 (1) 68 - 91, February 2023. https://doi.org/10.1214/22-STS850

Information

Published: February 2023
First available in Project Euclid: 28 October 2022

MathSciNet: MR4535395
zbMATH: 07654778
Digital Object Identifier: 10.1214/22-STS850

Keywords: Conditional independence , debiased Lasso , double robustness , generalised linear models , high-dimensional inference

Rights: Copyright © 2023 Institute of Mathematical Statistics

Vol.38 • No. 1 • February 2023
Back to Top