The Annals of Statistics
- Ann. Statist.
- Volume 46, Number 2 (2018), 679-710.
Selective inference with a randomized response
Inspired by sample splitting and the reusable holdout introduced in the field of differential privacy, we consider selective inference with a randomized response. We discuss two major advantages of using a randomized response for model selection. First, the selectively valid tests are more powerful after randomized selection. Second, it allows consistent estimation and weak convergence of selective inference procedures. Under independent sampling, we prove a selective (or privatized) central limit theorem that transfers procedures valid under asymptotic normality without selection to their corresponding selective counterparts. This allows selective inference in nonparametric settings. Finally, we propose a framework of inference after combining multiple randomized selection procedures. We focus on the classical asymptotic setting, leaving the interesting high-dimensional asymptotic questions for future work.
Ann. Statist., Volume 46, Number 2 (2018), 679-710.
Received: March 2016
Revised: February 2017
First available in Project Euclid: 3 April 2018
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Tian, Xiaoying; Taylor, Jonathan. Selective inference with a randomized response. Ann. Statist. 46 (2018), no. 2, 679--710. doi:10.1214/17-AOS1564. https://projecteuclid.org/euclid.aos/1522742433
- Supplement to “Selective inference with a randomized response”. We provide additional sampling schemes, technical details for plugin variance estimators and proofs for all the theorems and lemmas in the supplementary materials.