Open Access
February 2023 Additive Bayesian Variable Selection under Censoring and Misspecification
David Rossell, Francisco Javier Rubio
Author Affiliations +
Statist. Sci. 38(1): 13-29 (February 2023). DOI: 10.1214/21-STS846

Abstract

We discuss the role of misspecification and censoring on Bayesian model selection in the contexts of right-censored survival and concave log-likelihood regression. Misspecification includes wrongly assuming the censoring mechanism to be noninformative. Emphasis is placed on additive accelerated failure time, Cox proportional hazards and probit models. We offer a theoretical treatment that includes local and nonlocal priors, and a general nonlinear effect decomposition to improve power-sparsity trade-offs. We discuss a fundamental question: what solution can one hope to obtain when (inevitably) models are misspecified, and how to interpret it? Asymptotically, covariates that do not have predictive power for neither the outcome nor (for survival data) censoring times, in the sense of reducing a likelihood-associated loss, are discarded. Misspecification and censoring have an asymptotically negligible effect on false positives, but their impact on power is exponential. We show that it can be advantageous to consider simple models that are computationally practical yet attain good power to detect potentially complex effects, including the use of finite-dimensional basis to detect truly nonparametric effects. We also discuss algorithms to capitalize on sufficient statistics and fast likelihood approximations for Gaussian-based survival and binary models.

Funding Statement

David Rossell was supported by Spanish Government grants RyC-2015-18544, Plan Estatal PGC2018-101643-B-I00, Europa Excelencia EUR2020-112096 by the AEI/10.13039/501100011033 and European Union “NextGenerationEU”/PRTR, Ayudas Fundación BBVA a Investigación en Big Data 2017, and NIH grant R01 CA158113-01.

Acknowledgments

The authors thank Natalia Bochkina for pointing out an error in our original proof of Proposition 1 and suggesting a remedy that led to Assumption A3.

David Rossell is also affiliated to the Barcelona School of Economics, Data Science Center, Barcelona, Spain

Citation

Download Citation

David Rossell. Francisco Javier Rubio. "Additive Bayesian Variable Selection under Censoring and Misspecification." Statist. Sci. 38 (1) 13 - 29, February 2023. https://doi.org/10.1214/21-STS846

Information

Published: February 2023
First available in Project Euclid: 28 October 2022

MathSciNet: MR4534642
zbMATH: 07654775
Digital Object Identifier: 10.1214/21-STS846

Keywords: additive regression , generalized additive model , misspecification , Model selection , survival

Rights: Copyright © 2023 Institute of Mathematical Statistics

Vol.38 • No. 1 • February 2023
Back to Top