Open Access
December 2017
Bayesian Variable Selection Regression of Multivariate Responses for Group Data
B. Liquet, K. Mengersen, A. N. Pettitt, M. Sutton
Bayesian Anal. 12(4): 1039-1067 (December 2017). DOI: 10.1214/17-BA1081

Abstract

We propose two multivariate extensions of the Bayesian group lasso for variable selection and estimation for data with high-dimensional predictors and multi-dimensional response variables. The methods use spike-and-slab priors to yield solutions that are sparse at either a group level or both a group and individual feature level. The incorporation of group structure in a predictor matrix is a key factor in obtaining better estimators and identifying associations between multiple responses and predictors. The approach is suited to many biological studies where the response is multivariate and each predictor is embedded in some biological grouping structure such as gene pathways. Our Bayesian models are connected with penalized regression, and we prove both oracle and asymptotic distribution properties under an orthogonal design. We derive efficient Gibbs sampling algorithms for our models and provide the implementation in a comprehensive R package called MBSGS, available on the Comprehensive R Archive Network (CRAN). The performance of the proposed approaches is compared to state-of-the-art variable selection strategies on simulated data sets. The proposed methodology is illustrated on a genetic dataset in order to identify markers grouping across chromosomes that explain the joint variability of gene expression in multiple tissues.
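As a rough illustration of what group-level sparsity under a spike-and-slab prior means, the sketch below draws a coefficient matrix in which every row block (one block per predictor group) is either exactly zero or filled with Gaussian draws. It is a simplified toy, not the prior specified in the paper and not the MBSGS interface: the function name, the mixing weight `pi0`, the fixed slab standard deviation `slab_sd`, and the group sizes are all assumptions made for this example.

```python
import random


def sample_group_spike_slab(group_sizes, n_responses, pi0=0.5, slab_sd=1.0, seed=0):
    """Draw a (sum(group_sizes) x n_responses) coefficient matrix.

    Hypothetical helper for illustration only. Each group of predictor
    rows is set to zero with probability pi0 (the "spike"); otherwise
    every entry in the group is drawn from N(0, slab_sd^2) (the "slab").
    """
    rng = random.Random(seed)
    rows = []
    active = []
    for size in group_sizes:
        # One inclusion decision per group, shared by all its rows.
        is_active = rng.random() >= pi0
        active.append(is_active)
        for _ in range(size):
            if is_active:
                rows.append([rng.gauss(0.0, slab_sd) for _ in range(n_responses)])
            else:
                rows.append([0.0] * n_responses)
    return rows, active


# Three predictor groups of sizes 3, 2 and 4, and two response variables.
group_sizes = [3, 2, 4]
coef, active = sample_group_spike_slab(group_sizes, n_responses=2, pi0=0.5, seed=1)

for flag, size in zip(active, group_sizes):
    print("group of size", size, "selected" if flag else "excluded")
```

A prior that is sparse at both the group and the individual feature level would add a second inclusion indicator per row inside each selected group; that step is omitted here to keep the sketch short.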

Citation

B. Liquet, K. Mengersen, A. N. Pettitt, M. Sutton. "Bayesian Variable Selection Regression of Multivariate Responses for Group Data." Bayesian Anal. 12 (4) 1039-1067, December 2017. https://doi.org/10.1214/17-BA1081

Information

Published: December 2017
First available in Project Euclid: 26 October 2017

zbMATH: 1384.62259
MathSciNet: MR3724978
Digital Object Identifier: 10.1214/17-BA1081

Keywords: Bayesian variable selection, multivariate regression, sparsity, spike and slab
