The Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 5, Number 3 (2011), 1978-2002.
Incorporating biological information into linear models: A Bayesian approach to the selection of pathways and genes
The vast amount of biological knowledge accumulated over the years has allowed researchers to identify various biochemical interactions and define different families of pathways. There is an increased interest in identifying pathways and pathway elements involved in particular biological processes. Drug discovery efforts, for example, are focused on identifying biomarkers as well as pathways related to a disease. We propose a Bayesian model that addresses this question by incorporating information on pathways and gene networks in the analysis of DNA microarray data. Such information is used to define pathway summaries, specify prior distributions, and structure the MCMC moves to fit the model. We illustrate the method with an application to gene expression data with censored survival outcomes. In addition to identifying markers that would have been missed otherwise and improving prediction accuracy, the integration of existing biological knowledge into the analysis provides a better understanding of underlying molecular processes.
Ann. Appl. Stat., Volume 5, Number 3 (2011), 1978-2002.
First available in Project Euclid: 13 October 2011
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Stingo, Francesco C.; Chen, Yian A.; Tadesse, Mahlet G.; Vannucci, Marina. Incorporating biological information into linear models: A Bayesian approach to the selection of pathways and genes. Ann. Appl. Stat. 5 (2011), no. 3, 1978--2002. doi:10.1214/11-AOAS463. https://projecteuclid.org/euclid.aoas/1318514292
- Supplementary material: Supplement. Description of the MCMC steps for (θ, γ) and discussion on ergodicity of the Markov chain on the restricted space.