The Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 9, Number 1 (2015), 43-68.
Modeling county level breast cancer survival data using a covariate-adjusted frailty proportional hazards model
Understanding the factors that explain differences in survival times is an important issue for establishing policies to improve national health systems. Motivated by breast cancer data arising from the Surveillance Epidemiology and End Results program, we propose a covariate-adjusted proportional hazards frailty model for the analysis of clustered right-censored data. Rather than incorporating exchangeable frailties in the linear predictor of commonly-used survival models, we allow the frailty distribution to flexibly change with both continuous and categorical cluster-level covariates and model them using a dependent Bayesian nonparametric model. The resulting process is flexible and easy to fit using an existing R package. The application of the model to our motivating example showed that, contrary to intuition, those diagnosed during a period of time in the 1990s in more rural and less affluent Iowan counties survived breast cancer better. Additional analyses showed the opposite trend for earlier time windows. We conjecture that this anomaly has to be due to increased hormone replacement therapy treatments prescribed to more urban and affluent subpopulations.
Ann. Appl. Stat., Volume 9, Number 1 (2015), 43-68.
First available in Project Euclid: 28 April 2015
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Zhou, Haiming; Hanson, Timothy; Jara, Alejandro; Zhang, Jiajia. Modeling county level breast cancer survival data using a covariate-adjusted frailty proportional hazards model. Ann. Appl. Stat. 9 (2015), no. 1, 43--68. doi:10.1214/14-AOAS793. https://projecteuclid.org/euclid.aoas/1430226084
- Supplement to “Modeling county-level breast cancer survival data using a covariate-adjusted frailty proportional hazards model”.: In this online supplemental article we provide (A) technical details on the mixture of linear dependent tailfree processes, (B) a detailed description of the MCMC algorithm, (C) sample R code to analyze the SEER data, (D) additional simulation studies and (E) additional analysis of the SEER data.