Semiparametric zero-inflated modeling in multi-ethnic study of atherosclerosis (MESA)

Hai Liu; Shuangge Ma; Richard Kronmal; Kung-Sik Chan

doi:10.1214/11-AOAS534


Ann. Appl. Stat. 6(3): 1236-1255 (September 2012). DOI: 10.1214/11-AOAS534

Abstract

We analyze the Agatston score of coronary artery calcium (CAC) from the Multi-Ethnic Study of Atherosclerosis (MESA) using a semiparametric zero-inflated modeling approach. The observed CAC scores from this cohort consist of a high frequency of zeros together with continuously distributed positive values. Both partially constrained and unconstrained models are considered to investigate the underlying biological processes of CAC development from zero to positive, and from small amounts to large amounts. Unlike existing studies, we adopt a model selection procedure based on likelihood cross-validation to identify the optimal model, a choice justified by comparative Monte Carlo studies. A shrinkage version of the cubic regression spline is used for simultaneous model estimation and variable selection. Applying the proposed methods to the MESA data, we show that the two biological mechanisms, one influencing the initiation of CAC and the other the magnitude of CAC when it is positive, are better characterized by an unconstrained zero-inflated normal model. Our results differ significantly from those in published studies and may provide further insights into the biological mechanisms underlying CAC development in humans. This highly flexible statistical framework can be applied to zero-inflated data analyses in other areas.
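As a rough illustration of the two-part structure described in the abstract, the sketch below evaluates a zero-inflated normal log-likelihood in Python. It is not the authors' implementation. The function name `zin_loglik`, the choice to model the logarithm of positive scores as normal, and the use of a single common standard deviation are all assumptions made for this example; the abstract does not specify the transform or the parameterization. The density is written on the log scale, so the Jacobian term is omitted.

```python
import math

def zin_loglik(y, p_pos, mu, sigma):
    """Log-likelihood of a hypothetical zero-inflated normal model.

    y     : observed scores (zero or positive)
    p_pos : per-subject probability that the score is positive
    mu    : per-subject mean of log(score), given a positive score
    sigma : common standard deviation of log(score) (an assumption)
    """
    total = 0.0
    for yi, pi, mi in zip(y, p_pos, mu):
        if yi == 0:
            # Zero part: the score has not left zero.
            total += math.log(1.0 - pi)
        else:
            # Positive part: normal density of log(score), weighted by
            # the probability of being positive.
            z = (math.log(yi) - mi) / sigma
            total += (math.log(pi) - math.log(sigma)
                      - 0.5 * math.log(2.0 * math.pi) - 0.5 * z * z)
    return total
```

In a likelihood cross-validation scheme, a function of this kind would be evaluated on held-out subjects using parameters fitted on the remaining data, and candidate models would be compared by their summed held-out log-likelihoods.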

Citation


Hai Liu, Shuangge Ma, Richard Kronmal, Kung-Sik Chan. "Semiparametric zero-inflated modeling in multi-ethnic study of atherosclerosis (MESA)." Ann. Appl. Stat. 6 (3) 1236-1255, September 2012. https://doi.org/10.1214/11-AOAS534

Information

Published: September 2012

First available in Project Euclid: 31 August 2012

zbMATH: 1248.62215

MathSciNet: MR3012528

Digital Object Identifier: 10.1214/11-AOAS534

Keywords: cardiovascular disease, coronary artery calcium, likelihood cross-validation, model selection, penalized spline, proportional constraint, shrinkage


JOURNAL ARTICLE
20 PAGES
