The Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 5, Number 2B (2011), 1456-1487.
A new multivariate measurement error model with zero-inflated dietary data, and its application to dietary assessment
In the United States the preferred method of obtaining dietary intake data is the 24-hour dietary recall, yet the measure of most interest is usual or long-term average daily intake, which is impossible to measure. Thus, usual dietary intake is assessed with considerable measurement error. Also, diet represents numerous foods, nutrients and other components, each of which have distinctive attributes. Sometimes, it is useful to examine intake of these components separately, but increasingly nutritionists are interested in exploring them collectively to capture overall dietary patterns. Consumption of these components varies widely: some are consumed daily by almost everyone on every day, while others are episodically consumed so that 24-hour recall data are zero-inflated. In addition, they are often correlated with each other. Finally, it is often preferable to analyze the amount of a dietary component relative to the amount of energy (calories) in a diet because dietary recommendations often vary with energy level. The quest to understand overall dietary patterns of usual intake has to this point reached a standstill. There are no statistical methods or models available to model such complex multivariate data with its measurement error and zero inflation. This paper proposes the first such model, and it proposes the first workable solution to fit such a model. After describing the model, we use survey-weighted MCMC computations to fit the model, with uncertainty estimation coming from balanced repeated replication. The methodology is illustrated through an application to estimating the population distribution of the Healthy Eating Index-2005 (HEI-2005), a multi-component dietary quality index involving ratios of interrelated dietary components to energy, among children aged 2–8 in the United States. We pose a number of interesting questions about the HEI-2005 and provide answers that were not previously within the realm of possibility, and we indicate ways that our approach can be used to answer other questions of importance to nutritional science and public health.
Ann. Appl. Stat., Volume 5, Number 2B (2011), 1456-1487.
First available in Project Euclid: 13 July 2011
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Zhang, Saijuan; Midthune, Douglas; Guenther, Patricia M.; Krebs-Smith, Susan M.; Kipnis, Victor; Dodd, Kevin W.; Buckman, Dennis W.; Tooze, Janet A.; Freedman, Laurence; Carroll, Raymond J. A new multivariate measurement error model with zero-inflated dietary data, and its application to dietary assessment. Ann. Appl. Stat. 5 (2011), no. 2B, 1456--1487. doi:10.1214/10-AOAS446. https://projecteuclid.org/euclid.aoas/1310562729
- Supplementary material A: Additional tables.
- Supplementary material B: Data files of the NHANES data used in the analysis.
- Supplementary material C: Matlab programs for the data analysis.