The Annals of Applied Statistics

An estimating equations approach to fitting latent exposure models with longitudinal health outcomes

Brisa N. Sánchez, Esben Budtz-Jørgensen, and Louise M. Ryan

Source: Ann. Appl. Stat. Volume 3, Number 2 (2009), 830-856.

Abstract

The analysis of data arising from environmental health studies which collect a large number of measures of exposure can benefit from using latent variable models to summarize exposure information. However, difficulties with estimation of model parameters may arise since existing fitting procedures for linear latent variable models require correctly specified residual variance structures for unbiased estimation of regression parameters quantifying the association between (latent) exposure and health outcomes. We propose an estimating equations approach for latent exposure models with longitudinal health outcomes which is robust to misspecification of the outcome variance. We show that compared to maximum likelihood, the loss of efficiency of the proposed method is relatively small when the model is correctly specified. The proposed equations formalize the ad-hoc regression on factor scores procedure, and generalize regression calibration. We propose two weighting schemes for the equations, and compare their efficiency. We apply this method to a study of the effects of in-utero lead exposure on child development.

Related Works:

Keywords: Factor score regression; measurement error; dimension reduction; lead exposure

Full-text: Access denied (no subscription detected)

In 2007, access to the Annals of Applied Statistics was open. Beginning in 2008, you must hold a subscription or be a member of the IMS to view the full journal. For more information on subscribing, please visit: http://imstat.org/orders.
If you are already an IMS member, you may need to update your Euclid profile following the instructions here: http://imstat.org/publications/eaccess.htm.
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aoas/1245676197
Digital Object Identifier: doi:10.1214/08-AOAS226

References

Arminger, G. and Schoenberg, R. J. (1989). Pseudo maximum-likelihood estimation and a test for misspecification in mean and covariance structure models., Psychometrika 54 409–425.
Mathematical Reviews (MathSciNet): MR1030011
Digital Object Identifier: doi:10.1007/BF02294626
Bartholomew, D. J. (1981). Posterior analysis of the factor model., British J. Math. Statist. Psych. 34 93–99.
Mathematical Reviews (MathSciNet): MR649325
Bayley, N. (1993)., Bayley Scale for Infant Development, 2nd ed. The Psychological Corporation, San Antonio, TX.
Beck, R. (1982)., Handbook of Methods for Detecting Test Bias. Johns Hopkins Univ. Press, Baltimore, MD.
Bellinger, D. (1989). Prenatal/early postnatal exposure to lead and risk of developmental impairment., Birth Defects 25 73–97.
Bentler, P. M. and Chou, C. (1987). Practical issues in structural modeling., Sociological Methods & Research 16 78–117.
Bergdahl, I. A., Sheveleva, M., Schütz, A., Artamonova, V. G. and Skerfving, S. (1998). Plasma and blood lead in humans: Capacity-limited binding to, δ-aminolevulinic acid dehidratase and other lead-binding components. Educational and Psychological Measurement 52 549–570.
Bollen, K. A. (1989)., Structural Equation Models With Latent Variables. Wiley, New York.
Mathematical Reviews (MathSciNet): MR996025
Zentralblatt MATH: 0731.62159
Budtz-Jørgensen, E., Keiding, N., Grandjean, P., Weihe, P. and White, R. F. (2002). Estimation of health effects of prenatal mercury exposure using structural equation models., Environmental Health: A Global Access Science Source 1 2.
Budtz-Jørgensen, E., Keiding, N., Grandjean, P., Weihe, P. and White, R. F. (2003). Consequences of measurement error for confounder identification in environmental epidemiology., Statistics in Medicine 22 3089–3100.
Browne, M. W. (1984). Asymptotically distribution-free methods for the analysis of covariance-structures., British J. Math. Statist. Psych. 37 62–83.
Mathematical Reviews (MathSciNet): MR783499
Carroll, R. J., Ruppert, D., Stefansky, L. A. and Crainiceanu, C. (2006)., Measurement Error in Nonlinear Models: A Modern Perspective. Chapman & Hall, Boca Raton, FL.
Mathematical Reviews (MathSciNet): MR2243417
Fox, J. (2006). Structural equation modeling with the sem package in R., Struct. Equ. Model. 13 465–486.
Mathematical Reviews (MathSciNet): MR2240110
Digital Object Identifier: doi:10.1207/s15328007sem1303_7
Fuller, W. A. (1987)., Measurement Error Models. Wiley, New York.
Mathematical Reviews (MathSciNet): MR898653
Fulton, M., Raab, G. M., Thompson, G. O. B., Laxen, D. P. H., Hunter, R. and Hepburn, W. (1987). Influence of blood lead on the ability and attainment of children in Edinburgh., Lancet 1 1221–1226.
Hatzakis, A., Kokkevi, A., Maravelias, C. et al. (1989). Psychometric intelligence deficits in lead-exposed children. In, Lead Exposure and Child Development: An International Assessment 211–223. Kluwer Academic, Dordrecht.
Hoogland, J. J. and Boomsma, A. (1998). Robustness studies in covariance structure modeling: An overview and a meta-analysis., Sociological Methods & Research 26 329–367.
Hu, H., Rabinowitz, M. and Smith, D. (1998). Bone lead as a biologic marker of in epidemiologic studies of chronic toxicity: Conceptual paradigms., Environmental Health Perpsectives 106 1–8.
Hu, H., Tellez-Rojo, M. M., Bellinger, D., Smith, D., Ettinger, A. S., LaMadrid-Figueroa, H., Schwartz, J., Schnass, L., Mercado-Garcia, A. and Hernandez-Avila, M. (2006). Fetal lead exposure at each stage of pregnancy as a predictor of infant mental development., Environmental Health Perpsectives 114 1730–1735.
Gomaa, A., Hu, H., Bellinger, D., Schwartz, J., Tsaih, S. W., Gonzalez-Cossio, T., Schnaas, L., Peterson, K., Aro, A. and Hernandez-Avila, M. (2002). Maternal bone lead as an independent risk factor for fetal neurotoxicity: A prospective study., Pediatrics 110 110–118.
Iwata, S. (1992). Error in-variables regression using estimated latent variables., Econometric Reviews 11 195–200.
Jolliffe, I. T. (1998). Factor analysis, overview. In, Encyclopedia of Biostatistics 1474–1482. Wiley, New York.
Jöreskog, K. G. and Sörbom, D. (1989)., Lisrel 7, A Guide to the Program and Applications. SPSS, Chicago.
Laird, N. M. and Ware, J. H. (1982). Random-effects models for longitudinal data., Biometrics 38 963–974.
LaMadrid-Figueroa, H., Tellez-Rojo, M., Hernandez-Cadena, L., Mercado, A., Smith, D., Hernandez-Avila, M. et al. (2006). The relationship between lead in plasma and whole blood during pregnancy., Journal of Toxicology and Environmental Health 69 1781–1796.
Liang, K. Y. and Zeger, S. L. (1986). Longitudinal data analysis using generalized linear models., Biometrika 73 13–22.
Mathematical Reviews (MathSciNet): MR836430
Zentralblatt MATH: 0595.62110
Digital Object Identifier: doi:10.1093/biomet/73.1.13
Little, R. J. A. and Rubin, D. B. (2002)., Statistical Analysis With Missing Data, 2nd ed. Wiley, New York.
Mathematical Reviews (MathSciNet): MR1925014
Mendola, P., Selevan, S. G., Gutter, S. and Rice, D. (2002). Environmental factors associated with a spectrum of neurodevelopmental deficits., Mental Retardation and Developmental Disabilities Review 8 188–197.
Muthén, B. O. and Muthén, L. K. (1998–2004)., Mplus User’s Guide, 3rd ed. Muthén and Muthén, Los Angeles.
Nikolov, M. C., Coull, B. A., Catalano, P. J. and Godleski, J. J. (2006). An informative Bayesian structural equation model to assess source-specific health effects of air pollution., Biostatistics 3 609–624.
Rabe-Hesketh, S. and Skrondal, A. (2008). Classical latent variable models for medical research., Statistical Methods in Medical Research 17 5–32.
Mathematical Reviews (MathSciNet): MR2420188
Zentralblatt MATH: 1154.62085
Digital Object Identifier: doi:10.1177/0962280207081236
Rabe-Hesketh, S., Skrondal, A. and Pickles, A. (2004). GLLAMM Manual. Working Paper 160, Division of Biostatistics, Univ. California,, Berkeley.
Reddy, S. K. (1992). Effects of ignoring correlated measurement error in structural equation models., Educational and Psychological Measurement 52 549–570.
Roy, J. and Lin, X. (2000). Latent variable models for longitudinal data with multiple continuous outcomes., Biometrics 56 1047–1054.
Mathematical Reviews (MathSciNet): MR1806745
Digital Object Identifier: doi:10.1111/j.0006-341X.2000.01047.x
Sammel, M. D. and Ryan, L. M. (2002). Effects of covariance misspecification in a latent variable model for multiple outcomes., Statist. Sinica 12 1207–1222.
Mathematical Reviews (MathSciNet): MR1947072
Zentralblatt MATH: 1010.62049
Sánchez, B. N., Budtz-Jørgensen, E. and Ryan, L. M. (2009a). Supplement to “An estimating equations approach to fitting latent exposure models with longitudinal health outcomes.”, DOI:10.1214/08-AOAS226SUPPA.
Sánchez, B. N., Budtz-Jørgensen, E., Ryan, L. M. (2009b). Computer code supplement to “An estimating equations approach to fitting latent exposure models with longitudinal health outcomes.”, DOI:10.1214/08-AOAS226SUPPB.
Sánchez, B. N., Budtz-Jørgensen, E., Ryan, L. M. and Hu, H. (2005). Structural equation models: A review with applications to environmental epidemiology., J. Amer. Statist. Assoc. 100 1443–1454.
Schnaas, L., Rothenberg, S. J., Flores, M. F., Martinez, S., Hernandez, C., Osorio, E., Ruiz Velasco, S. and Perroni, E. (2006). Reduced intellectual development in children with prenatal exposure., Environmental Health Perspectives 114 791–797.
Skrondal, A. and Laake, P. (2001). Regression among factor scores., Psychometrika 66 563–575.
Mathematical Reviews (MathSciNet): MR1961914
Digital Object Identifier: doi:10.1007/BF02296196
Skrondal, A. and Rabe-Hesketh, S. (2004)., Generalized Latent Variable Modeling: Multilevel, Longitudinal, and Structural Equation Models. Chapman & Hall, Boca Raton, FL.
Mathematical Reviews (MathSciNet): MR2059021
Zentralblatt MATH: 1097.62001
Stevens, J. (1996)., Applied Multivariate Statistics for the Social Sciences, 4th ed. Lawrence Erlbaum, Mahwah.
Tellez-Rojo, M. M., Hernandez-Avila, M., LaMadrid-Figueroa, H., Smith, D., Hernandez-Cadena, L., Mercado, A., Aro, A., Schwartz, J. and Hu, H. (2004). Impact of bone lead and bone resorption on plasma and whole blood lead levels during pregnancy., American Journal of Epidemiology 160 668–678.
Tucker, L. R. (1971). Relations of factor score estimates to their use., Psycohometrika 36 427–436.
Mathematical Reviews (MathSciNet): MR368315
Digital Object Identifier: doi:10.1007/BF02291367
Wall, M. M. and Li, R. (2003). Comparison of multiple regression to two latent variable techniques for estimation and prediction., Stat. Med. 22 3671–3685.

2009 © Institute of Mathematical Statistics