The Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 13, Number 1 (2019), 367-392.
Modeling within-household associations in household panel studies
Household panel data provide valuable information about the extent of similarity in coresidents’ attitudes and behaviours. However, existing analysis approaches do not allow for the complex association structures that arise due to changes in household composition over time. We propose a flexible marginal modeling approach where the changing correlation structure between individuals is modeled directly and the parameters estimated using second-order generalized estimating equations (GEE2). A key component of our correlation model specification is the “superhousehold”, a form of social network in which pairs of observations from different individuals are connected (directly or indirectly) by coresidence. These superhouseholds partition observations into clusters with nonstandard and highly variable correlation structures. We thus conduct a simulation study to evaluate the accuracy and stability of GEE2 for these models. Our approach is then applied in an analysis of individuals’ attitudes towards gender roles using British Household Panel Survey data. We find strong evidence of between-individual correlation before, during and after coresidence, with large differences among spouses, parent–child, other family, and unrelated pairs. Our results suggest that these dependencies are due to a combination of nonrandom sorting and causal effects of coresidence.
Ann. Appl. Stat., Volume 13, Number 1 (2019), 367-392.
Received: September 2017
Revised: April 2018
First available in Project Euclid: 10 April 2019
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Steele, Fiona; Clarke, Paul S.; Kuha, Jouni. Modeling within-household associations in household panel studies. Ann. Appl. Stat. 13 (2019), no. 1, 367--392. doi:10.1214/18-AOAS1189. https://projecteuclid.org/euclid.aoas/1554861653
- Supplementary information, analysis and code. The supplement includes descriptive analysis of events leading to household change, further details on superhouseholds, Stata code for the construction of superhousehold IDs, additional simulation results, a discussion on positive definite correlation matrices, details on data structures and model estimation in R using geepack.