Bayesian Analysis

Chain Event Graphs for Informed Missingness

Lorna M. Barclay, Jane L. Hutton, and Jim Q. Smith

Full-text: Open access


Chain Event Graphs (CEGs) are proving to be a useful framework for modelling discrete processes that exhibit strong asymmetric dependence structures between the variables of the problem. In this paper we exploit this framework to represent processes where missingness is influential and data cannot plausibly be hypothesised to be missing at random in all situations. We develop new classes of models where data are missing not at random but nevertheless exhibit context-specific symmetries which are captured by the CEG. We show that it is possible to score each model efficiently and in closed form. Hence standard Bayesian selection methods can be used to search over a wide variety of models, each with its own explanatory narrative. One of the advantages of this method is that the selected maximum a posteriori model and other closely scoring models can be easily read back to the client in a graphically transparent way. The efficacy of our methods is illustrated using a cerebral palsy cohort study, analysing the survival of cohort members with respect to weight at birth and various disabilities.
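The abstract does not state the scoring formula, so the following is only a minimal sketch of what a closed-form score of this kind could look like. It assumes, as is standard in Bayesian CEG model selection, independent Dirichlet priors on the edge probabilities of each stage, so that the log marginal likelihood is a sum of Dirichlet-multinomial terms. The function names `stage_log_score` and `ceg_log_score`, the prior weights, and the counts are all hypothetical and are not taken from the paper.

```python
from math import lgamma


def stage_log_score(alpha, counts):
    """Log marginal likelihood of one stage under a Dirichlet(alpha) prior.

    alpha  : prior weights, one per outgoing edge (assumed, not from the paper)
    counts : observed edge counts for the same edges
    """
    if len(alpha) != len(counts):
        raise ValueError("alpha and counts must have the same length")
    a_sum = sum(alpha)
    n_sum = sum(counts)
    score = lgamma(a_sum) - lgamma(a_sum + n_sum)
    for a, n in zip(alpha, counts):
        score += lgamma(a + n) - lgamma(a)
    return score


def ceg_log_score(stages):
    """Sum the stage scores; `stages` is a list of (alpha, counts) pairs."""
    return sum(stage_log_score(a, n) for a, n in stages)


# Hypothetical comparison: is a value missing (yes/no) in two contexts?
# Model A keeps the two contexts as separate stages; model B merges them,
# asserting that missingness behaves identically in both contexts.
separate = [([1.0, 1.0], [30, 10]), ([1.0, 1.0], [28, 12])]
merged = [([2.0, 2.0], [58, 22])]
log_bayes_factor = ceg_log_score(merged) - ceg_log_score(separate)
print(f"log Bayes factor (merged vs separate): {log_bayes_factor:.3f}")
```

Because each candidate model's score is a sum of such terms, merging or splitting stages only changes the terms for the stages involved, which is what makes a search over many models cheap under these assumptions.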

Article information

Bayesian Anal., Volume 9, Number 1 (2014), 53-76.

First available in Project Euclid: 24 February 2014

Keywords: Chain Event Graphs; Ordinal Chain Event Graphs; Bayesian Model Selection; Missing Data; Missing Not at Random


Barclay, Lorna M.; Hutton, Jane L.; Smith, Jim Q. Chain Event Graphs for Informed Missingness. Bayesian Anal. 9 (2014), no. 1, 53--76. doi:10.1214/13-BA843.

