The Annals of Applied Statistics

A Bayesian regression tree approach to identify the effect of nanoparticles’ properties on toxicity profiles

Cecile Low-Kam, Donatello Telesca, Zhaoxia Ji, Haiyuan Zhang, Tian Xia, Jeffrey I. Zink, and Andre E. Nel

Full-text: Open access


We introduce a Bayesian multiple regression tree model to characterize relationships between physico-chemical properties of nanoparticles and their in-vitro toxicity over multiple doses and times of exposure. Unlike conventional models that rely on data summaries, our model solves the low sample size issue and avoids arbitrary loss of information by combining all measurements from a general exposure experiment across doses, times of exposure, and replicates. The proposed technique integrates Bayesian trees for modeling threshold effects and interactions, and penalized B-splines for dose- and time-response surface smoothing. The resulting posterior distribution is sampled by Markov Chain Monte Carlo. This method allows for inference on a number of quantities of potential interest to substantive nanotoxicology, such as the importance of physico-chemical properties and their marginal effect on toxicity. We illustrate the application of our method to the analysis of a library of 24 nano metal oxides.

Article information

Ann. Appl. Stat., Volume 9, Number 1 (2015), 383-401.

First available in Project Euclid: 28 April 2015

Permanent link to this document

Digital Object Identifier

Mathematical Reviews number (MathSciNet)

Zentralblatt MATH identifier

Bayesian CART nanotoxicology P-splines regression trees


Low-Kam, Cecile; Telesca, Donatello; Ji, Zhaoxia; Zhang, Haiyuan; Xia, Tian; Zink, Jeffrey I.; Nel, Andre E. A Bayesian regression tree approach to identify the effect of nanoparticles’ properties on toxicity profiles. Ann. Appl. Stat. 9 (2015), no. 1, 383--401. doi:10.1214/14-AOAS797.

Export citation


  • Besag, J. and Kooperberg, C. (1995). On conditional and intrinsic autoregressions. Biometrika 82 733–746.
  • Breiman, L., Friedman, J. H., Olshen, R. A. and Stone, C. J. (1984). Classification and Regression Trees. Wadsworth, Belmont, CA.
  • Chipman, H. A., George, E. I. and McCulloch, R. E. (1998). Bayesian CART model search. J. Amer. Statist. Assoc. 93 935–948.
  • Chipman, H. A., George, E. I. and McCulloch, R. E. (2002). Bayesian treed models. Machine Learning 48 299–320.
  • Chipman, H. A., George, E. I. and McCulloch, R. E. (2010a). BART: Bayesian additive regression trees. Ann. Appl. Stat. 4 266–298.
  • Chipman, H. A., George, E. I. and McCulloch, R. E. (2010b). Implementation of BART: Bayesian additive regression trees. R package version 0.3-1.1.
  • De’ath, G. (2002). Multivariate regression trees: A new technique for modeling species-environment relationships. Ecology 83 1105–1117.
  • Eilers, P. H. C. and Marx, B. D. (1996). Flexible smoothing with B-splines and penalties. Statist. Sci. 11 89–121.
  • Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. Ann. Statist. 29 1189–1232.
  • Galimberti, G. and Montanari, A. (2002). Regression trees for longitudinal data with time-dependent covariates. In Classification, Clustering, and Data Analysis 391–398. Springer, Berlin.
  • Gramacy, R. B. and Lee, H. K. H. (2008). Bayesian treed Gaussian process models with an application to computer modeling. J. Amer. Statist. Assoc. 103 1119–1130.
  • Gramacy, R. B. and Taddy, M. A. (2010). Categorical inputs, sensitivity analysis, optimization and importance tempering with tgp version 2, an R package for treed Gaussian process models. Journal of Statistical Software 33 1–48.
  • Gramacy, R. B., Taddy, M. and Wild, S. M. (2013). Variable selection and sensitivity analysis using dynamic trees, with an application to computer code performance tuning. Ann. Appl. Stat. 7 51–80.
  • Konomi, B., Karagiannis, G., Sarkar, A., Sun, X. and Lin, G. (2014). Bayesian treed multivariate Gaussian process with adaptive design: Application to a carbon capture unit. Technometrics 56 145–158.
  • Lang, S. and Brezger, A. (2004). Bayesian P-splines. J. Comput. Graph. Statist. 13 183–212.
  • Liu, R., Rallo, R., George, S., Ji, Z., Nair, S., Nel, A. E. and Cohen, Y. (2011). Classification NanoSAR development for cytotoxicity of metal oxide nanoparticles. Small 7 1118–1126.
  • Low-Kam, C., Telesca, D., Ji, Z., Zhang, H., Xia, T., Zink, J. I. and Nel, A. (2015). Supplement to “A Bayesian regression tree approach to identify the effect of nanoparticles’ properties on toxicity profiles.” DOI:10.1214/14-AOAS797SUPPA, DOI:10.1214/14-AOAS797SUPPB.
  • Patel, T., Telesca, D., Low-Kam, C., Ji, Z. X., Zhang, H. Y., Xia, T., Zinc, J. I. and Nel, A. E. (2014). Relating nano-particle properties to biological outcomes in exposure escalation experiments. Environmetrics 25 57–68.
  • Ramsay, J. O. (1998). Monotone regression splines in action. Statist. Sci. 3 425–441.
  • Rowe, D. B. (2003). Multivariate Bayesian Statistics: Models for Source Separation and Signal Unmixing. Chapman & Hall/CRC, Boca Raton, FL.
  • Segal, M. R. (1992). Tree-structured methods for longitudinal data. J. Amer. Statist. Assoc. 87 407–418.
  • Sela, R. J. and Simonoff, J. S. (2012). RE–EM trees: A data mining approach for longitudinal and clustered data. Mach. Learn. 86 169–207.
  • Wu, Y., Tjelmeland, H. and West, M. (2007). Bayesian CART: Prior specification and posterior simulation. J. Comput. Graph. Statist. 16 44–66.
  • Yu, Y. and Lambert, D. (1999). Fitting trees to functional data, with an application to time-of-day patterns. J. Comput. Graph. Statist. 8 749–762.
  • Yu, K., Wheeler, W., Li, Q., Bergen, A. W., Caporaso, N., Chatterjee, N. and Chen, J. (2010). A partially linear tree-based regression model for multivariate outcomes. Biometrics 66 89–96.
  • Zhang, S., Shih, Y.-C. T. and Müller, P. (2007). A spatially-adjusted Bayesian additive regression tree model to merge two datasets. Bayesian Anal. 2 611–633.
  • Zhang, H., Ji, Z., Xia, T., Meng, H., Low-Kam, C., Liu, R., Pokhrel, S., Lin, S., Wang, X., Liao, Y.-P., Wang, M., Li, L., Rallo, R., Damoiseaux, R., Telesca, D., Mädler, L., Cohen, Y., Zink, J. I. and Nel, A. E. (2012). Use of metal oxide nanoparticle band gap to develop a predictive paradigm for oxidative stress and acute pulmonary inflammation. ACS Nano 6 4349–4368.

Supplemental materials