Electronic Journal of Statistics

Visualizing the structure of large trees

Burcu Aydın, Gábor Pataki, Haonan Wang, Alim Ladha, Elizabeth Bullitt, and J.S. Marron

Full-text: Open access


This study introduces a new method of visualizing complex tree structured objects. The usefulness of this method is illustrated in the context of detecting unexpected features in a data set of very large trees. The major contribution is a novel two-dimensional graphical representation of each tree, with a covariate coded by color.

The motivating data set contains three dimensional representations of brain artery systems of 105 subjects. Due to inaccuracies inherent in the medical imaging techniques, issues with the reconstruction algorithms and inconsistencies introduced by manual adjustment, various discrepancies are present in the data. The proposed representation enables quick visual detection of the most common discrepancies. For our driving example, this tool led to the modification of 10% of the artery trees and deletion of 6.7%.

The benefits of our cleaning method are demonstrated through a statistical hypothesis test on the effects of aging on vessel structure. The data cleaning resulted in improved significance levels. Our second example analyses brain artery images of healthy patients and patients with brain tumor. Our visualization can identify tumor patients.

Article information

Electron. J. Statist., Volume 5 (2011), 405-420.

First available in Project Euclid: 10 May 2011

Permanent link to this document

Digital Object Identifier

Mathematical Reviews number (MathSciNet)

Zentralblatt MATH identifier

Primary: 62-09: Graphical methods
Secondary: 62-07: Data analysis

Visualization data cleaning tree structure descendant-level view brain arteries


Aydın, Burcu; Pataki, Gábor; Wang, Haonan; Ladha, Alim; Bullitt, Elizabeth; Marron, J.S. Visualizing the structure of large trees. Electron. J. Statist. 5 (2011), 405--420. doi:10.1214/11-EJS612. https://projecteuclid.org/euclid.ejs/1305034908

Export citation


  • [1] Aydın, B., Pataki, G., Wang, H., Bullitt, E., Marron, J.S. (2009) A Principal Component Analysis For Trees, Annals of Applied Statistics, 3, 1597–1615.
  • [2] Aylward, S. and Bullitt, E. (2002) Initialization, noise, singularities and scale in height ridge traversal for tubular object centerline extraction, IEEE Transactions on Medical Imaging, 21, 61–75.
  • [3] Bullitt E, Gerig G, Pizer SM, Aylward S. (2003) Measuring tortuosity of the intracerebral vasculature from MRA images. IEEE-TMI, 22:1163-1171.
  • [4] Huson, D., Richter, D., Rausch, C., Dezulian, T., Franz, M., Rupp, R. (2007) Dendroscope: An interactive viewer for large phylogenetic trees, BMC Bioinformatics, 8, 460.
  • [5] Letunic, I., Bork, P. (2006) Interactive Tree of Life (iTOL): an online tool for phylogenetic tree display and annotation, Bioinformatics Applications Note, 23-1, 127–128.
  • [6] Nguyen, Q., Huang, M. (2002) A Space-Optimized Tree Visualization, INFOVIS ’02: Proceedings of the IEEE Symposium on Information Visualization (InfoVis’02), 85.
  • [7] Nguyen, Q., Huang, M. (2007) Improvements of Space-Optimized Tree for Visualizing and Manipulating Very Large Hierarchies, Pan-Sydney Area Workshop on Visual Information Processing (VIP2002), Sydney, Australia, Conferences in Research and Practice in Information Technology, Vol. 22.
  • [8] Simonis, H., Aggoun, A. (2000) Search-Tree Visualization, Lecture Notes In Computer Science, Vol. 1870, 191–208.
  • [9] Shneiderman, B. (1998) Tree Visualization With Tree-Maps: 2-D Space Filling Approach, ACM Trans. Graph, 11-1, 92–99.
  • [10] Wang, H. and Marron, J. S. (2007) Object oriented data analysis: Sets of trees, The Annals of Statistics, 35, 1849–1873.