- Bayesian Anal.
- Volume 13, Number 2 (2018), 559-626.
Bayesian Cluster Analysis: Point Estimation and Credible Balls (with Discussion)
Clustering is widely studied in statistics and machine learning, with applications in a variety of fields. As opposed to popular algorithms such as agglomerative hierarchical clustering or k-means which return a single clustering solution, Bayesian nonparametric models provide a posterior over the entire space of partitions, allowing one to assess statistical properties, such as uncertainty on the number of clusters. However, an important problem is how to summarize the posterior; the huge dimension of partition space and difficulties in visualizing it add to this problem. In a Bayesian analysis, the posterior of a real-valued parameter of interest is often summarized by reporting a point estimate such as the posterior mean along with 95% credible intervals to characterize uncertainty. In this paper, we extend these ideas to develop appropriate point estimates and credible sets to summarize the posterior of the clustering structure based on decision and information theoretic techniques.
Bayesian Anal., Volume 13, Number 2 (2018), 559-626.
First available in Project Euclid: 19 October 2017
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Wade, Sara; Ghahramani, Zoubin. Bayesian Cluster Analysis: Point Estimation and Credible Balls (with Discussion). Bayesian Anal. 13 (2018), no. 2, 559--626. doi:10.1214/17-BA1073. https://projecteuclid.org/euclid.ba/1508378464
- Supplementary material for Bayesian cluster analysis: Point estimation and credible balls.