Open Access
Estimation and model selection for model-based clustering with the conditional classification likelihood
Jean-Patrick Baudry
Electron. J. Statist. 9(1): 1041-1077 (2015). DOI: 10.1214/15-EJS1026

Abstract

The Integrated Completed Likelihood (ICL) criterion was introduced by Biernacki, Celeux and Govaert (2000) in the model-based clustering framework to select a relevant number of classes and has been used by statisticians in various application areas. A theoretical study of ICL is proposed.

A contrast related to the clustering objective is introduced: the conditional classification likelihood. An estimator and model selection criteria are deduced from it. The properties of these new procedures are studied, and ICL is proved to be an approximation of one of these criteria. We contrast these results with the prevailing view that ICL is not consistent. Moreover, these results give insight into the notion of class underlying ICL and inform a broader reflection on the notion of class in clustering.

General results on penalized minimum contrast criteria and upper bounds on the bracketing entropy in parametric situations are derived, which may be of independent interest.

Practical solutions for the computation of the introduced procedures are proposed, notably an adapted EM algorithm and a new initialization method for EM-like algorithms which helps to improve the estimation in Gaussian mixture models.

Citation

Jean-Patrick Baudry. "Estimation and model selection for model-based clustering with the conditional classification likelihood." Electron. J. Statist. 9(1): 1041-1077, 2015. https://doi.org/10.1214/15-EJS1026

Information

Received: 1 March 2014; Published: 2015
First available in Project Euclid: 27 May 2015

zbMATH: 1307.62015
MathSciNet: MR3352067
Digital Object Identifier: 10.1214/15-EJS1026

Subjects:
Primary: 62H30
Secondary: 62H12

Keywords: Bracketing entropy, ICL, Model selection, Model-based clustering, number of classes, penalized criteria

Rights: Copyright © 2015 The Institute of Mathematical Statistics and the Bernoulli Society

Vol.9 • No. 1 • 2015