Unsupervised naive Bayes for data clustering with mixtures of truncated exponentials

In this paper we propose a naive Bayes model for unsupervised data clustering, where the class variable is hidden. The feature variables can be discrete or continuous, as the conditional distributions are represented as mixtures of truncated exponentials (MTEs). The number of classes is determined u...

Descrición completa

Detalles Bibliográficos
Main Authors: Gámez Martín, José Antonio, Rumí, Rafael, Salmerón Cerdán, Antonio
Formato: info:eu-repo/semantics/report
Idioma:English
Publicado: 2012
Acceso en liña:http://hdl.handle.net/10835/1555
Descripción
Summary:In this paper we propose a naive Bayes model for unsupervised data clustering, where the class variable is hidden. The feature variables can be discrete or continuous, as the conditional distributions are represented as mixtures of truncated exponentials (MTEs). The number of classes is determined using the data augmentation algorithm. The proposed model is compared with the conditional Gaussian model for some real world and synthetic databases.