Unsupervised naive Bayes for data clustering with mixtures of truncated exponentials

In this paper we propose a naive Bayes model for unsupervised data clustering, where the class variable is hidden. The feature variables can be discrete or continuous, as the conditional distributions are represented as mixtures of truncated exponentials (MTEs). The number of classes is determined u...

Description complète

Détails bibliographiques
Auteurs principaux: Gámez Martín, José Antonio, Rumí, Rafael, Salmerón Cerdán, Antonio
Format: info:eu-repo/semantics/report
Langue:English
Publié: 2012
Accès en ligne:http://hdl.handle.net/10835/1555
Description
Résumé:In this paper we propose a naive Bayes model for unsupervised data clustering, where the class variable is hidden. The feature variables can be discrete or continuous, as the conditional distributions are represented as mixtures of truncated exponentials (MTEs). The number of classes is determined using the data augmentation algorithm. The proposed model is compared with the conditional Gaussian model for some real world and synthetic databases.