In this project, Gaussian Mixture Model (GMM) is used as a generative classifier. We use the scikit-learn library from python which uses the Expectation Maximization (EM) to train a GMM model. A GMM model can be employed to estimate the PDF of some samples (like a parametric density estimator).
Here, we train an individual GMM model (with K Components, K = 1,5,10,) for each class. Therefore, N GMM models will be created where N shows the number of classes. The label of a sample can be determined using Maximum Likelihood(ML) criteria. In another words, we should find the likelihood of a sample in all classes and then select the class with the maximum likelihood as the label of the sample. Also, use five-time-five-fold cross validation to determine the best K.
-
User Knowledge Modeling Data Set (UKM): https://archive.ics.uci.edu/ml/datasets/User+Knowledge+Modeling
-
Vehicle.dat
-
Health.dat