I am attempting to work out a model to do Textual Classification and I guess I could be able to use VBA for the programming part of it. Some methods I've explored are - Naive Bayes & EM, each one constituiting one for supervised and unsupervised.
I'm only looking for supervised learning for the time being, so assume I've got about 1000 odd documents containing words which I want to classify into 4 pre-determined classes. So, I want to be able to apply a Bayes Classifier on a training set, and finally use it on a validation set. Using the bag of words model is a hidden box inside, and I'm stuck at the place where I need to find the log likelihood of each word appearing in the class.
Looking for some helping pertaining to this area.