Skip to content

jimis-anastas/Binary-Sentiment-Classification

Repository files navigation

Binary Sentiment Classification

In this project three machine learning algorithms (Naive Bayes, ID3 & Ada Boost) have been implemented for the classification of movie reviews as either positive or negative.

For the training and testing of the algorithms the "Large Movie Review Dataset", also known as "IMDB dataset", has been used which can be found here. In order for the code to work simply download it from the link provided and import it in the project file.

In this implementation every review text is represented by a binary vector, which shows which words from a vocabulary are contained in each review. For the definition of the vocabulary, an information gain algorithm is used that selects the most appropriate words.

The learning curves of the algorithms are shown below.

Naive Bayes Learning Curves

figure_naiveBayes

ID3 Learning Curves

figure_ID3

AdaBoost Learning Curves

figure_adaBoost

About

A machine learning project that implements Binary Sentiment Classification.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages