Course project at McGill University. Implementation from scratch of a Bernoulli Naïve Bayes classifier and text-processing features.

pierremtb/IMDb-binary-sentiment-analysis

IMDb Binary Sentiment Analysis

Course project (COMP551) at McGill University that classifies movie reviews taken directly from IMDb as positive or negative.

Implementation from scratch in Python 3 of a Bernoulli Naïve Bayes classifier and text-processing features, mainly lexicons and n-grams.
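To illustrate the general technique, here is a minimal sketch of a Bernoulli Naïve Bayes classifier over binary n-gram features, using only the standard library. It is not the repository's code: the names `ngrams` and `BernoulliNaiveBayes`, the whitespace tokenization, the bigram limit and the Laplace smoothing parameter `alpha` are all assumptions made for this example.

```python
import math
from collections import Counter


def ngrams(text, n_max=2):
    """Return the set of word n-grams (n = 1..n_max) present in a text.

    Hypothetical helper: lowercases and splits on whitespace only.
    """
    tokens = text.lower().split()
    return {
        " ".join(tokens[i:i + n])
        for n in range(1, n_max + 1)
        for i in range(len(tokens) - n + 1)
    }


class BernoulliNaiveBayes:
    """Illustrative Bernoulli Naive Bayes; each feature is present or absent."""

    def fit(self, docs, labels, alpha=1.0):
        feature_sets = [ngrams(d) for d in docs]
        self.vocab = sorted(set().union(*feature_sets))
        self.classes = sorted(set(labels))
        self.log_prior = {}
        self.log_p = {}      # log P(feature present | class)
        self.log_not_p = {}  # log P(feature absent | class)
        for c in self.classes:
            in_class = [f for f, y in zip(feature_sets, labels) if y == c]
            n_c = len(in_class)
            self.log_prior[c] = math.log(n_c / len(docs))
            # Number of documents of class c that contain each feature.
            counts = Counter(w for f in in_class for w in f)
            self.log_p[c] = {}
            self.log_not_p[c] = {}
            for w in self.vocab:
                # Laplace smoothing (assumed) keeps probabilities away from 0 and 1.
                p = (counts[w] + alpha) / (n_c + 2 * alpha)
                self.log_p[c][w] = math.log(p)
                self.log_not_p[c][w] = math.log(1 - p)
        return self

    def predict(self, doc):
        present = ngrams(doc)
        scores = {}
        for c in self.classes:
            score = self.log_prior[c]
            # Unlike the multinomial variant, absent features also contribute.
            for w in self.vocab:
                if w in present:
                    score += self.log_p[c][w]
                else:
                    score += self.log_not_p[c][w]
            scores[c] = score
        return max(scores, key=scores.get)
```

Calling `BernoulliNaiveBayes().fit(docs, labels)` on a list of review strings and a parallel list of 0/1 labels returns a model whose `predict(text)` gives the more probable label. Lexicon features could be added in the same way, as extra binary indicators alongside the n-grams.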

Performance

The course hosted a Kaggle competition. This model, as is and without any external machine learning libraries, achieved the following score on the test set.

F1 Score
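For reference, the F1 score is the harmonic mean of precision and recall on the positive class. A minimal sketch of how it can be computed without external libraries follows; the function name `f1_score` and its signature are illustrative and not taken from the repository or from Kaggle's scoring code.

```python
def f1_score(y_true, y_pred, positive=1):
    """Compute the F1 score for the given positive label (illustrative helper)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are both zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```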

Lexicon reference

Minqing Hu and Bing Liu. "Mining and Summarizing Customer Reviews." Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2004), Aug 22-25, 2004, Seattle, Washington, USA

Copyright

MIT license
