In this project, we aim to develop a system to automatically classifier incoming message, comment,.. and tells whether the underlying sentiment is positive or negative.
We used stacking ensemble (navie bayes, logistic regression, decision tree, randomn forest) for sentiment classification using AIViVN's comments dataset.
The model scored 0.87 (f1 score) in test set.
We used mlxtend to stacking models.
The first-level classifiers are:
- Logistic Regression
- Decision Tree
- Random Forest
- GaussianNB
- BernoulliNB
The meta classifier is Logistic Regression. The class-probabilities of the first-level classifiers are used to train the meta-classifier.
Here’s what an ensemble stacking model does: reference