Sentiment-analysis-on-Covid-19-tweets

Objective:

Classify 45k tweets on Covid-19 as positive or negative based on the following machine learning and deep learning models:

Multinomial Naive Bayes Model
Random Forests
ADABoost
XGBoost
Simple RNN
LSTM
GRU
Bidirectional LSTM
BERT

For machine learning models, the tweets are preprocessed using the following NLP methods:

Bag-of-words model
Bag-of-POS model
Pre-trained Spacy word embeddings

For neural networks, we use the following preprocessing methods:

Pre-trained Spacy word embeddings
Keras embedding layers

Results:

Among machine learning models, XGBoost trained on a bag-of-words model has the best performance in terms of accuracy (82%) and AUC ROC (90%)
Among all models, BERT has the best performance (accuracy = 94%)

References:

https://www.kaggle.com/andreshg/nlp-glove-bert-tf-idf-lstm-explained
https://www.kaggle.com/tanulsingh077/deep-learning-for-nlp-zero-to-transformers-bert#Bi-Directional-RNN's
Azunre, P. (2021). Transfer learning for natural language processing. Simon and Schuster.
Ferrario, A., & Nägelin, M. (2020). The art of natural language processing: classical, modern and contemporary approaches to text document classification. Modern and Contemporary Approaches to Text Document Classification (March 1, 2020).

Data source:

https://www.kaggle.com/datatattle/covid-19-nlp-text-classification

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
input		input
LICENSE		LICENSE
README.md		README.md
nlp-on-covid19-tweets-with-xgboost-lstm-and-bert.ipynb		nlp-on-covid19-tweets-with-xgboost-lstm-and-bert.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment-analysis-on-Covid-19-tweets

Objective:

Results:

References:

Data source:

About

Releases

Packages

Languages

License

vettorefburana/Sentiment-analysis-on-Covid-19-tweets

Folders and files

Latest commit

History

Repository files navigation

Sentiment-analysis-on-Covid-19-tweets

Objective:

Results:

References:

Data source:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages