Tweet Disasters Detection

Build a machine learning model that predicts which tweets report real disasters. A disaster tweet typically contains hashtags such as #earthquake, #COVID, or #pandemic, whereas a normal tweet reads more like "What is up man".
In this project, I performed data preprocessing and cleaning, such as removing extra spaces, hashtags, and https links.
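The cleaning steps listed above could look roughly like the sketch below. It is not the notebook's actual code: the function name `clean_tweet` is hypothetical, and it assumes that "removing hashtags" means dropping the `#` symbol while keeping the word, since words like "earthquake" carry the signal.

```python
import re


def clean_tweet(text: str) -> str:
    """Hypothetical helper: remove links and '#' symbols, collapse whitespace."""
    # Drop http/https links entirely.
    text = re.sub(r"https?://\S+", " ", text)
    # Assumption: keep the hashtag word, remove only the '#' character.
    text = text.replace("#", "")
    # Collapse runs of whitespace into a single space and trim the ends.
    return re.sub(r"\s+", " ", text).strip()


cleaned = clean_tweet("Huge   #earthquake near LA https://t.co/abc123  ")
print(cleaned)
```

If the hashtag words should be removed as well, the second step could instead use a pattern such as `#\w+`.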
I also implemented three models: a simple linear classifier on word count vectors, Logistic Regression with K-fold cross-validation and Grid Search CV for the best parameters, and a BERT model. The first two models reached accuracies of 78% and 79.9%, respectively. The BERT model performed best, achieving an accuracy of 82.8% with some data cleaning.
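As a rough illustration of the second approach, the sketch below combines word counts, Logistic Regression, K-fold cross-validation, and a grid search using scikit-learn. The toy tweets, the labels, the parameter grid over `C`, and the fold count are all assumptions made for the example; the notebooks may use different settings and the real training data.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, KFold
from sklearn.pipeline import Pipeline

# Toy data for illustration only (1 = disaster, 0 = normal tweet).
tweets = [
    "earthquake hits the city, buildings collapsed",
    "wildfire forces thousands to evacuate",
    "flood warning issued for the river valley",
    "pandemic cases rising, hospitals overwhelmed",
    "what is up man",
    "loving this new song so much",
    "coffee with friends this morning",
    "cannot wait for the weekend",
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

# Word counts feed a Logistic Regression classifier.
pipeline = Pipeline([
    ("counts", CountVectorizer()),
    ("clf", LogisticRegression(max_iter=1000)),
])

# Assumed search space: only the regularization strength C is tuned here.
param_grid = {"clf__C": [0.1, 1.0, 10.0]}
cv = KFold(n_splits=4, shuffle=True, random_state=0)

search = GridSearchCV(pipeline, param_grid, cv=cv, scoring="accuracy")
search.fit(tweets, labels)

print("best parameters:", search.best_params_)
predictions = search.predict(["wildfire spreading fast near town", "what is up man"])
print("predictions:", list(predictions))
```

Wrapping the vectorizer and the classifier in one `Pipeline` means the vocabulary is refit on each training fold, so the cross-validated accuracy is not inflated by words seen only in the held-out fold.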

Repository files:
BERT_Tweet_Disaster.ipynb
NLP_disater_tweets.ipynb
README.md
model.json
submission_final.csv
test.csv
train.csv
y_predict_0.8accuracy.csv

mytran2111/NLP_tweet_disaters
