Detection-of-Disaster-from-Tweets

Twitter has become an important communication channel in times of emergency. The ubiquitousness of smartphones enables people to announce an emergency they’re observing in real-time. Because of this, more agencies are interested in programmatically monitoring Twitter (i.e. disaster relief organizations and news agencies).

But, it’s not always clear whether a person’s words are actually announcing a disaster. Take this example:

The author explicitly uses the word “ABLAZE” but means it metaphorically. This is clear to a human right away, especially with the visual aid. But it’s less clear to a machine.

Objective

Develop a machine learning model that predicts which Tweets are about real disasters and which ones aren’t.

Python's Libraries Used

Numpy
Pandas
Seaborn
Matplotlib.pyplot
Warnings
NLTK
re
string
SymSpellPy
Sklearn
XGBoost
WordCloud

Predictive Accuracy of Machine Learning Models Used

Model Name	Accuracy Score
Random Forests Classifier	79.06%
Decision Tree Classifier	75.38%
Multinomial Naive Bayes	80.53%
Support Vector Classifier	79.62%
K Nearest Neighbors	68.89%
Logistic Regression	79.06%
XG Boost Classifier	77.64%

Best Performing Algorithm

The Multinomial Naive Bayes made the most accurate predictions on detection of a real disaster from any tweet given in the dataset with an overall accuracy of almost 81%. This proves the effectiveness of the Multinomial Naive Bayes algorithm in text classification tasks.

Worst Performing Algorithm

K Nearest Neighbors had the worst performance among all the ML algorithms used, having an accuracy of just over 68%.

Acknowledgments

This dataset was created by the company figure-eight and originally shared on their 'Data For Everyone' website here.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.ipynb_checkpoints		.ipynb_checkpoints
Disaster Detection in Tweets.ipynb		Disaster Detection in Tweets.ipynb
README.md		README.md
sample_submission.csv		sample_submission.csv
test.csv		test.csv
train.csv		train.csv
tweet_disaster_prediction.csv		tweet_disaster_prediction.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.ipynb_checkpoints

.ipynb_checkpoints

Disaster Detection in Tweets.ipynb

Disaster Detection in Tweets.ipynb

README.md

README.md

sample_submission.csv

sample_submission.csv

test.csv

test.csv

train.csv

train.csv

tweet_disaster_prediction.csv

tweet_disaster_prediction.csv

Repository files navigation

Detection-of-Disaster-from-Tweets

Objective

Python's Libraries Used

Predictive Accuracy of Machine Learning Models Used

Best Performing Algorithm

Worst Performing Algorithm

Acknowledgments

About

Releases

Packages

Languages

SayamAlt/Detection-of-Disaster-from-Tweets

Folders and files

Latest commit

History

Repository files navigation

Detection-of-Disaster-from-Tweets

Objective

Python's Libraries Used

Predictive Accuracy of Machine Learning Models Used

Best Performing Algorithm

Worst Performing Algorithm

Acknowledgments

About

Topics

Resources

Stars

Watchers

Forks

Languages