A system to predict the category of tweets during times of crisis (like - flood, cyclone, earthquake, wildfires, pandemic etc.) Multilabel classification was done. Here, we have used 24 categories. They are-
- Request for Goods and Services
- Request for Search And Rescue
- Request for Information
- Call to Action for Volunteer
- Call To Action for Donations
- Call To Action for Moving People
- Report of First Party Observation
- Report of Third Party Observation
- Report of Weather
- Report of Emerging Threats
- Report of New Sub Event
- Report of Multimedia Share
- Report of Service Available
- Report of Factoid
- Report of Official,
- Report of News
- Report of CleanUp
- Report of Hashtags
- Report of OriginalEvent
- ContextualInformation
- Advice
- Sentiment
- Discussion
- Irrelevant.
Of these 24 categories, 6 of them(written in bold) are actionable categories that needs immediate attention by the emergency response officers.
We have experimented with 5 statistical classifiers. They are-
- Multinomial Naive Bayes
- Support Vector Machine
- Stochastic Gradient Descent
- Decision Tree
- Random Forest
We have also experimented with 1 neural network (Multilayer Perceptron)