Disaster Tweet Classification

This is the repository for a mini-project where we classify whether a tweet is discussing a disastrous event (e.g flood) or not (see dataset for details). We use a number of pre-trained transformer models such as BERT and RoBERTa and perform an ablation study on how these models perform with various degrees of preprocessing. Additionally, we also studied the effect of data augmentation via back translation on the models and could generally observe an increase in performance:

For more details, see the report.

Installation & Running

To run the code:

Install Python 3.9+ and jupyter notebook.
Run all_models.ipynb which will install the remaining dependencies.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.idea		.idea
data		data
docs		docs
misc		misc
.gitignore		.gitignore
README.md		README.md
all_models.ipynb		all_models.ipynb
train-augmented.csv		train-augmented.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.idea

.idea

data

data

docs

docs

misc

misc

.gitignore

.gitignore

README.md

README.md

all_models.ipynb

all_models.ipynb

train-augmented.csv

train-augmented.csv

Repository files navigation

Disaster Tweet Classification

Installation & Running

About

Releases

Packages

Languages

NiklasZ/disaster-tweets-project

Folders and files

Latest commit

History

Repository files navigation

Disaster Tweet Classification

Installation & Running

About

Resources

Stars

Watchers

Forks

Languages