Moj Multilingual Abusive Comment Identification - Challenge

Submission for the Moj Abusive Comment Detection Challenge, hosted on Kaggle.

Setup

python -m venv env # virtual env
pip install -r requirements.txt
source env/bin/activate

Change index in main.py to choose which model of the ensemble to train/test.
Run python main.py
utils.py contains helper functions for caching and ensembling.

Note: BERT models are large, GPU with 16GB VRAM required. Batch size can be reduced if training on 8GB GPU.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
README.md		README.md
dataset.py		dataset.py
main.py		main.py
model.py		model.py
requirements.txt		requirements.txt
utils.py		utils.py