Toxic-Comment-Classifiers

The increased use in social media has led to more human interactions, but at the cost of more toxic language. In this paper, we attempt to produce an automated classification system to flag toxic comments in social media using deep learning techniques. We broadly define toxic language as any comment that includes profanity, offensive language, or hate speech. We compared the performance of a Logistic Regression, a Bidirectional LSTM Neural Network, and a Bidirectional Encoder Representations from Transformers (BERT) model. The best model, in terms of F1 score and recall, is BERT, followed by the Bidirectional LSTM and the Logistic Regression. We observe high overfitting in the Bidirectional LSTM model, which we propose to improve by including more training data. Compared to other state-of-the-art classifiers, all our models are more robust in adversarial contexts, where users obfuscate toxic language. We propose more thorough preprocessing to recognize toxic text, such as using spell check.

See results of the models below.

Data

Find a sample of the input data in sample_data/

Analysis

Find the Data exploration, Logistic Regression, biLSTM and BERT classifications model notebooks in [src/models/]src/models/

Presentation slides

https://docs.google.com/presentation/d/1WC6C03zoXjigjjxAV_hsDCdBcG27DAZtMcX1L058HDI/edit#slide=id.g12e6e4cab85_0_43

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
output		output
sample_data		sample_data
src		src
.gitignore		.gitignore
Final Project.pdf		Final Project.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Toxic-Comment-Classifiers

Data

Analysis

Presentation slides

Final Report

Kaggle Dataset

Twitter Dataset

About

Releases

Packages

Languages

Beau-Smit/toxic-comment

Folders and files

Latest commit

History

Repository files navigation

Toxic-Comment-Classifiers

Data

Analysis

Presentation slides

Final Report

Kaggle Dataset

Twitter Dataset

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages