Multilingual Toxicity Detector

NLP deep learning model for toxicity detection in text (English, Spanish, Turkish, Russian, French, Portuguese, Italian), trained on a TS-877 Ryzen-based NAS with 8 cores and 16 threads, with a GeForce GTX 1060 6GB graphics card. This repo includes the serving of the model with Tensorflow + Flask + AJAX.

The model

The input is ingested by a Distilbert Transformer (from @huggingface) previously being tokenized by the corresponding tokenizer. Then, the embeddings enter a Funnel component, which models (non-)linear combinations starting from the embedding up to the final node, which contains a neuron with a sigmoid activation function that predicts the toxicity for the given input.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
img		img
static/css		static/css
templates		templates
.gitignore		.gitignore
FinalReport.pdf		FinalReport.pdf
README.md		README.md
app.py		app.py
initial-notebook.ipynb		initial-notebook.ipynb
open-model.py		open-model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

img

img

static/css

static/css

templates

templates

.gitignore

.gitignore

FinalReport.pdf

FinalReport.pdf

README.md

README.md

app.py

app.py

initial-notebook.ipynb

initial-notebook.ipynb

open-model.py

open-model.py

Repository files navigation

Multilingual Toxicity Detector

The model

About

Releases

Packages

Languages

margaritageleta/multilingual-toxicity-detector

Folders and files

Latest commit

History

Repository files navigation

Multilingual Toxicity Detector

The model

About

Topics

Resources

Stars

Watchers

Forks

Languages