Toxic Comment Test

Note: Due to the nature of toxic comments, please consider this project explicit.

Python script to train a TensorFlow classification model, and a Streamlit app to use the model.

Demo running instance: https://huggingface.co/spaces/vluz/Tox

To use the pretrained model, please download toxmodel.keras and vectorizer.pkl from the HuggingFace link:
https://huggingface.co/vluz/toxmodel30/tree/main/model

Open a command prompt and cd to a new directory of your choosing.

Create a virtual environment with:

python -m venv "venv"
venv\Scripts\activate

To install do:

git clone https://github.com/vluz/ToxTest.git
cd ToxTest
pip install -r requirements.txt

Put train.csv into the data directory (needed for training)
and/or
put toxmodel.keras and vectorizer.pkl into the model directory (needed for testing with the pretrained model).
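
Before training or testing, it can help to confirm the files are where the scripts expect them. The following is a minimal sketch, not part of the repository: the helper name `missing_files` is hypothetical, and it assumes the data and model directories sit directly under the directory you run it from.

```python
from pathlib import Path

# File locations as described in this README, relative to the repository root.
TRAIN_FILES = [Path("data") / "train.csv"]
MODEL_FILES = [Path("model") / "toxmodel.keras", Path("model") / "vectorizer.pkl"]


def missing_files(root, required):
    """Return the entries of `required` that do not exist under `root`.

    Hypothetical helper for illustration; it is not provided by ToxTest.
    """
    root = Path(root)
    return [rel for rel in required if not (root / rel).is_file()]


if __name__ == "__main__":
    for label, required in (("training", TRAIN_FILES), ("testing", MODEL_FILES)):
        missing = missing_files(".", required)
        if missing:
            print(f"Not ready for {label}; missing: " + ", ".join(map(str, missing)))
        else:
            print(f"Ready for {label}.")
```

Run it from inside the ToxTest directory; it only reports what is missing and does not download or move anything.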

To train do:

python toxtrain.py

To test using existing model do:

streamlit run toxtest.py

To exit the virtual environment do:

deactivate

The helper script dataclean.py provides text cleaning for the original data.
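
As a rough illustration of what comment cleaning can involve, here is a minimal sketch using only the standard library. It is an assumption for illustration, not the contents of dataclean.py: the function name `clean_text` and the specific steps (lowercasing, dropping URLs, keeping only ASCII letters) are hypothetical choices.

```python
import re


def clean_text(text):
    """Lowercase, strip URLs and non-letter characters, collapse whitespace.

    Hypothetical example; dataclean.py may clean the data differently.
    """
    text = text.lower()
    text = re.sub(r"https?://\S+", " ", text)  # drop URLs
    text = re.sub(r"[^a-z\s]", " ", text)      # keep only ASCII letters and whitespace
    return re.sub(r"\s+", " ", text).strip()   # collapse runs of whitespace


if __name__ == "__main__":
    print(clean_text("Hello,   WORLD!! Visit http://example.com now."))
```

Note that stripping everything except ASCII letters also discards digits, emoji, and accented characters, which may or may not be desirable for a given dataset.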

The helper script renderwordcloud.py renders word clouds for both the data as a whole and the toxic comments.
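
A word cloud sizes each word by how often it appears, so the underlying data is a word frequency count. The sketch below shows only that counting step with the standard library; it does not render an image and is not taken from renderwordcloud.py. The function name `word_frequencies` and the whitespace tokenization are assumptions for illustration.

```python
from collections import Counter


def word_frequencies(comments, top_n=10):
    """Count whitespace-separated, lowercased words across all comments.

    Hypothetical helper; returns the `top_n` most common (word, count) pairs.
    """
    counts = Counter()
    for comment in comments:
        counts.update(comment.lower().split())
    return counts.most_common(top_n)


if __name__ == "__main__":
    sample = ["a b a", "b a c"]
    print(word_frequencies(sample, top_n=2))
```

Running the same count once over all comments and once over only the toxic ones gives the two views described above.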
