Skip to content

This repo involves creating dataset for toxity ranking, and ranking the datat using a LightGBM model

Notifications You must be signed in to change notification settings

superdianuj/toxity_ranker

Repository files navigation

toxity_ranker

First the dataset for learning embeddings of the textual data in "toxity_data.csv" is created from "create_dataset".

Second, the embedddings of the text are learned from "transfer_learning_text" in a transfer learned manner.

Lastly, the LightGBM ranking model is fitted on these embedding features, with output ranks as thresholding of toxity type vector.

About

This repo involves creating dataset for toxity ranking, and ranking the datat using a LightGBM model

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published