
Text Detoxification with Parallel Data

| toxic text | neutral paraphrase |
| --- | --- |
| this is scaring the s**t out of me. | This is really scaring me. |
| calm the f**k down | Please calm your nerves |
| its a crock of s**t , and you know it. | It's senseless, you know it |

This repo summarizes all the information about the Text Detoxification project. Here, you can find all datasets, evaluation setups, and SOTA models for text detoxification in English and Russian.

Where can text detoxification be used?

  1. Before training your language model or chatbot, you can preprocess scraped training data to ensure it contains no toxicity. However, you should not simply throw toxic samples away: you can detoxify them! This way, most of the dataset is preserved, and so is its content.
  2. You can also ensure that incoming user messages are non-toxic. Again, the message itself is preserved, and after detoxification the conversation will not drift into an unsafe tone (see the sketch below).
  3. You can safeguard the answers of your chatbot as well! The conversation does not have to stop even if your chatbot generates something toxic: its reply is detoxified, and the user sees a neutral answer.
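
A minimal sketch of use case (2): screening a user message before it reaches the bot. The Hub ids `s-nlp/roberta_toxicity_classifier` and `s-nlp/bart-base-detox` and the `neutral` label name are assumptions here; see the Models and Evaluation sections below for the exact checkpoints.

```python
# Sketch only: combine a toxicity classifier with a detoxification model
# to keep a conversation in a neutral tone. Model ids and label names are
# assumptions, not a confirmed part of this repository.
from transformers import pipeline

toxicity_clf = pipeline("text-classification", model="s-nlp/roberta_toxicity_classifier")
detoxifier = pipeline("text2text-generation", model="s-nlp/bart-base-detox")

def moderate(message: str) -> str:
    """Return the message as-is if it is neutral, otherwise a detoxified paraphrase."""
    label = toxicity_clf(message)[0]["label"]
    if label.lower() == "neutral":  # label names depend on the classifier's config
        return message
    return detoxifier(message, max_new_tokens=64)[0]["generated_text"]

print(moderate("calm the f**k down"))  # expected: something like "Please calm down"
```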

Main Idea

We are the first to address the text detoxification task as a seq2seq generation task. To achieve this, we collected parallel corpora of toxic<->non-toxic text pairs (see the illustration). We then trained language models on these corpora to perform detoxification. The resulting models perform very well and clearly outperform previous unsupervised approaches.
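
To make the seq2seq idea concrete, here is a rough fine-tuning sketch on toxic -> neutral pairs. The base checkpoint (`facebook/bart-base`), the dataset id (`s-nlp/paradetox`), the column names, and all hyperparameters are illustrative assumptions, not the exact recipe behind bart-base-detox.

```python
# Sketch only: fine-tune a pre-trained encoder-decoder on parallel
# toxic -> neutral pairs, which is the core of the ParaDetox approach.
from datasets import load_dataset
from transformers import (AutoTokenizer, AutoModelForSeq2SeqLM,
                          DataCollatorForSeq2Seq, Seq2SeqTrainer,
                          Seq2SeqTrainingArguments)

tok = AutoTokenizer.from_pretrained("facebook/bart-base")
model = AutoModelForSeq2SeqLM.from_pretrained("facebook/bart-base")
data = load_dataset("s-nlp/paradetox")  # assumed Hub id of EnParaDetox

def preprocess(batch):
    # "en_toxic_comment" / "en_neutral_comment" are assumed column names;
    # adjust to the actual dataset schema.
    model_in = tok(batch["en_toxic_comment"], truncation=True, max_length=128)
    labels = tok(text_target=batch["en_neutral_comment"], truncation=True, max_length=128)
    model_in["labels"] = labels["input_ids"]
    return model_in

tokenized = data["train"].map(preprocess, batched=True,
                              remove_columns=data["train"].column_names)

trainer = Seq2SeqTrainer(
    model=model,
    args=Seq2SeqTrainingArguments(output_dir="bart-detox-sketch",
                                  per_device_train_batch_size=16,
                                  num_train_epochs=3,
                                  learning_rate=3e-5),
    train_dataset=tokenized,
    data_collator=DataCollatorForSeq2Seq(tok, model=model),
)
trainer.train()
```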


Datasets

We release all the main ParaDetox datasets as well as the results of the crowdsourcing tasks (a loading sketch follows the lists below):

  • The negatively marked samples from the Paraphrasing task can be used to train a classifier that detects whether a sentence has been detoxified at all;
  • The results of the Content task can be used to train a paraphrase detector between toxic and non-toxic texts and then to measure text similarity;
  • The results of the Toxicity task can be used as additional data for toxicity classification.

English

  1. EnParaDetox: 🤗 paradetox
  2. EnParaDetox: Paraphrase Task Negative Results: 🤗 en_non_detoxified
  3. EnParaDetox: Content Task Results: 🤗 en_paradetox_content
  4. EnParaDetox: Toxicity Task Results: 🤗 en_paradetox_toxicity
  5. Additionally, we filtered the ParaNMT corpus to obtain detoxified pairs: 🤗 paranmt_for_detox

Russian

  1. RuParaDetox: 🤗 ru_paradetox
  2. RuParaDetox: Paraphrase Task Negative Results: 🤗 ru_non_detoxified
  3. RuParaDetox: Content Task Results: 🤗 ru_paradetox_content
  4. RuParaDetox: Toxicity Task Results: 🤗 ru_paradetox_toxicity
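
All of the above can be pulled with the 🤗 datasets library. The Hub ids below are assumptions that follow the badge names above; use the links for the exact dataset names.

```python
# Sketch only: load the English ParaDetox pairs plus the Toxicity task results
# (usable as extra toxicity-classification data). Dataset ids are assumptions.
from datasets import load_dataset

en_paradetox = load_dataset("s-nlp/paradetox")             # toxic <-> neutral pairs
en_toxicity = load_dataset("s-nlp/en_paradetox_toxicity")  # extra toxicity labels

print(en_paradetox)
print(en_paradetox["train"][0])  # one toxic/neutral pair
```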

Models

  1. English SOTA: 🤗 bart-base-detox
  2. Russian SOTA: 🤗 ruT5-base-detox
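
Both checkpoints are standard encoder-decoder models, so inference is a plain `generate` call. The Hub ids below (under the `s-nlp` organisation) are assumptions; follow the badges above for the exact ones.

```python
# Sketch only: detoxify one English sentence with the SOTA checkpoint.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tok = AutoTokenizer.from_pretrained("s-nlp/bart-base-detox")
model = AutoModelForSeq2SeqLM.from_pretrained("s-nlp/bart-base-detox")

inputs = tok("this is scaring the s**t out of me", return_tensors="pt")
out = model.generate(**inputs, num_beams=5, max_new_tokens=64)
print(tok.decode(out[0], skip_special_tokens=True))

# The Russian model (e.g. s-nlp/ruT5-base-detox) is used the same way
# with Russian input text.
```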


Evaluation

Automatic evaluation is commonly separated into three components: (i) style transfer accuracy (STA), usually estimated with a toxicity classifier; (ii) content similarity (SIM), estimated either via cosine similarity between embeddings or as a score from a classifier; and (iii) fluency (FL), estimated either via the perplexity of a language model or as a score from a linguistic acceptability classifier. The most recent evaluation setup for each language is listed below (a scoring sketch follows the lists):

English

  1. Toxicity classifier: 🤗 roberta_toxicity_classifier
  2. Text embedder: 🤗 LaBSE
  3. Fluency classifier: 🤗 roberta-large-cola-krishna2020

Russian

  1. Toxicity classifier: 🤗 rubertconv_toxic_clf
  2. Text embedder: 🤗 fine-tuned RuBERT
  3. Fluency classifier: 🤗 ruRoBERTa-large-rucola
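
A sketch of the English setup with the three components above. Hub ids, label names, and the way scores are binarised are assumptions for illustration; the official evaluation scripts may differ.

```python
# Sketch only: STA / SIM / FL for a single (source, rewrite) pair.
from transformers import pipeline
from sentence_transformers import SentenceTransformer, util

sta_clf = pipeline("text-classification", model="s-nlp/roberta_toxicity_classifier")
fl_clf = pipeline("text-classification", model="cointegrated/roberta-large-cola-krishna2020")
embedder = SentenceTransformer("sentence-transformers/LaBSE")

def evaluate(source: str, rewrite: str) -> dict:
    # STA: the rewrite should be classified as non-toxic (label name is an assumption).
    sta = float(sta_clf(rewrite)[0]["label"].lower() == "neutral")
    # SIM: cosine similarity between LaBSE embeddings of source and rewrite.
    sim = float(util.cos_sim(embedder.encode(source), embedder.encode(rewrite)))
    # FL: confidence of the linguistic-acceptability classifier (which of its two
    # labels means "acceptable" depends on the checkpoint's config).
    fl = fl_clf(rewrite)[0]["score"]
    return {"STA": sta, "SIM": sim, "FL": fl}

print(evaluate("calm the f**k down", "Please calm your nerves"))
```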

Demos

You can check out our Telegram bot and take all your anger out on it!


Relevant GitHub Pages

  1. The first version of Russian Texts Detoxifier: simple baselines, Russian condBERT, ruGPT-3.
  2. Unsupervised English Texts Detoxification: English condBERT, ParaGedi.
  3. English ParaDetox: English ParaDetox dataset, the first supervised English detoxification SOTA bart-detox.
  4. Russian ParaDetox: RUSSE-2022 competition details, Russian ParaDetox dataset, the first supervised Russian detoxification SOTA ruT5-detox.

Papers

  1. Logacheva, V.*, Dementieva, D.*, Ustyantsev, S., Moskovskiy, D., Dale, D., Krotova, I., ... & Panchenko, A. (2022, May). ParaDetox: Detoxification with Parallel Data. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 6804-6818). (*equal contribution) TL;DR EnParaDetox and EnDetox SOTA. [paper]
  2. Dementieva, D., Logacheva, V., Nikishina, I., Fenogenova, A., Dale, D., Krotova, I., ... & Panchenko, A. RUSSE-2022: Findings of the First Russian Detoxification Shared Task Based on Parallel Corpora. TL;DR RuParaDetox and RuDetox SOTA. [paper]
  3. Dementieva, D., Moskovskiy, D., Logacheva, V., Dale, D., Kozlova, O., Semenov, N., & Panchenko, A. (2021). Methods for Detoxification of Texts for the Russian Language. Multimodal Technologies and Interaction, 5(9), 54. TL;DR The first detoxification experiments on Russian, unsupervised, condBERT for TST. [paper]
  4. Dale, D., Voronov, A., Dementieva, D., Logacheva, V., Kozlova, O., Semenov, N., & Panchenko, A. (2021, November). Text Detoxification using Large Pre-trained Neural Models. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (pp. 7979-7996). TL;DR Unsupervised English detoxification based on condBERT and ParaGedi. [paper]
  5. Logacheva, V.*, Dementieva, D.*, Krotova, I., Fenogenova, A., Nikishina, I., Shavrina, T., & Panchenko, A. (2022, May). A Study on Manual and Automatic Evaluation for Text Style Transfer: The Case of Detoxification. In Proceedings of the 2nd Workshop on Human Evaluation of NLP Systems (HumEval) (pp. 90-101). (*equal contribution) TL;DR The problems in automatic evaluation of TST tasks. [paper]
  6. Moskovskiy, D., Dementieva, D., & Panchenko, A. (2022, May). Exploring Cross-lingual Text Detoxification with Large Multilingual Language Models. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop (pp. 346-354). TL;DR Transferring detoxification knowledge between English and Russian. [paper]

Contact

Any questions, suggestions, or discussions are welcome: Daryna Dementieva (dardem96@gmail.com)

