Commit 6aa3432 (1 parent: bf44087)
add hate speech benchmarking (+doc)
ophelielacroix committed Jul 16, 2021
Showing 3 changed files with 107 additions and 0 deletions.
70 changes: 70 additions & 0 deletions docs/docs/tasks/hatespeech.md
@@ -0,0 +1,70 @@
Hate Speech Detection
=====================

Hate speech detection is a general term that covers several related tasks.
The most common is offensive language identification, which aims at detecting whether a text is offensive or not (e.g. any comment that should be moderated on a social media platform because it contains profanity or attacks an individual).
Once a text has been identified as offensive, a second step can determine whether its content is hateful.

Here are definitions of these concepts:
* offensive: contains profanity or an insult
* hateful: targets a group or an individual with the intent to be harmful or to cause social chaos


| Model | Train Data | License | Trained by | Tags | DaNLP |
|---------------|---------------------------------|-----------|---------------------|-----------|-------|
| [BERT](#bert) | [DKHate](../datasets.md#dkhate) | CC BY 4.0 | Alexandra Institut | OFF / NOT | ✔️ |


### Use cases

Hate speech detection is mostly used to support moderators of social media platforms.

## Models

### 🔧 BERT Offensive {#bert}

The offensive language identification model solves the binary classification problem of deciding whether a text is offensive or not (i.e. whether it contains profanity or an insult). Given a text, it predicts one of two classes: `OFF` (offensive) or `NOT` (not offensive).
Its architecture is based on BERT [(Devlin et al. 2019)](https://www.aclweb.org/anthology/N19-1423/).
In particular, it is based on the pretrained [Danish BERT](https://github.com/botxo/nordic_bert) trained by BotXO and finetuned on the [DKHate](../datasets.md#dkhate) data using the [Transformers](https://github.com/huggingface/transformers) library.

The BERT Offensive model can be loaded with the `load_bert_offensive_model()` method.
Please note that it can take at most 512 tokens as input at a time; longer sentences are automatically truncated.

Below is a small snippet for getting started using the BERT Offensive model.

```python
from danlp.models import load_bert_offensive_model

# load the offensive language identification model
offensive_model = load_bert_offensive_model()

sentence = "Han ejer ikke respekt for nogen eller noget... han er megaloman og psykopat"

# apply the model to the sentence to get the predicted class
pred = offensive_model.predict(sentence)
# or get the probability of each class
proba = offensive_model.predict_proba(sentence)
```
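Because of the automatic truncation, only the first 512 tokens of a text influence the prediction. If you need to classify longer documents, one option is to split them into chunks and aggregate the per-chunk predictions yourself. Below is a minimal sketch of this idea: the whitespace split is only a rough approximation of the model's subword tokenizer, and `predict_fn` is a stand-in for `offensive_model.predict` so the helper stays model-agnostic.

```python
def predict_long_text(text, predict_fn, max_tokens=512):
    """Split a long text into chunks of at most max_tokens whitespace
    tokens, run predict_fn on each chunk, and flag the whole text as
    OFF if any chunk is predicted offensive."""
    tokens = text.split()
    chunks = [" ".join(tokens[i:i + max_tokens])
              for i in range(0, len(tokens), max_tokens)] or [""]
    preds = [predict_fn(chunk) for chunk in chunks]
    return "OFF" if "OFF" in preds else "NOT"
```

Flagging on any offensive chunk is a deliberately conservative aggregation; for other use cases you might instead average `predict_proba` scores across chunks.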


## 📈 Benchmarks

See detailed scoring of the benchmarks in the [examples](<https://github.com/alexandrainst/danlp/tree/master/examples>) folder.

The benchmarks have been performed on the test part of the [DKHate](../datasets.md#dkhate) dataset.

The scores presented here describe the performance of the models for the task of offensive language identification.

| Model | OFF | NOT | AVG F1 |
|-------|------|------|--------|
| BERT | 61.9 | 95.4 | 78.7 |
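The `AVG F1` column is the macro average of the two per-class F1 scores, i.e. `OFF` and `NOT` weighted equally regardless of how frequent each class is in the test set. A quick illustration with toy labels (scikit-learn assumed; the labels are made up for the example, not taken from DKHate):

```python
from sklearn.metrics import f1_score

# Toy gold labels and predictions in the same OFF/NOT label space
y_true = ["OFF", "NOT", "NOT", "OFF", "NOT", "NOT"]
y_pred = ["OFF", "NOT", "OFF", "NOT", "NOT", "NOT"]

# F1 for each class separately, then the macro average
f1_off = f1_score(y_true, y_pred, pos_label="OFF", average="binary")
f1_not = f1_score(y_true, y_pred, pos_label="NOT", average="binary")
macro = f1_score(y_true, y_pred, average="macro")

# the macro score is the unweighted mean of the two per-class scores
print(f1_off, f1_not, macro)
```

Macro averaging matters here because the classes are imbalanced: a model that always predicts `NOT` would score well on the majority class but poorly on `OFF`, and the macro average exposes that.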


The evaluation script `hatespeech_benchmarks.py` can be found [here](https://github.com/alexandrainst/danlp/blob/master/examples/benchmarks/hatespeech_benchmarks.py).


## 🎓 References

- Marc Pàmies, Emily Öhman, Kaisla Kajava, Jörg Tiedemann. 2020. [LT@Helsinki at SemEval-2020 Task 12: Multilingual or Language-specific BERT?](https://aclanthology.org/2020.semeval-1.205/). In **SemEval-2020**


2 changes: 2 additions & 0 deletions examples/benchmarks/README.md
Expand Up @@ -26,3 +26,5 @@ For running the `sentiment_benchmarks_twitter` you need a twitter development ac
- Benchmark script of [Dependency Parsing](<https://github.com/alexandrainst/danlp/blob/master/docs/models/dependency.md>) on the [Danish Dependency Treebank](<https://github.com/alexandrainst/danlp/blob/master/docs/datasets.md#danish-dependency-treebank-dane>). SpaCy, DaCy and Stanza models are benchmarked in `dependency_benchmarks.py`

- Benchmark script of Noun-phrase Chunking -- depending on the [Dependency Parsing model](<https://github.com/alexandrainst/danlp/blob/master/docs/models/dependency.md>) -- on the [Danish Dependency Treebank](<https://github.com/alexandrainst/danlp/blob/master/docs/datasets.md#danish-dependency-treebank-dane>). The (conversion of the dependencies given by the) spaCy model is benchmarked in `chunking_benchmarks.py`

- Benchmark script for [Hate Speech Detection](<https://github.com/alexandrainst/danlp/blob/master/docs/models/hatespeech.md>) on [DKHate](<https://github.com/alexandrainst/danlp/blob/master/docs/docs/datasets.md#dkhate>). A BERT model for identification of offensive language is benchmarked in `hatespeech_benchmarks.py`
35 changes: 35 additions & 0 deletions examples/benchmarks/hatespeech_benchmarks.py
@@ -0,0 +1,35 @@
from danlp.datasets import DKHate
from danlp.models import load_bert_offensive_model
import time
from utils import print_speed_performance, f1_report

# Load the test split of the DKHate data
dkhate = DKHate()
df_test, _ = dkhate.load_with_pandas()

sentences = df_test["tweet"].tolist()
labels_true = df_test["subtask_a"].tolist()
num_sentences = len(sentences)


def benchmark_bert_mdl():
    bert_model = load_bert_offensive_model()

    start = time.time()

    # Predict the offensive class (OFF/NOT) for each sentence
    preds = []
    for sentence in sentences:
        pred = bert_model.predict(sentence)
        preds.append(pred)
    print('BERT:')
    print_speed_performance(start, num_sentences)

    assert len(preds) == num_sentences

    print(f1_report(labels_true, preds, "BERT", "DKHate"))


if __name__ == '__main__':
    benchmark_bert_mdl()
