Wikipedia Abusive Conversations (WAC) is a large corpus of wikipedia conversations annotated 3 types of abusive content (personal attack, aggression and toxicity). We developped a reconstruction pipeline to synchronize 2 existing corpora of wikipedia comments and create WAC.
The dataset is available for download on figshare:
The content of wikipedia comments is distributed under the CC-BY-SA 3.0 license. The dataset is distributed under the CC0 1.0 Universal license.