Skip to content
No description, website, or topics provided.
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
Annotated_Corpus
Guidelines.pdf First commit Sep 3, 2018
README.md

README.md

WhatsApp-Dataset

We developed this WhatsApp dataset to study cyberbullying among Italian students aged 12-13 in the context of the CREEP EIT project. The corpus of Whatsapp chats is made of 14,600 tokens divided in 10 chats. All the chats have been annotated by two annotators using the CAT web-based tool following the same guidelines. Our guidelines are an adaptation to Italian of the “Guidelines for the Fine-Grained Analysis of Cyberbullying” developed for English by the Language and Translation Technology Team of Ghent University. With respect to the original guidelines, we added a new type of insult called "Body Shame" to cover expressions that criticize someone based on the shape, size, or appearance of his/her body. We have also changed the original type "Encouragement to the Harasser" into "Encouragement to the Harassment", so to include all the incitements between the bully and his/her assistants.

Reference: Rachele Sprugnoli, Stefano Menini, Sara Tonelli, Filippo Oncini, Enrico Maria Piras. 2018. "Creating a WhatsApp Dataset to Study Pre-teen Cyberbullying". TO APPEAR in Proceedings of the 2nd Workshop on Abusive Language Online (ALW2).

You can’t perform that action at this time.