GermEval-2018 data repository
This repository contains all different types of data from the GermEval-2018 shared task.
manually labelled training and test data
The manually labelled GermEval-2018 data consists of two files:
This file represents the training data from the shared task. It contains the annotation for both tasks of the shared task.
This file represents the test data from the shared task. It contains the annotation for both tasks of the shared task.
Detailed guidelines regarding the annotation scheme of the data can be found in the following file:
lIf you publish any work using the GermEval-2018 data, please cite the following publication:
Michael Wiegand, Melanie Siegel, and Josef Ruppenhofer: "Overview of the GermEval 2018 Shared Task on the Identification of Offensive Language", in Proceedings of the GermEval, 2018, Vienna, Austria.
represents the evaluation tool. It can be used to evaluated the predictions for both tasks of the shared task.
In order to obtain further information on the usage of that tool please type:
perl evaluationScriptGermeval2018.pl --help
The complete evaluation of all runs submitted by participants can be found in the file:
The submissions of all participants for both tasks can be found in the package:
All participants had to complete an online survey as part of their submission. The results of the survey are available in the file:
More information about the shared task can be found on the website of the task: https://projects.fzai.h-da.de/iggsa/
Please direct any questions that you have about this data to the organizers of the shared task:
Josef Ruppenhofer email: email@example.com
Melanie Siegel email: firstname.lastname@example.org
Michael Wiegand email: email@example.com