[AVAILABLE from 2 APRIL 2023]
This repository contains the Multi-label Infectious Disease News Event Corpus mentioned in the following paper:
Multi-label Infectious Disease News Event Corpus. Jakub Piskorski, Nicolas Stefanovitch, Brian Doherty, Jens P. Linge, Sopho Kharazi, Jas Mantero, Guillaume Jacquet, Alessio Spadaro and Giulia Teodori. In Proceedings of Text2Story 2023: Sixth International Workshop on Narrative Extraction from Texts held in conjunction with the 45th European Conference on Information Retrieval, Dublin, Ireland, 2023.
Please cite using this reference: https://github.com/jpiskorski/infectious-diseases-events/blob/main/reference.bib
The archive contains three files:
- infectious_diseases_finegrained_grained.txt
The text snippets labelled with fine-grained event types. The text snippets and the labels are separated by tabs.
- infectious_diseases_coarse_grained.txt
The text snippets labelled with coarse-grained event types. The text snippets and the labels are separated by tabs.
- Annotation_guidelines.pdf
The draft version of the annotation guidelines used by the annotators. A new complete version will be released soon.
NOTE: new updated version 1.1 available since 25 May 2023 (small fixes related to inconsistent labels and removing some redundant entries)