Skip to content
Materials for the StoryLine extraction task - annotated data, baselines and evaluation scripts, evaluation data.
Python
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
ECB+_LREC2014
annotated_data
evaluation_format
.DS_Store
LICENSE.md
README.md
baseline_OP.py
baseline_PPMI1.py
baseline_PPMI_CONTAINS.py add scripts May 27, 2017
create_gold_document.py
eval_script.py

README.md

EventStoryLine

This repository contains the following materials associated with the StoryLine extraction Task:

  • annotated data in CAT-XML format (folder: annotated_data). To visualise the data, you have to use CAT (Content Annotation Tool: http://dh.fbk.eu/resources/cat-content-annotation-tool). Ask for a n account, it's free.
  • annotated data in evaluation format, extending PLOT_LINK relations to include coreference relations (folder: evaluation_format)
  • test data (folder: evaluation_format/test)
  • Python3.* scripts for creating the evaluation format of the data, extracting baselines systems, evaluating baselines'output

The corpus is still growing. Different versions will be made available in this repository as soon as they are ready. Reference papers:

Caselli, T. and P. Vossen. 2016. The Storyline Annotation and Representation Scheme (StaR): A Proposal. In Proceedings of the 2nd Workshop on Computing News Storylines (CNS 2016). Held in conjunction with EMNLP 2016 Caselli, T. and P. Vossen. 2017. The Event StoryLine Corpus: A New Benchmark for Causal and Temporal Relation Extraction. In Proceedings of the Events and Stories in the News (EventStory 2017). Held in conjunction with ACL 2017

Experiments reported in Caselli and Vossen 2017 use version 0.9 of the corpus.

Version 1.0 is available.

You can’t perform that action at this time.