Name: "WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation"
Description: |
  This dataset provides how-to articles from [wikihow.com](https://www.wikihow.com) and their summaries,
  each written as a coherent paragraph.
  The dataset itself is available at [wikisum.zip](https://wikisum.s3.amazonaws.com/WikiSumDataset.zip).
  It contains the article, the summary, the wikiHow URL, and an official fold (train, val, or test).
  In addition, human evaluation results are available at
  [wikisum-human-eval.zip](https://wikisum.s3.amazonaws.com/HumanEvaluation.zip).
  They consist of human evaluations of the summaries produced by the Pegasus system, the annotators' responses
  regarding the difficulty of the task, and the words they marked as unknown.
Documentation: https://wikisum.s3.amazonaws.com/README.txt
Contact: nachshon@amazon.com, orenk@amazon.com
ManagedBy: "[Amazon](https://www.amazon.com/)"
UpdateFrequency: Not currently being updated
Tags:
  - amazon.science
  - natural language processing
  - machine learning
License: |
  The dataset is published under [CC BY-NC-SA 3.0](https://creativecommons.org/licenses/by-nc-sa/3.0/).
  The human evaluation is published under [CC BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0/).
Resources:
  - Description: WikiSum Dataset
    ARN: arn:aws:s3:::wikisum
    Region: us-east-1
    Type: S3 Bucket
    Explore:
      - "[wikisum.zip](https://wikisum.s3.amazonaws.com/WikiSumDataset.zip)"
      - "[wikisum-human-eval.zip](https://wikisum.s3.amazonaws.com/HumanEvaluation.zip)"
DataAtWork:
  Publications:
    - Title: "WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation"
      URL: https://2021.aclweb.org/
      AuthorName: Nachshon Cohen, Oren Kalinsky, Yftah Ziser & Alessandro Moschitti