What Have We Achieved on Text Summarization?

This is the official repository for the EMNLP 2020 paper What Have We Achieved on Text Summarization? (Huang et al., 2020). This README covers:

  1. Motivation
  2. Released Data
  3. Evaluation Tool
  4. Bibtex

Motivation

Over the past decades, automatic text summarization has developed greatly, moving from traditional statistical approaches to today's neural architectures. We focus on two questions in this domain: 1) overall, what have we achieved on text summarization? 2) what fundamental changes has each milestone technique brought to summarization systems?

We conduct an empirical analysis of 10 representative summarization models using PolyTope, an error-guided and fine-grained evaluation framework. These 10 models cover pre-neural and neural methods, extractive and abstractive methods, and milestone techniques such as copy, coverage, pre-training, and hybrid architectures.

The main goal of this work is to investigate the differences between summarization systems. Nonetheless, our dataset also provides a test bed for meta-evaluating commonly used evaluation metrics such as ROUGE, Pyramid, and Ranking. We therefore report a quantitative comparison between ROUGE and PolyTope, and a qualitative comparison between PolyTope and other human evaluation metrics, to demonstrate why PolyTope suits our research goal.
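
For reference, the automatic ROUGE scores that we contrast with PolyTope can be computed with the rouge-score package. This is a minimal sketch, assuming rouge-score is installed (pip install rouge-score) and using made-up example strings; it illustrates the automatic baseline metric only and is not part of the PolyTope toolkit.

from rouge_score import rouge_scorer

# Score one (reference, system summary) pair with ROUGE-1/2/L.
scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)

reference = "the cat sat on the mat ."        # gold summary (illustrative)
candidate = "a cat was sitting on the mat ."  # system summary (illustrative)

scores = scorer.score(reference, candidate)   # signature: score(target, prediction)
for name, s in scores.items():
    print(f"{name}: P={s.precision:.3f} R={s.recall:.3f} F={s.fmeasure:.3f}")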

Released Data

We release the outputs of 10 systems. For each system, 150 samples from the non-anonymized CNN/DM dataset are provided for human evaluation.
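
The exact file layout depends on the release; below is a minimal loading sketch under a hypothetical layout of outputs/<system_name>/<doc_id>.txt with one summary per file. Adjust the paths to match the files actually shipped in this repository.

from pathlib import Path

# Hypothetical layout (assumption, not the confirmed release format):
#   outputs/<system_name>/<doc_id>.txt
DATA_DIR = Path("outputs")

def load_system_outputs(system_name):
    """Read one system's released summaries, keyed by document id."""
    system_dir = DATA_DIR / system_name
    return {p.stem: p.read_text(encoding="utf-8").strip()
            for p in sorted(system_dir.glob("*.txt"))}

# Collect every system's 150 summaries for side-by-side comparison.
all_outputs = {d.name: load_system_outputs(d.name)
               for d in sorted(DATA_DIR.iterdir()) if d.is_dir()}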

Evaluation Tool

Bibtex

@inproceedings{huang2020have,
  title={What Have We Achieved on Text Summarization?},
  author={Huang, Dandan and Cui, Leyang and Yang, Sen and Bao, Guangsheng and Wang, Kun and Xie, Jun and Zhang, Yue},
  booktitle={Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
  pages={446--469},
  year={2020}
}
