Skip to content
An Exhaustive Paper List for Text Summarization
Branch: master
Clone or download

Latest commit

Latest commit 41bf535 May 2, 2020

Files

Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
crowdsource update May 2, 2020
fig upload a gif Apr 17, 2020
README.md upload a gif Apr 17, 2020
summ_concept.md update May 2, 2020

README.md

Text Summarization Papers

by Pengfei Liu, Yiran Chen, Jinlan Fu, Hiroaki Hayashi, Danqing Wang and other contributors.

An exhaustive paper list for Text Summarization, covering papers from eight top conferences (ACL / EMNLP / NAACL / ICML / ICLR / AAAI / IJCAI / NeurIPS) in the last eight years (2013-2020).

What can I get here?

1. Paper Retrieval System [Click Me !!!] 🔽

  • Find the top-cited summarization papers! [The latest update on: 02.25/2020]
  • Track the latest summarization papers!
  • Find the milestone summarization papers for beginners.
  • Search papers by research concepts or your interested keywords.

2. What are the recent Research Concepts and which are HOT?

We first define the typology of essential concepts for the summarization task. We then plot the number of papers for each concept below.

before 2019 denotes the number of papers before 2019.

from 2019 denotes the number of papers since 2019.

Summary

Trends in 2019:

HOT Concepts in red suggest HOT topics, and we can observe:

  • HOTTask: Scientific paper-based summarization has gain growing interests.
  • HOTData: More new datasets are constructed.
  • HOTArchitecture: Pretrained models and graph neural networks prevail.
  • HOTEvaluation: Evaluation of the generated summary's factuality attracts recent attention.
Hot topic: when the proportion of papers on a concept since 2019 is greater than a certain threshold (0.4), we define this concept as a hot topic.

3. Recommended Papers

Papers with Hot Topics

  • pre-X: summarizer with unsupervised pretrained models.
  • task-sci: scientifc paper-based summarization.
  • eval-factuality: factuality evaluation on generated summaries.
  • arch-gnn: graph neural network-based summarizers.
  • data-new: more new datasets are constructed.

Milestone Papers

4. Mainstream Dataset List 🔽

What can I do here?

  • If you have a new "research concept" -- Tell us

    • Update the file summ_concept.md and send us a Pull request.
    • Or you could open an Issue.
  • If you have a new "paper" or want to modify our inaccurate annotations of concepts:

    • Update your paper into the file summ_paper.crowdsource and send us a Pull request.
    • Or you could open an Issue.
  • If you have a new "dataset" or want to modify our inaccurate annotations:

    • Add your dataset (If possible, with a brief description) into summ_data.crowdsource and send us a Pull request.
    • Or you could open an Issue.

Related Work

Future Work

Hopefully, you will see our version-2.0 covering papers from 1980 to 2020.

Acknowledgments

  • Thanks Prof. Graham Neubig's idea on the "concept" and other comments.
  • Thanks Prof. Jackie C. K. Cheung's useful idea about the "old" papers.
  • Thanks Prof. Fei Liu for providing us with a bunch of interesting work and description, which enriches our concept file.
  • Thanks Peter J. Liu a lot for the crowdsourcing idea of the paper and dataset annotations. Feel free to correct our wrong annotations by updating summ_paper.crowdsource and summ_data.crowdsource.
  • Thanks Prof. Mohit Bansal's feedback about this summary.
  • Thanks for Richard Socher's invitation to giving a talk in salesforce and talking more about this project.
You can’t perform that action at this time.