HipoRank

Unsupervised, extractive summarization of long documents using Hierarchical and Positional information. Contains the code for "Discourse-Aware Unsupervised Summarization of Long Scientific Documents", accepted at EACL 2021.

Requirements

pip install -r requirements.txt

pyrouge_set_rouge_path /absolute/path/to/ROUGE-1.5.5/directory

A library for defining a summarization pipeline as modular components with standard interfaces, for ease of experimentation:

hipo_rank/dataset_iterators yield document sections and sentences in standard typed format;
hipo_rank/embedders generate vector representations for sentences and sections;
hipo_rank/similarities compute similarities to weigh edges in document graph;
hipo_rank/directions introduce directionality in document graph based on discourse structure;
hipo_rank/scorers compute node centrality for sentences in document graph;
hipo_rank/summarizers generate a summary from a document graph with node centrality scores;
hipo_rank/evaluators evaluate summary with different metrics;
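
As a rough illustration of how the stages listed above fit together, here is a minimal, self-contained sketch in plain Python. It does not use the hipo_rank API: every function name, the bag-of-words embedding, and the boundary-based edge weights (`lam_closer`, `lam_farther`) are assumptions chosen for brevity, standing in for the embedder, similarity, direction, scorer, and summarizer components.

```python
import math
from typing import List


def embed(sentence: str, vocab: List[str]) -> List[float]:
    """Toy bag-of-words embedding (hypothetical stand-in for an embedder)."""
    tokens = sentence.lower().split()
    return [float(tokens.count(word)) for word in vocab]


def cosine(u: List[float], v: List[float]) -> float:
    """Cosine similarity, used here as the edge weight between two sentences."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def boundary_distance(index: int, n: int) -> int:
    """Distance of a sentence from the nearest end of its section."""
    return min(index, n - 1 - index)


def score_sentences(
    sentences: List[str], lam_closer: float = 1.0, lam_farther: float = 0.5
) -> List[float]:
    """Directed degree centrality with a simple positional bias (assumed scheme)."""
    vocab = sorted({w for s in sentences for w in s.lower().split()})
    vectors = [embed(s, vocab) for s in sentences]
    n = len(sentences)
    scores = []
    for i in range(n):
        total = 0.0
        for j in range(n):
            if i == j:
                continue
            # Edges count fully when sentence i is at least as close to a
            # section boundary as sentence j, and are down-weighted otherwise.
            closer = boundary_distance(i, n) <= boundary_distance(j, n)
            weight = lam_closer if closer else lam_farther
            total += weight * cosine(vectors[i], vectors[j])
        scores.append(total)
    return scores


def summarize(sentences: List[str], k: int = 2) -> List[str]:
    """Pick the k highest-scoring sentences and return them in document order."""
    scores = score_sentences(sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling `summarize(sentences, k=2)` on a list of sentences from one section returns two of them in their original order. Keeping each stage as a separate function mirrors the modular layout described above, so any single step can be swapped without touching the others.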

human_eval_sample.ipynb Code to sample examples for human evaluation;
human_eval_samples.jsonl Sampled examples for human evaluation;
human_eval_data.jsonl Results of human evaluation from Prodigy.ai annotation tool;
human_eval_results.ipynb Code to generate human evaluation metrics;

plot_ablation.ipynb Code to plot results of ablation study;
plot_sentence_positions.ipynb Code to plot sentence positions in original document;
