Extractive Summarization with Discourse Tree Attention

This is the official code for the paper 'Do We Really Need That Many Parameters In Transformer For Extractive Summarization? Discourse Can Help!' (CODI at EMNLP 2020).

Prepare

You need the following (a sample install command is shown after the list):

  • Python 3
  • PyTorch
  • pandas
  • NumPy
  • rouge_papier_v2 can be found here
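
Assuming the standard PyPI package names for PyTorch, pandas, and NumPy (rouge_papier_v2 is installed separately from its own repository), the dependencies can be set up roughly as:

pip install torch pandas numpy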

Data and Trained Models

The CNNDM dataset with generated attention maps (C-tree w/ Nuc) can be found here. It is based on the dataset from DiscoBERT with segmented EDUs.

The trained model with discourse tree attention can be found here.

Discourse Parser

We use the state-of-the-art Discourse Parser.

How to train the model

Run

python main.py 

with the following arguments (an example invocation is shown after the list):

  1. -bert_dir indicates where the pretrained BERT model is stored
  2. -d_v, -d_k, -d_inner, -d_mlp, -n_layers, -n_head, -dropout are the parameters of the Transformer-based document encoder
  3. -lr, -warmup_steps are the parameters for the Adam optimizer
  4. -inputs_dir, -val_inputs_dir are the paths to the training and validation data
  5. -unit, -unit_length_limit, -word_length_limit indicate whether to use sentences or EDUs as the basic unit, and the length limits of the generated summaries
  6. -batch_size indicates the number of instances per batch
  7. -attention_type is chosen from 'tree', 'dense', 'fixed_rand', 'learned_rand', 'none' and 'self-attention'
  8. -device indicates which GPU device to use
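
For example, a training run using EDUs as the basic unit and discourse tree attention might be launched as follows; all paths and values here are illustrative placeholders, not the settings reported in the paper:

python main.py -bert_dir ./bert -inputs_dir ./data/train -val_inputs_dir ./data/val -unit edu -attention_type tree -batch_size 32 -device 0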

How to evaluate the model

Run

python test.py

with the following arguments (an example invocation is shown after the list):

  1. -model_path, -model_name indicate the folder and name of the saved model; the model to evaluate is 'model_path/model_name'
  2. -test_inputs_dir indicates the path to the test data
  3. -device indicates which GPU device to use
  4. -d_v, -d_k, -d_inner, -d_mlp, -n_layers, -n_head, -dropout are the parameters of the Transformer-based document encoder
  5. -unit, -unit_length_limit, -word_length_limit indicate whether to use sentences or EDUs as the basic unit, and the length limits of the generated summaries
  6. -attention_type is chosen from 'tree', 'dense', 'fixed_rand', 'learned_rand', 'none' and 'self-attention'
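
For example, evaluating a saved checkpoint could look like the following; the folder, file name, and option values are placeholders and should match the ones used at training time:

python test.py -model_path ./models -model_name model_tree.pt -test_inputs_dir ./data/test -unit edu -attention_type tree -device 0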
