Code for "BottleSum: Self-Supervised and Unsupervised Sentence Summarization using the Information Bottleneck Principle"
Requirements: pytorch 0.4, pytorch-pretrained-bert, sacremoses, nltk, tqdm
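The dependencies above can be installed with pip. Only pytorch 0.4 has a version constraint in this README; the 0.4.1 pin below is an assumption, and the remaining packages are left unpinned:

```shell
# Install the dependencies listed above. The torch pin (0.4.1) is an
# assumption based on the "pytorch 0.4" requirement; other packages
# are unpinned, so pin them yourself for exact reproducibility.
pip install torch==0.4.1 pytorch-pretrained-bert sacremoses nltk tqdm
```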
If your main goal is to compare model outputs, we include example outputs from the DUC datasets and CNN/DailyMail in the outputs directory.
For CNN/DailyMail, we also include the model inputs. For the DUC datasets, the order of outputs corresponds to the order of the DUC data (which we do not make available).
Temporary code instructions:
Note: We are still working on improving this section.
To run BottleSum^Ex with the same settings as in the paper:
python bottleEx_summarize.py -S1_path <S1FILE> -S2_path <S2FILE> -rem_words 3 -out_name <OUTNAME>
Here, <S1FILE> and <S2FILE> should each contain one sentence per line: source sentences (s1) in the first file and the corresponding next sentences (s2) in the second. See the code for more options.
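As a sketch of the expected input format (one sentence per line, with the two files aligned line-by-line), the following writes a pair of toy input files. The filenames s1.txt and s2.txt and the example sentences are made up for illustration:

```python
# Sketch: write aligned s1/s2 input files for bottleEx_summarize.py.
# Assumes the format described above: line i of s1.txt is a source
# sentence and line i of s2.txt is the sentence that follows it.
pairs = [
    ("The committee approved the new budget after a long debate.",
     "Funding will increase for schools and public transit."),
    ("Scientists observed the comet for three consecutive nights.",
     "Its tail grew noticeably brighter each evening."),
]

with open("s1.txt", "w") as f1, open("s2.txt", "w") as f2:
    for s1, s2 in pairs:
        f1.write(s1 + "\n")
        f2.write(s2 + "\n")
```

The two files could then be passed as -S1_path s1.txt -S2_path s2.txt in the command above.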
To generate data for training BottleSum^Self, use the above command (these are the same settings used in the paper).
We are currently producing more detailed instructions for training BottleSum^Self. If you are planning to use BottleSum^Self, please contact us for instructions, as this process is more involved.