🌟 CTRLsum #9001

astariul · 2020-12-09T08:12:29Z

🌟 New model addition

Model description

Current summarization systems yield generic summaries that are disconnected from users’ preferences and expectations. To address this limitation, we present CTRLsum, a novel framework for controllable summarization.

Our approach enables users to control multiple aspects of generated summaries by interacting with the summarization system through textual input in the form of a set of keywords or descriptive prompts.
Using a single unified model, CTRLsum is able to achieve a broad scope of summary manipulation at inference time without requiring additional human annotations or pre-defining a set of control aspects during training.
We quantitatively demonstrate the effectiveness of our approach on three domains of summarization datasets and five control aspects:

entity-centric summarization

length-controllable summarization

contribution summarization on scientific papers

invention purpose summarization on patent filings

question-guided summarization on news articles in a reading comprehension setting

Moreover, when used in a standard, uncontrolled summarization setting, CTRLsum achieves state-of-the-art results on the CNN/DailyMail dataset.

Open source status

the model implementation is available: https://github.com/salesforce/ctrl-sum
the model weights are available: Download link available in the README of the repo
who are the authors: @jxhe @muggin

hyunwoongko · 2021-03-21T16:18:15Z

I ported this model for easy use in Hugging Face Transformers. Try using the code below!

1. Create models and tokenizers

>>> from transformers import AutoModelForSeq2SeqLM, PreTrainedTokenizerFast

>>> model = AutoModelForSeq2SeqLM.from_pretrained("hyunwoongko/ctrlsum-cnndm")
>>> # model = AutoModelForSeq2SeqLM.from_pretrained("hyunwoongko/ctrlsum-arxiv")
>>> # model = AutoModelForSeq2SeqLM.from_pretrained("hyunwoongko/ctrlsum-bigpatent")

>>> tokenizer = PreTrainedTokenizerFast.from_pretrained("hyunwoongko/ctrlsum-cnndm")
>>> # tokenizer = PreTrainedTokenizerFast.from_pretrained("hyunwoongko/ctrlsum-arxiv")
>>> # tokenizer = PreTrainedTokenizerFast.from_pretrained("hyunwoongko/ctrlsum-bigpatent")

2. Unconditioned summarization

>>> data = tokenizer("My name is Kevin. I love dogs. I loved dogs from 1996. Today, I'm going to walk on street with my dogs", return_tensors="pt")
>>> input_ids, attention_mask = data["input_ids"], data["attention_mask"]
>>> tokenizer.batch_decode(model.generate(input_ids, attention_mask=attention_mask, num_beams=5))[0]
'</s>My name is Kevin. I loved dogs from 1996.</s>'
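The decoded output above still contains the `</s>` markers. A minimal sketch of a wrapper that repeats the same calls and drops special tokens is shown below. The function name `summarize` is hypothetical and not part of the port; `skip_special_tokens=True` is a standard argument of `batch_decode` in Transformers, and `model` and `tokenizer` are assumed to be the objects created in step 1.

```python
def summarize(model, tokenizer, text, num_beams=5):
    """Generate a summary for `text` and return it without special tokens.

    `summarize` is an illustrative helper, not part of the ported model.
    """
    data = tokenizer(text, return_tensors="pt")
    output_ids = model.generate(
        data["input_ids"],
        attention_mask=data["attention_mask"],
        num_beams=num_beams,
    )
    # skip_special_tokens=True removes markers such as <s> and </s>.
    decoded = tokenizer.batch_decode(output_ids, skip_special_tokens=True)
    return decoded[0].strip()
```

Calling `summarize(model, tokenizer, "My name is Kevin. I love dogs. ...")` should return the same summary as above, minus the surrounding markers.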

3. Conditioned summarization

You can supply control tokens (keywords) using the TOKEN => CONTENTS structure.

>>> data = tokenizer("today plan => My name is Kevin. I love dogs. I loved dogs from 1996. Today, I'm going to walk on street with my dogs", return_tensors="pt")
>>> input_ids, attention_mask = data["input_ids"], data["attention_mask"]
>>> tokenizer.batch_decode(model.generate(input_ids, attention_mask=attention_mask, num_beams=5))[0]
"</s> Today, I'm going to walk on street with my dogs. I loved dogs from 1996</s>"
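The `TOKEN => CONTENTS` string can also be assembled programmatically. The helper below is a sketch under assumptions: the name `build_control_input` is hypothetical, and joining keywords with a single space simply mirrors the `"today plan => ..."` example above rather than any documented format.

```python
def build_control_input(keywords, source, separator=" => "):
    """Return 'KEYWORDS => SOURCE', the input layout used in the example above.

    `build_control_input` is an illustrative helper, not part of the port.
    """
    if isinstance(keywords, str):
        keywords = [keywords]
    # Assumption: keywords are joined with spaces, as in "today plan".
    control = " ".join(k.strip() for k in keywords if k.strip())
    # Without keywords, fall back to the plain source (unconditioned case).
    return f"{control}{separator}{source}" if control else source
```

For instance, `build_control_input(["today", "plan"], text)` produces the same string that is passed to the tokenizer in the example above.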

4. Prompt summarization

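You can also pass decoder_input_ids to supply a prompt that the generated output continues. In the example, [:, :-1] drops the last token of the tokenized prompt (presumably the end-of-sequence token) so that generation picks up right after the prompt.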

>>> data = tokenizer("Q:What is my name? A: => My name is Kevin. I love dogs. I loved dogs from 1996. Today, I'm going to walk on street with my dogs", return_tensors="pt")
>>> input_ids, attention_mask = data["input_ids"], data["attention_mask"]
>>> tokenizer.batch_decode(model.generate(input_ids, attention_mask=attention_mask, num_beams=5, decoder_input_ids=tokenizer("Q:What is My name? A:", return_tensors="pt")["input_ids"][:, :-1]))[0]
'<s>Q:What is My name? A: Kevin.</s>'
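The question-guided pattern above uses the same prompt text twice: once before `=>` in the encoder input and once as the decoder prompt. A small sketch that builds both strings from one question is shown below; `build_qa_inputs` is a hypothetical name, and the `Q:... A:` layout is copied from the example rather than taken from any official specification.

```python
def build_qa_inputs(question, source):
    """Return (encoder_text, decoder_prompt) for question-guided summarization.

    `build_qa_inputs` is an illustrative helper, not part of the port.
    """
    # Prompt layout copied from the example above: "Q:<question> A:".
    decoder_prompt = f"Q:{question} A:"
    # The encoder sees the prompt as the control part of "TOKEN => CONTENTS".
    encoder_text = f"{decoder_prompt} => {source}"
    return encoder_text, decoder_prompt
```

Tokenize `encoder_text` for `input_ids` and `attention_mask`, and tokenize `decoder_prompt` for `decoder_input_ids`, slicing off the last token with `[:, :-1]` as in the example.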

astariul added the New model label Dec 9, 2020

astariul changed the title ~~CTRLsum~~ 🌟 CTRLsum Dec 14, 2020
