Hi,
First of all, really thanks for such a great project.
When I tried your inference code on Hugging Face, I found that when the input text is short, for example just "weather is fine", the model generates a longer, unrelated summary.
I am totally new to summarization, so is there anything I should pay attention to in such a case? For example, should the input_ids be generated differently?
I would really appreciate your guidance. Thanks!
Hi, thanks for appreciating our work. Since XL-Sum is a news article-summary dataset, models pretrained on it are expected to generate news-like summaries even when the input is not from the news domain (e.g., paper abstracts, or in your case, non-article inputs). If you want to generate summaries for non-news inputs, we recommend further fine-tuning the model checkpoint on such data. If you just want to reduce the output length, adjusting the length penalty may suit your use case. Hope that helps!
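For context on how the length penalty works: in Hugging Face's beam search, each finished hypothesis is ranked by its summed token log-probabilities divided by its length raised to `length_penalty`. A minimal sketch of that normalization (the helper name and the example numbers are illustrative, not taken from the XL-Sum code):

```python
def normalized_score(sum_logprobs: float, length: int, length_penalty: float) -> float:
    # Hugging Face-style length normalization for beam search:
    # finished hypotheses are ranked by their total token log-probability
    # divided by (hypothesis length) ** length_penalty.
    return sum_logprobs / (length ** length_penalty)

# Two hypothetical finished beams: a short one, and a longer one with a
# worse total log-probability but more tokens to spread it over.
short_beam = normalized_score(-4.0, length=5, length_penalty=1.0)
long_beam = normalized_score(-6.0, length=15, length_penalty=1.0)
print(short_beam, long_beam)
# Because log-probabilities are negative, raising length_penalty makes the
# denominator grow faster for longer hypotheses and shifts which beam is
# preferred; experiment with the value to steer output length.
```

In practice the knob is passed straight to generation, e.g. `model.generate(input_ids, length_penalty=..., max_length=...)` in the `transformers` API; try a few values on your inputs and compare the summary lengths.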