Headline Token based Discriminative Learning for Subheading Generation in News Article

This is the PyTorch implementation of "Headline Token based Discriminative Learning for Subheading Generation in News Article".

Overview

A news subheading summarizes an article's contents in several sentences, supporting the headline, which is limited to conveying only the main point. It is therefore necessary to generate compelling news subheadings that take the structural characteristics of news into account. In this paper, we propose a subheading generation model that uses topical headline information. We introduce a discriminative learning method that utilizes the prediction results of masked headline tokens. Experiments show that the proposed model is effective and outperforms the comparison models on three news datasets written in two languages. We also show that our model performs robustly on a small dataset and across various masking ratios. Qualitative analysis and human evaluation also show that the overall quality of the generated subheadings improves over the comparison models.
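To make the idea of learning from masked headline tokens more concrete, here is a minimal sketch in plain Python. It assumes an ELECTRA-style replaced-token-detection setup similar to the one used in DiffCSE: some headline tokens are masked, a masked language model fills them in, and a discriminator is trained to tell which filled tokens differ from the original. The function names, the `[MASK]` string, and the stand-in predictor are all hypothetical and are not taken from this repository, whose actual implementation may differ.

```python
import random

MASK = "[MASK]"  # hypothetical mask symbol, for illustration only


def mask_headline(tokens, ratio, rng):
    """Replace a random subset of headline tokens with MASK.

    Returns the masked token list and the sorted masked positions.
    """
    n_mask = min(len(tokens), max(1, round(len(tokens) * ratio)))
    positions = sorted(rng.sample(range(len(tokens)), n_mask))
    masked = list(tokens)
    for i in positions:
        masked[i] = MASK
    return masked, positions


def discriminator_labels(original, masked, predictions):
    """Fill masked slots with predicted tokens and build 0/1 targets.

    A label of 1 means the filled token differs from the original token,
    0 means it matches (or the position was never masked).
    """
    filled = [predictions.get(i, tok) for i, tok in enumerate(masked)]
    labels = [int(f != o) for f, o in zip(filled, original)]
    return filled, labels


headline = ["central", "bank", "raises", "interest", "rates"]
masked, positions = mask_headline(headline, ratio=0.4, rng=random.Random(0))

# Stand-in for a masked language model (assumption): always guesses "market".
predictions = {i: "market" for i in positions}

filled, labels = discriminator_labels(headline, masked, predictions)
print(masked)
print(filled)
print(labels)
```

In a real model the predictions would come from a generator network and the labels would supervise a discriminator head; the `ratio` argument plays the role of the masking ratio mentioned above.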

Our code is based on the DiffCSE code. Please refer to their repository for more detailed information.

Pre-requisite

konlpy=0.6.0
kss=3.4.2
matplotlib=3.5.1
pandas=1.4.1
pytorch-lightning=1.2.4
scikit-learn=1.1.0
seaborn=0.11.2
torch=1.7.1
tqdm=4.64.0
transformers=4.3.3
wandb=0.12.16
nltk=3.4.5
datasets=2.2.1
bert-score=0.3.11
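Because the versions above are pinned, it can help to confirm that the active environment matches them before training. The following is a minimal sketch using only the standard library (`importlib.metadata`, Python 3.8+). The helper names `parse_pins` and `check_pins` are hypothetical, and only a few of the pins are copied in for brevity.

```python
from importlib import metadata

# A few pins copied from the list above; extend as needed.
PINNED = """\
torch=1.7.1
transformers=4.3.3
pytorch-lightning=1.2.4
"""


def parse_pins(text):
    """Parse 'name=version' (or 'name==version') lines into a dict."""
    pins = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        name, _, version = line.partition("=")
        pins[name.strip()] = version.lstrip("=").strip()
    return pins


def check_pins(pins):
    """Return {package: (wanted, installed or None)} for every mismatch."""
    mismatches = {}
    for name, wanted in pins.items():
        try:
            # Drop a local build suffix such as '+cu110' before comparing.
            installed = metadata.version(name).split("+")[0]
        except metadata.PackageNotFoundError:
            installed = None
        if installed != wanted:
            mismatches[name] = (wanted, installed)
    return mismatches


pins = parse_pins(PINNED)
for name, (wanted, installed) in check_pins(pins).items():
    print(f"{name}: wanted {wanted}, found {installed}")
```

The script prints nothing when every listed package is installed at the pinned version.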

DATASET

You can download the YonhapNews data from the following links:

Train data: train
Valid data: valid
Test data: test

IMPLEMENTATION

sh train_main.sh

