Skip to content

Latest commit

 

History

History
18 lines (11 loc) · 1.19 KB

README.md

File metadata and controls

18 lines (11 loc) · 1.19 KB

Identifying Automatically Generated Headlines using Transformers

Code for NLP4IF 2021 workshop paper.

How to replicate results?

Step 1: Download Data.

Step 2: Pretrain two LMs using the HuggingFace library. Pretrain one LM on the "attack" data and another on the "defense". Sample bash script at gpt2_pretraining.sh.

Step 3: Generate headlines for attack and defense (if you haven't downloaded the ones from Step 1).

Step 4: Merge the real/generated headlines for attack and defense.

Step 5: Finetune your classifiers (using the example scripts provided here, ie. *_classify.py).