CrossAug

This is the code for the CIKM 2021 paper CrossAug: A Contrastive Data Augmentation Method for Debiasing Fact Verification Models.

In this work, we propose a data augmentation method for debiasing fact verification models by generating contrastive samples.

Setup

Install dependencies

Our code is based on Python 3.7, and experiments are run on a single GPU.

pip install -r requirements.txt

Download the data

Download the FEVER [1], FEVER Symmetric [2], Adversarial FEVER [3], and Fool Me Twice [4] datasets using the bash script below:

./download_data.sh

You can either download the FEVER train set augmented with CrossAug here, or generate the augmented data yourself as described in the next section.

Data Augmentation

Augment FEVER train dataset with CrossAug

python run_crossaug.py \
  --in_file fever_data/fever.train.jsonl \
  --out_file fever_data/fever+crossaug.train.jsonl
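
As a quick sanity check on the augmented file (a minimal sketch; the exact fields depend on the FEVER JSONL schema, so it only inspects record keys rather than assuming field names):

import json

# Paths taken from the augmentation command above
original_path = 'fever_data/fever.train.jsonl'
augmented_path = 'fever_data/fever+crossaug.train.jsonl'

def load_jsonl(path):
    with open(path) as f:
        return [json.loads(line) for line in f]

original = load_jsonl(original_path)
augmented = load_jsonl(augmented_path)

# The augmented set should be larger than the original, since CrossAug adds contrastive samples
print(len(original), len(augmented))
# Inspect the schema of one record instead of assuming field names
print(sorted(augmented[0].keys()))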

Use the fine-tuned negative claim generation model

We have uploaded the negative claim generation model, fine-tuned on the WikiFactCheck-English [5] dataset, to the Hugging Face model hub. Example code for using the fine-tuned model is provided below:

import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_name = 'minwhoo/bart-base-negative-claim-generation'
tokenizer = AutoTokenizer.from_pretrained(model_name)

model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
model.to('cuda' if torch.cuda.is_available() else 'cpu')

examples = [
    "Little Miss Sunshine was filmed over 30 days.",
    "Magic Johnson did not play for the Lakers.",
    "Claire Danes is wedded to an actor from England."
]

# Tokenize the claims as a single padded batch
batch = tokenizer(examples, max_length=1024, padding=True, truncation=True, return_tensors="pt")
# Generate negative (refuting) claims with beam search
out = model.generate(batch['input_ids'].to(model.device), num_beams=5)
negative_examples = tokenizer.batch_decode(out, skip_special_tokens=True)
print(negative_examples)
# ['Little Miss Sunshine was filmed less than 3 days.', 'Magic Johnson played for the Lakers.', 'Claire Danes is married to an actor from France.']

Train and test the model

For training and evaluation, we slightly modified the code from this repo, which was in turn adapted from an older version of the Hugging Face transformers library. A sketch for loading the resulting checkpoint follows the commands below.

  • Train with the CrossAug-augmented dataset and evaluate on the fact verification dev sets
TRAIN_SEED=177697310
python run_fever.py \
    --task_name fever \
    --do_train \
    --train_task_name fever+crossaug \
    --do_eval \
    --eval_task_names fever symmetric adversarial fm2 \
    --data_dir ./fever_data/ \
    --do_lower_case \
    --model_type bert \
    --model_name_or_path bert-base-uncased \
    --max_seq_length 128 \
    --per_gpu_train_batch_size 32 \
    --learning_rate 2e-5 \
    --num_train_epochs 3.0 \
    --save_steps 100000 \
    --output_dir ./crossaug_trained_models_seed=$TRAIN_SEED/ \
    --output_preds \
    --seed $TRAIN_SEED
  • Train baseline (no augmentation)
TRAIN_SEED=177697310
python run_fever.py \
    --task_name fever \
    --do_train \
    --train_task_name fever \
    --do_eval \
    --eval_task_names fever symmetric adversarial fm2 \
    --data_dir ./fever_data/ \
    --do_lower_case \
    --model_type bert \
    --model_name_or_path bert-base-uncased \
    --max_seq_length 128 \
    --per_gpu_train_batch_size 32 \
    --learning_rate 2e-5 \
    --num_train_epochs 3.0 \
    --save_steps 100000 \
    --output_dir ./baseline_trained_models_seed=$TRAIN_SEED/ \
    --output_preds \
    --seed $TRAIN_SEED
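
After training, the output directory should contain the fine-tuned classifier. The snippet below is a minimal sketch for running it on a single claim-evidence pair; it assumes the script saves a standard Hugging Face checkpoint in --output_dir and that inputs are encoded as (claim, evidence) pairs, and the index-to-label mapping should be checked against the repo's FEVER processor before interpreting predictions.

import torch
from transformers import BertForSequenceClassification, BertTokenizer

# Assumed path from the training command above; point this at your actual --output_dir
checkpoint_dir = './crossaug_trained_models_seed=177697310/'

tokenizer = BertTokenizer.from_pretrained(checkpoint_dir)
model = BertForSequenceClassification.from_pretrained(checkpoint_dir)
model.eval()

claim = "Magic Johnson did not play for the Lakers."
evidence = "Magic Johnson played point guard for the Los Angeles Lakers for 13 seasons."

# Encode the pair as during training (max_seq_length=128); the (claim, evidence) order is an assumption
inputs = tokenizer(claim, evidence, max_length=128, truncation=True, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
pred = logits.argmax(dim=-1).item()
# The label corresponding to each index (e.g. SUPPORTS / REFUTES / NOT ENOUGH INFO)
# is defined by the repo's data processor; verify it before interpreting `pred`.
print(pred)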

Results

Training and evaluation with the above commands should result in the following accuracies.

Method     FEVER dev   Symmetric   Adversarial   FM2 dev
No aug     86.43       59.14       50.00         41.15
PoE        86.14       63.88       51.31         47.39
CrossAug   85.05       68.20       52.48         45.17

Citation

@inproceedings{lee2021crossaug,
  title={CrossAug: A Contrastive Data Augmentation Method for Debiasing Fact Verification Models},
  author={Minwoo Lee and Seungpil Won and Juae Kim and Hwanhee Lee and Cheoneum Park and Kyomin Jung},
  booktitle={Proceedings of the 30th ACM International Conference on Information \& Knowledge Management},
  publisher={Association for Computing Machinery},
  series={CIKM '21},
  year={2021}
}

Footnotes

  1. FEVER: a large-scale dataset for Fact Extraction and VERification

  2. Towards Debiasing Fact Verification Models

  3. Adversarial attacks against Fact Extraction and VERification

  4. Fool Me Twice: Entailment from Wikipedia Gamification

  5. Automated Fact-Checking of Claims from Wikipedia
