Robustifying Sentiment Classification by Maximally Exploiting Few Counterfactuals

This repository contains the implementation of the EMNLP 2022 paper "Robustifying Sentiment Classification by Maximally Exploiting Few Counterfactuals" by Maarten De Raedt, Fréderic Godin, Chris Develder and Thomas Demeester.

For any questions about the paper or the code, contact the first author at maarten.deraedt@ugent.be.

EMNLP2022_robustifying_sentiment_classification
├── datasets/
│   ├── aaai-2021-counterfactuals/
│   ├── IMDb/
│   └── OOD/
├── encodings/
│   ├── all-distilroberta-v1/
│   ├── all-mpnet-base-v2/
│   ├── all-roberta-large-v1/
│   ├── unsup-simcse-bert-base-uncased/
│   ├── unsup-simcse-bert-large-uncased/
│   └── unsup-simcse-roberta-large/
├── results/
│   └── metrics/
│       ├── all-roberta-large-v1.json
│       ├── unsup-simcse-roberta-large.json
│       └── ...
├── evaluate.py
├── evaluators.py
├── featurizers.py
├── models.py
├── README.md
└── requirements.txt

Experiments

Run the command below to reproduce the main results for SRoBERTa-large.

python3 evaluate.py --name "all-roberta-large-v1"

And for SimCSE-RoBERTa-large:

python3 evaluate.py --name "unsup-simcse-roberta-large"

The results will be written to results/metrics/{name}.json. Note that for each value of k (16, 32, 64, 128), 50 different sets of k/2 negative and k/2 positive counterfactuals are randomly sampled. As a result, the numbers may differ slightly from those reported in the paper, but the main results and findings stay consistent. Depending on the CPU, running the experiments for a single encoder may take between 1 and 2 hours.
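The balanced sampling described above can be illustrated with a minimal sketch. This is not the code from evaluate.py: the function name `sample_balanced_subsets`, the toy pools, and the fixed `seed` argument are all assumptions made for illustration. A fixed seed is used here only so the example is repeatable; the note above about slightly differing results suggests the repository does not necessarily fix one.

```python
import random


def sample_balanced_subsets(negatives, positives, k, n_subsets=50, seed=0):
    """Draw n_subsets random subsets, each holding k/2 negative and k/2 positive items.

    Hypothetical helper for illustration; not part of the repository.
    """
    if k % 2 != 0:
        raise ValueError("k must be even so it can be split into two halves")
    half = k // 2
    if half > len(negatives) or half > len(positives):
        raise ValueError("not enough examples to draw k/2 items per class")

    rng = random.Random(seed)  # assumption: seeded only to make the sketch repeatable
    subsets = []
    for _ in range(n_subsets):
        # Sample without replacement within a subset; subsets may overlap each other.
        neg = rng.sample(negatives, half)
        pos = rng.sample(positives, half)
        subsets.append((neg, pos))
    return subsets


# Toy stand-ins for the pools of negative and positive counterfactuals.
negatives = [f"neg_{i}" for i in range(200)]
positives = [f"pos_{i}" for i in range(200)]

for k in (16, 32, 64, 128):
    subsets = sample_balanced_subsets(negatives, positives, k)
    print(k, len(subsets), len(subsets[0][0]), len(subsets[0][1]))
```

Averaging a metric over the 50 subsets for each k is what makes the reported numbers vary a little from run to run when no seed is fixed.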
