input-marginalization

Joint work with Xinyu Ma, Mayura Patwardhan, and Peter Michael. This repo contains code and data to implement "Interpretation of NLP models through input marginalization" by Kim et al.

Data

One of the datasets used for experimentation is the Stanford Sentiment Treebank (SST-2). We started with the SST-2 sentences cleaned by frankaging, which are in the data folder. We then preprocessed the data by removing "neutral" sentiment sentences and representing the sentences in the BERT vocabulary, which can be found in preprocessed_data.

The other dataset used is the Stanford Natural Language Inference (SNLI) Corpus. We processed this data into a tokenized representation using the BERT tokenizer. It is in preprocessed_data/SNLI/snli_1.0/snli_{train, dev, test}.pkl.

Preprocessed SST-2 and SNLI is provided in the preprocessed_data/. Wikitext-2 is provided in generative_model/data.

Code

For all of these, you might have to edit the Google Drive directory that each notebook mounts to. They should also be run in a GPU-accelerated Google Colab environment.

Training

The {bert, lstm, cnn}-sst2.ipynb files run through training and saving a model on SST-2. The training and fine-tuning code is adapted from Chris McCormick's tutorial.

The lstm-snli.ipynb file will train a Bi-LSTM on the SNLI dataset.

Input-Marginalization (Evaluation)

In order to get the figures that are seen in the results section we can run these few main files: figure2 will reproduce figure 2 in the original paper for {CNN, LSTM, BERT} trained on SST-2. snli_input_marge_v2.ipynb replicates figure 2 for LSTM trained on SNLI. figure3 will reproduce figure 3a in the original paper. figure4 will reproduce figure 3b in the original paper.

Models

The models, as well as their training and testing stats are saved, and can be found in this Google Drive folder.

We fine-tuned a BERTForSentenceClassification model from HuggingFace for SST-2.

Results

Table 1: Test accuracy of target models

Corpus	LSTM	BERT	CNN
SST-2	0.77	0.92	0.75
SNLI	0.67	--	--

Table 2: Comparison of AUCrep with existing erasure scheme (lower is better)

Zero	Unk	Input-Marginalization
0.5608	0.5328	0.4834

Dependencies

PyTorch, HuggingFace, pytorch_pretrained_bert

Difficulty

We started by implementing each of the models, were some of the difficulties lay in preprocessing and formatting the data correctly, as well as improving our accuracy. We had some trouble with implementing the input marginalization, and had to adjust our code to work for BERT, as well as our 3 other models. Finally, we ran into some subtle bugs when computing our AUC curve that took time to identify and solve. Overall we are proud of the work we accomplished on this project despite these challenges!

Name		Name	Last commit message	Last commit date
Latest commit History 152 Commits
data		data
generative_model		generative_model
models		models
preprocessed_data		preprocessed_data
.gitignore		.gitignore
README.md		README.md
figure2.ipynb		figure2.ipynb
figure3.ipynb		figure3.ipynb
input_marge_mc.py		input_marge_mc.py
input_marge_v3.ipynb		input_marge_v3.ipynb
interpretation.ipynb		interpretation.ipynb
metrics.py		metrics.py
models.py		models.py
preprocess.py		preprocess.py
snli_figure2.ipynb		snli_figure2.ipynb
snli_input_marge_v2.ipynb		snli_input_marge_v2.ipynb
utils.py		utils.py

ronakdm/input-marginalization

Folders and files

Latest commit

History

Repository files navigation

input-marginalization

Data

Code

Training

Input-Marginalization (Evaluation)

Models

Results

Dependencies

Difficulty

About

Resources

Stars

Watchers

Forks

Languages