
Short Cuts for Deep Neural models (2023)

Smit, J.P. - TU Delft

This repository contains code accompanying a thesis submitted to the EEMCS Faculty, Delft University of Technology, in partial fulfilment of the requirements for the Bachelor of Computer Science and Engineering.

Notable credit goes to JoshuaGhost for creating and maintaining ExPred, the deep neural model studied in this thesis. The model is an implementation of the paper Explain and Predict, and then Predict Again.

Structure

This repository is a copy of the ExPred repository with added Jupyter Notebooks. It contains subsequences mined from the FEVER dataset, a large collection (roughly 90,000 items) of factual claims, each labelled 'Supported' or 'Refuted'. The ExPred model draws evidence from Wikipedia pages to assign these labels to claims.

Interestingly, the ExPred model is not infallible: it sometimes makes mistakes. These mistakes can be structural, meaning the model is biased. We design an algorithm to point out the strongest biases of the ExPred model.
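
To make the setting concrete, the sketch below shows what such a labelled claim looks like and what is meant by a subsequence acting as a shortcut. The field names and values are illustrative, not the exact FEVER or ExPred schema.

```python
# Illustrative sketch of a FEVER-style claim record; field names and values
# are hypothetical, not the exact schema used by ExPred.
claim_record = {
    "claim": "Tilda Swinton is incapable of being an actress.",
    "label": "REFUTED",  # gold label from the training data
    "evidence": ["Katherine Matilda Swinton is a British actress."],  # Wikipedia sentence(s)
}

# A shortcut is a subsequence such as "is incapable of being" that the model
# may associate with one label regardless of the retrieved evidence.
shortcut = "is incapable of being"
print(shortcut in claim_record["claim"])  # True
```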

Algorithm

The algorithm was designed as follows:

Take the FEVER training dataset as the dataset, ExPred as the model, and DESQ as the subsequence mining tool.

  1. Mine sequences from the 'Refuted' queries of the dataset.
  2. Repeat with the 'Supported' queries of the dataset.
  3. XOR: combine the two sets of sequences and discard any sequence that occurs in both classes (a minimal sketch of steps 3 and 5 follows this list).
  4. Evaluate the mined sequences of both sides.
  5. Compute the model's correlation: confirm that the model output agrees with the training data.
  6. Propose 'Unseen claims': claims containing the subsequence that the model has not yet observed.
  7. Perform 'Adversarial attacks': swap the subsequence for a term that retains the meaning.
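
As a rough illustration of steps 3 and 5, the Python sketch below shows the XOR (symmetric difference) step and an agreement check; the sequence sets, the helper name, and the predict callable are hypothetical stand-ins for the DESQ output and the ExPred model.

```python
# Illustrative sketch of steps 3 and 5; the sequence sets and the `predict`
# callable are hypothetical stand-ins for DESQ output and the ExPred model.
refuted_seqs = {"is incapable of being", "has only ever been", "there is a"}
supported_seqs = {"there is a", "is a member of"}

# Step 3 (XOR): keep only sequences mined from exactly one label class,
# discarding anything that appears on both sides.
distinctive = refuted_seqs ^ supported_seqs

# Step 5: check how often the model's output agrees with the training label
# on claims that contain a given subsequence.
def agreement(sequence, claims, predict):
    """claims: iterable of (text, gold_label); predict: maps text -> label."""
    matches = [predict(text) == gold for text, gold in claims if sequence in text]
    return sum(matches) / len(matches) if matches else 0.0
```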

Results

Click on a sequence name to get to the corresponding notebook. Each notebook contains the code needed to perform the correlation check and adversarial attacks.

  • Subsequence : the subsequence that was mined from FEVER.
  • Prediction Class : the most prevalent prediction label for queries containing the mined subsequence.
  • Relative Support : percentage of queries containing the subsequence for which ExPred predicts the prediction class.
  • Precision : precision of the ExPred model on the queries containing the subsequence, i.e. how often its prediction is actually right.
  • Success-rate : percentage of adversarial attacks in which flipping the meaning also flipped the predicted label (a sketch of this check follows the table below).
Subsequence               Prediction Class
is incapable of being     REFUTED
has only ever been        REFUTED
does not have             REFUTED
is exclusively            REFUTED
is not a(n)               REFUTED
has yet to                REFUTED
is only a(n)              REFUTED
was unable to             REFUTED
There is a                SUPPORTED
was incapable of          REFUTED
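
Following the Success-rate definition above (an attack counts as successful when flipping the meaning of the subsequence also flips the predicted label), a minimal sketch of the adversarial-attack evaluation might look as follows; the swap table and the predict function are illustrative only, not the exact setup used in the notebooks.

```python
# Illustrative sketch of the adversarial-attack success rate; the swap table
# and `predict` are hypothetical, not the exact setup used in the notebooks.
SWAPS = {"is incapable of being": "is capable of being"}  # meaning-flipping edits
FLIP = {"REFUTED": "SUPPORTED", "SUPPORTED": "REFUTED"}

def success_rate(claims, predict):
    """claims: claim texts containing a mined subsequence;
    predict: maps a claim to 'SUPPORTED' or 'REFUTED'."""
    successes = attacks = 0
    for claim in claims:
        for old, new in SWAPS.items():
            if old not in claim:
                continue
            attacks += 1
            before = predict(claim)
            after = predict(claim.replace(old, new))
            if after == FLIP[before]:  # the label flipped along with the meaning
                successes += 1
    return successes / attacks if attacks else 0.0
```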

Reproducing the Research

Here are the steps for those who are interested in reproducing the research.

  1. Clone this repository
  2. Install the requirements for the ExPred model with pip install -r requirements.txt
  3. Run the provided Jupyter Notebooks