Analysis-of-Explainability-Techniques-on-BERT-for-Medical-Domain

In this work, we experiment with a variety of interpretability approaches on deep learning models trained for a classification task in the medical domain. We believe such work can contribute significantly to AI for healthcare and increase trust in the use of these models in high-stakes settings. We focus in particular on post-hoc analysis of the model, and we would like to acknowledge the survey paper Post-hoc Interpretability for Neural NLP: A Survey for providing key insights into the problem.

Interpretability approaches can be broadly categorized into two families: intrinsic and extrinsic. In intrinsic approaches, the model's own architecture generates the explanations. In extrinsic, or post-hoc, approaches, the model is explained by analyzing its outputs. In this work, our focus is exclusively on post-hoc approaches in medical NLP. Post-hoc approaches are often more practical than task-dependent intrinsic approaches because they are model-agnostic: they treat the model as a black box and use its outputs to generate explanations. However, post-hoc methods are sometimes criticized for providing misleading explanations of models that are fundamentally unexplainable. In this study, we apply a diverse set of post-hoc methods to a fine-tuned, pre-trained BERT-based model and assess the strengths and weaknesses of each method.

This work was submitted as the final project for the course CSE 256: Statistical NLP at the University of California, San Diego.

Dependency Installation

  1. Clone the repo
    git clone https://github.com/PrasannaKumaran/Analysis-of-Explainability-Techniques-on-BERT-for-Medical-Domain.git
    or, for SSH-configured accounts:
     git clone git@github.com:PrasannaKumaran/Analysis-of-Explainability-Techniques-on-BERT-for-Medical-Domain.git
  2. Upgrade pip
    python -m pip install --upgrade pip
  3. Create and Activate Virtual Environment (Linux)
    python3 -m venv [environment-name]
    source [environment-name]/bin/activate
  4. Install dependencies
    pip install -r requirements.txt
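
To sanity-check the environment after installation, you can try importing the core libraries. That torch and transformers are the key dependencies is an assumption about the contents of requirements.txt:

    # Quick check that the core dependencies resolved correctly
    # (assumes requirements.txt pins torch and transformers).
    import torch
    import transformers

    print("torch", torch.__version__, "| transformers", transformers.__version__)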

Model and Dataset

  • For this work we used the Kaggle medical transcriptions dataset and fine-tuned a pre-trained BERT-based model. Specifically, we used BioBERT and fine-tuned it for 100 epochs with the last few layers frozen. We use the model state from epoch 16, since the model's performance begins to drop after that point, as shown in the accompanying figure. A minimal sketch of this fine-tuning setup follows.
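The sketch below uses the Hugging Face transformers library. The BioBERT checkpoint name is real, but num_labels, the learning rate, and exactly which layers are frozen are illustrative assumptions, not this repository's actual configuration.

    import torch
    from transformers import AutoTokenizer, AutoModelForSequenceClassification

    # BioBERT backbone; num_labels=5 is a placeholder for the number of
    # transcription categories, not the value used in this work.
    model_name = "dmis-lab/biobert-base-cased-v1.1"
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=5)

    # "Freezing the last few layers": one literal reading is to freeze the top
    # two encoder layers (an assumption; the README does not name the layers).
    for layer in model.bert.encoder.layer[-2:]:
        for param in layer.parameters():
            param.requires_grad = False

    # Optimize only the parameters that remain trainable.
    optimizer = torch.optim.AdamW(
        (p for p in model.parameters() if p.requires_grad), lr=2e-5
    )

    # One illustrative training step on a dummy example; the actual run trained
    # for 100 epochs and kept the epoch-16 state based on validation performance.
    enc = tokenizer(
        ["Patient presents with chest pain radiating to the left arm."],
        return_tensors="pt", truncation=True, padding=True,
    )
    loss = model(**enc, labels=torch.tensor([0])).loss
    loss.backward()
    optimizer.step()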

Experiments

We implemented multiple post-hoc methods, including SHAP, LIME, Integrated Gradients, adversarial and counterfactual examples, vocabulary analysis, and BERTology. The implementation of each method can be found in the corresponding .ipynb notebook. Illustrative sketches of two of these methods follow.
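
As an illustration of the black-box end of this toolbox, below is a minimal SHAP sketch against a Hugging Face text-classification pipeline. The checkpoint path and example sentence are placeholders, and this is not the repository's notebook code verbatim.

    import shap
    from transformers import AutoTokenizer, AutoModelForSequenceClassification, pipeline

    # Hypothetical path to the fine-tuned BioBERT checkpoint.
    model_dir = "./biobert-finetuned"
    tokenizer = AutoTokenizer.from_pretrained(model_dir)
    model = AutoModelForSequenceClassification.from_pretrained(model_dir)

    # top_k=None makes the pipeline return a score for every class,
    # which SHAP needs in order to attribute each class probability.
    clf = pipeline("text-classification", model=model, tokenizer=tokenizer, top_k=None)

    explainer = shap.Explainer(clf)
    shap_values = explainer(["The patient presents with chest pain and shortness of breath."])
    shap.plots.text(shap_values[0])  # token-level attribution view

For the gradient-based side, here is a sketch of Integrated Gradients using Captum's LayerIntegratedGradients, reusing the model and tokenizer from above. The [PAD] baseline and the choice of the embedding layer are conventional but still assumptions about the notebook's exact setup.

    import torch
    from captum.attr import LayerIntegratedGradients

    text = "The patient presents with chest pain and shortness of breath."
    enc = tokenizer(text, return_tensors="pt")
    input_ids, attention_mask = enc["input_ids"], enc["attention_mask"]

    # Baseline: same length, every non-special token replaced by [PAD].
    baseline_ids = torch.full_like(input_ids, tokenizer.pad_token_id)
    baseline_ids[0, 0] = tokenizer.cls_token_id
    baseline_ids[0, -1] = tokenizer.sep_token_id

    def forward(ids, mask):
        return model(input_ids=ids, attention_mask=mask).logits

    with torch.no_grad():
        pred_class = int(forward(input_ids, attention_mask).argmax(dim=-1))

    # Attribute the predicted class score to the input embeddings.
    lig = LayerIntegratedGradients(forward, model.bert.embeddings)
    attributions = lig.attribute(
        inputs=input_ids,
        baselines=baseline_ids,
        additional_forward_args=(attention_mask,),
        target=pred_class,
    )

    # Collapse the embedding dimension to get one score per token.
    token_scores = attributions.sum(dim=-1).squeeze(0)
    for tok, score in zip(tokenizer.convert_ids_to_tokens(input_ids[0].tolist()),
                          token_scores.tolist()):
        print(f"{tok:>12s} {score:+.4f}")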

Authors

Prasannakumaran D, Ashwin Muralidharan, Zongze Liu, Pranav Khanna
