
Classification words importance #17

Closed · lspataro opened this issue Aug 20, 2019 · 14 comments
Labels: enhancement (New feature or request)

Comments

@lspataro

Is there any way to use bertviz to visualise the importance of the different words with respect to a given prediction in a classification task (BertClassifier)?
Similar to this: https://docs.fast.ai/text.interpret.html#interpret

Thank you

@jessevig
Owner

Good idea, and thanks for pointing me to that code. I've been experimenting with gradient-based sensitivity analysis on the attention weights (rather than the inputs), but the results weren't particularly interpretable. But I may revisit this with some of the newer fine-tuned models. I agree that this would be a useful analysis for the inputs, though it may overload the visualization. I'll think more about this. Thanks.
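
For concreteness, here is a minimal sketch of what gradient-based saliency on the inputs could look like with a recent transformers version. The checkpoint, the example sentence, and the gradient-times-input reduction are all illustrative assumptions, not the exact analysis described above:

```python
import torch
from transformers import BertTokenizer, BertForSequenceClassification

# Illustrative checkpoint; substitute your own fine-tuned classifier.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased")
model.eval()

inputs = tokenizer("The girl wears a red hat.", return_tensors="pt")

# Embed the tokens ourselves so gradients can be taken w.r.t. the embeddings.
embeds = model.bert.embeddings.word_embeddings(inputs["input_ids"])
embeds.retain_grad()
logits = model(inputs_embeds=embeds, attention_mask=inputs["attention_mask"]).logits

# Back-propagate the logit of the predicted class.
pred = logits[0].argmax()
logits[0, pred].backward()

# Gradient x input, summed over the hidden dimension -> one score per token.
scores = (embeds.grad * embeds).sum(dim=-1).squeeze(0)
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
for tok, s in zip(tokens, scores.tolist()):
    print(f"{tok:>12s}  {s:+.4f}")
```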

@jessevig jessevig added the enhancement New feature or request label Aug 20, 2019
@Azharo

Azharo commented Aug 26, 2019

I've been wondering how to do this as well, since I want to try to visualize which words were most important to the classification. One idea I've had (if I have 12 encoders in BERT and am only fine-tuning the 12th layer) is to take the output of the 11th layer plus the Wq, Wk, and Wv weights of the fine-tuned 12th layer, and calculate the score manually. Would that be the correct way to think about it? I would essentially get the value score for each token in the input.
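
A rough sketch of that idea, assuming a recent transformers version (the attribute names follow HF's BertSelfAttention module; the checkpoint and sentence are illustrative):

```python
import math
import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased", output_hidden_states=True)
model.eval()

inputs = tokenizer("He went to play basketball after class.", return_tensors="pt")
with torch.no_grad():
    out = model(**inputs)

# hidden_states[11] is the output of layer 11, i.e. the input to layer 12.
layer_in = out.hidden_states[11]               # (1, seq_len, 768)
attn = model.encoder.layer[11].attention.self  # layer 12's Wq/Wk/Wv live here

heads, head_dim = attn.num_attention_heads, attn.attention_head_size

def split(x):  # (1, seq, 768) -> (1, heads, seq, head_dim)
    return x.view(1, -1, heads, head_dim).transpose(1, 2)

q, k = split(attn.query(layer_in)), split(attn.key(layer_in))
scores = q @ k.transpose(-1, -2) / math.sqrt(head_dim)
weights = scores.softmax(dim=-1)  # should match the attentions the model reports
```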

@zysNLP

zysNLP commented Nov 14, 2019

Hi @jessevig, thanks for your great work. As @lspataro and @Azharo mentioned, I want to ask about some more details. Let's look at an example:
As we know, BERT/GPT-2 accept data in the form of "sentence, label" or "prompt, inference, label", which correspond to a text classification task and an NLI task, as in these sentences:

Text classification:
s1: The girl wears a red hat and dresses up like a princess. --- label "clothing"
s2: He went to play basketball after class. It seems not many people today. --- label "sport"
As expected, the words "hat" and "dresses" in s1 and "basketball" in s2 should receive more attention, i.e. a greater weight.

NLI:
prompt: Some people think that strict punishments for driving offenses are the key to reducing traffic accidents.
inference: Government should invest more in non-profit advertisements, so that they would improve people’s safety awareness potentially
label: 1
As expected, the words "driving offenses" and "traffic accidents" in the prompt and "government" and "safety awareness" in the inference should presumably receive more attention (a greater weight), while other words receive smaller weights.

We want to get results like these, not necessarily a relatively complex visualization like the ones @jessevig showed; I think this work may need to be done in the last layer of BERT. Furthermore, I would like to see the process.
For example, when we focus on the word "hat" in s1 above, given the predicted result "clothing", I would like to see how BERT comes to place a higher weight on "hat" step by step, layer by layer. I am not clear about what that process should look like, and I hope to get your answer. Thank you!
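
One way to inspect that process, sketched under the assumption of a recent transformers version (checkpoint and sentence are illustrative): run the model with output_attentions=True and track, layer by layer, how much the [CLS] position (which feeds the classifier) attends to "hat":

```python
import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", output_attentions=True
)
model.eval()

s1 = "The girl wears a red hat and dresses up like a princess."
inputs = tokenizer(s1, return_tensors="pt")
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
hat = tokens.index("hat")

with torch.no_grad():
    attentions = model(**inputs).attentions  # 12 tensors of (1, heads, seq, seq)

# Attention flowing from [CLS] (position 0) to "hat", averaged over heads.
for i, layer_attn in enumerate(attentions, start=1):
    print(f"layer {i:2d}: CLS->hat = {layer_attn[0, :, 0, hat].mean().item():.4f}")
```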

@diogocortiz

Hi there, is anyone working on this topic?

I am looking for a way to identify the most important words in a sentence classification task as well.

@dshahrokhian

+1

@koren-v

koren-v commented May 1, 2020

+1

@LinkToPast1990

same question here :)

@koren-v

koren-v commented Jul 14, 2020

Hi, I implemented some gradient-based algorithms; you can check them here: https://github.com/koren-v/Interpret
(Integrated Gradients works pretty well compared to SmoothGrad)
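
For reference, the core of Integrated Gradients is small enough to sketch from scratch. This is a generic Riemann-sum approximation, not the linked repo's code; forward_fn and its wiring are left to the caller:

```python
import torch

def integrated_gradients(forward_fn, embeds, baseline, target, steps=50):
    """Riemann-sum approximation of Integrated Gradients over input embeddings.

    forward_fn maps an embeddings tensor to logits, e.g.
        lambda e: model(inputs_embeds=e, attention_mask=mask).logits
    """
    total = torch.zeros_like(embeds)
    for alpha in torch.linspace(0.0, 1.0, steps):
        # Interpolate between baseline and input, then take gradients there.
        point = (baseline + alpha * (embeds - baseline)).detach().requires_grad_(True)
        logits = forward_fn(point)
        grad, = torch.autograd.grad(logits[0, target], point)
        total += grad
    # Average gradient along the path, scaled by input minus baseline.
    return (embeds - baseline) * total / steps
```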

@ssdorsey

I can definitely recommend captum as well, they have an example using BERT: https://captum.ai/tutorials/Bert_SQUAD_Interpret
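
Roughly, the pattern from that tutorial adapted to sequence classification looks like this; the checkpoint, baseline choice, and target class are illustrative (the captum tutorial keeps [CLS]/[SEP] in the baseline, which this simplifies away):

```python
import torch
from captum.attr import LayerIntegratedGradients
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased")
model.eval()

def forward(input_ids, attention_mask):
    return model(input_ids=input_ids, attention_mask=attention_mask).logits

inputs = tokenizer("He went to play basketball after class.", return_tensors="pt")
# Simple all-[PAD] baseline as the "neutral" reference input.
baseline = torch.full_like(inputs["input_ids"], tokenizer.pad_token_id)

lig = LayerIntegratedGradients(forward, model.bert.embeddings)
attributions = lig.attribute(
    inputs["input_ids"],
    baselines=baseline,
    additional_forward_args=(inputs["attention_mask"],),
    target=1,  # index of the class being explained
)
scores = attributions.sum(dim=-1).squeeze(0)  # one attribution score per token
```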

@gcunhase

+1

@clairemcwhite

This package does word importance directly with huggingface transformers, using captum:

https://github.com/cdpierse/transformers-interpret

Transformers Interpret is a model explainability tool designed to work exclusively with the 🤗 transformers package.

In line with the philosophy of the transformers package, Transformers Interpret allows any transformers model to be explained in just two lines. It even supports visualizations both in notebooks and as savable HTML files.
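
For example, the usage shown in the project README looks roughly like this (the checkpoint name here is just an example):

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from transformers_interpret import SequenceClassificationExplainer

name = "distilbert-base-uncased-finetuned-sst-2-english"
model = AutoModelForSequenceClassification.from_pretrained(name)
tokenizer = AutoTokenizer.from_pretrained(name)

# The advertised "two lines": build the explainer, then call it on a sentence.
cls_explainer = SequenceClassificationExplainer(model, tokenizer)
word_attributions = cls_explainer("He went to play basketball after class.")

print(word_attributions)   # list of (token, attribution) pairs
cls_explainer.visualize()  # renders an HTML view in notebooks
```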

@jessevig
Owner

jessevig commented May 8, 2021

Closing this issue as I feel it's out of scope for BertViz given the other available libraries.

@jessevig jessevig closed this as completed May 8, 2021
@er-vishalgour

Yeah, same question.

@shoroukadel

I tried to use the code from https://github.com/cdpierse/transformers-interpret for a multi-class classification task with my pretrained BERT model. I trained my model and saved it using model.save(). Then, when I tried to load the model from that path, I got this error:

```
OSError: Can't load config for '/content/drive/Shared with me/Colab Notebooks/bert_visualization2.h5py'. Make sure that:

- '/content/drive/Shared with me/Colab Notebooks/bert_visualization2.h5py' is a correct model identifier listed on 'https://huggingface.co/models'

- or '/content/drive/Shared with me/Colab Notebooks/bert_visualization2.h5py' is the correct path to a directory containing a config.json file
```
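
The .h5py extension suggests the model was saved with Keras-style model.save(), which writes an HDF5 file; transformers can only load a directory written by save_pretrained(), which includes the config.json the error asks for. A possible fix, assuming the model is a transformers model object (the directory path here is hypothetical):

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Hypothetical directory; the important part is using save_pretrained,
# which writes config.json alongside the weights.
save_dir = "/content/drive/MyDrive/bert_multiclass"

model.save_pretrained(save_dir)      # writes config.json + model weights
tokenizer.save_pretrained(save_dir)

# Later, load from the same directory.
model = AutoModelForSequenceClassification.from_pretrained(save_dir)
tokenizer = AutoTokenizer.from_pretrained(save_dir)
```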
