
Discrepancy between local explainability and global explainability #371

Closed
yaoching0 opened this issue Mar 22, 2022 · 11 comments · Fixed by #372
Labels
bug Something isn't working

Comments

@yaoching0

yaoching0 commented Mar 22, 2022

Hi,
I trained TabNet with 4700-dimensional features and checked the global explainability. (Because of its sparsity, I applied unique(); the screenshot shows the indices into the global explainability matrix.)
Then I passed the training data to the .explain(training data) function and summed explain_matrix across the rows, but I got a totally different result from the global explainability.
For example, the index with the maximum value in the global explainability even has a value of 0 in the output of the explain() function.
[screenshot]

@eduardocarvp
Collaborator

Hello @yaoching0 ,

Are you using embeddings for categorical features in your model?

@yaoching0
Author

@eduardocarvp No, all features are numerical, with values between 0 and 1.

@yaoching0
Author

My data is preprocessed and the model trained exactly as in the example.
[screenshot]
[screenshot]

@yaoching0
Author

This is my training data format
[screenshot]

@Optimox
Collaborator

Optimox commented Mar 22, 2022

The clf.feature_importances_ are normalized while the individual importances are not; have a look at #180.

So you need to divide by the sum of each row. Moreover, you want to average, not sum:

```python
avg_imp = np.mean(explain_matrix, axis=0)
avg_imp = avg_imp / np.sum(avg_imp)
```

(make sure the axes are correct)
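Putting that together, a minimal sketch of the comparison, assuming a fitted TabNetClassifier `clf` and a numpy array `X_train` with the same preprocessing used for training:

```python
import numpy as np

# explain() returns per-sample importances and the attention masks.
explain_matrix, masks = clf.explain(X_train)  # shape: (n_samples, n_features)

# Average over samples, then normalize so the importances sum to 1,
# matching the scale of clf.feature_importances_.
avg_imp = np.mean(explain_matrix, axis=0)
avg_imp = avg_imp / np.sum(avg_imp)

# The two vectors should now be comparable (up to the dataloader
# caveat discussed further down in this thread).
print(np.abs(avg_imp - clf.feature_importances_).max())
```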

@yaoching0

This comment was marked as outdated.

@yaoching0

This comment was marked as outdated.

@yaoching0
Author

@Optimox
I have a new clue: after the calculation they have the same values, but in different positions. Why is this happening?
[screenshot]

@yaoching0
Author

yaoching0 commented Mar 22, 2022

I restarted the Jupyter kernel, and now they seem to match.

@Optimox
Collaborator

Optimox commented Mar 22, 2022

@yaoching0,

Actually, I'm surprised that things are working for you now.

I ran a test and was not able to reproduce the feature_importances_ exactly. This is because the internal feature importances are computed on the train DATALOADER, while clf.explain(X) creates a new dataloader.

So if any parameter changes the training dataloader, you'll end up with a different score. This can happen in different scenarios (see the sketch after this comment):

  • drop_last=True: the feature importances are computed without a few rows (randomly selected, since shuffle=True for the training dataloader). Potentially this means the feature importances are somewhat random (but still representative). I think it's fair to call this a bug.
  • weights=1: some examples are over-sampled in the training loader, which changes the final feature importances.

I think those are the only two reasons, but there might be a few other scenarios I did not spot.

In the end: this is a bug and I'll fix it; thank you very much for finding it. In the meantime, the differences come from the sample used for the internal feature importances, so you can trust both methods. If you see a big difference in your case, it is due to the high sparsity of your data, and the training-set importances might not be very representative.
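To illustrate the drop_last point (a hypothetical sketch with made-up sizes, not pytorch-tabnet internals): the training-style loader and the loader built by explain() can simply see different rows.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

X = torch.rand(100, 4700)  # 100 rows, 4700 features

# Training-style loader: shuffle=True with drop_last=True silently excludes
# the last len(X) % batch_size rows of the shuffled order, and *which* rows
# are excluded changes every epoch.
train_loader = DataLoader(TensorDataset(X), batch_size=32,
                          shuffle=True, drop_last=True)

# explain()-style loader: deterministic, sees every row.
explain_loader = DataLoader(TensorDataset(X), batch_size=32,
                            shuffle=False, drop_last=False)

print(sum(batch[0].shape[0] for batch in train_loader))    # 96
print(sum(batch[0].shape[0] for batch in explain_loader))  # 100
```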

@Optimox Optimox reopened this Mar 22, 2022
@Optimox Optimox added the bug Something isn't working label Mar 22, 2022
@eduardocarvp
Collaborator

Yes, I think there are some problems as well. I was wondering whether the reducing matrix has any influence, since in one case we sum first and then apply the reduction, and in the other case the reverse. But since there are no np.abs calls, only sums and averages, I guess it should be the same...
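That linearity argument can be checked numerically with a tiny sketch (shapes and names are illustrative, not the library's internals): because the reduction is a matrix product, summing over samples before or after applying it gives the same result.

```python
import numpy as np

rng = np.random.default_rng(0)
imp = rng.random((8, 10))          # per-sample importances over 10 embedded dims
reduce_mat = rng.random((10, 6))   # maps embedded dims back to 6 original features

sum_then_reduce = imp.sum(axis=0) @ reduce_mat
reduce_then_sum = (imp @ reduce_mat).sum(axis=0)
print(np.allclose(sum_then_reduce, reduce_then_sum))  # True
```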
