Add TFTExplainer #1392

Cattes · 2022-11-27T20:40:53Z

Implements explainability for the TFT model as discussed in #675

Summary

I have added a TFTExplainer class similar to the ShapExplainer class to get the explainability insights for the TFT model.

The class contains the encoder_importance and decoder_importance of the trained TFT Model (grouped together with a plot option in the get_variable_selection_weight method).

The TFTExplainer.explain() method calls the predict method of the trained TFT Model to get the attention over time.

I have also provided a plot_attention_heads method to plot the attention over time either as the average attention, as a heatmap or a detailed plot of all available attention heads.

I have added a section on how to use the class to the 13-TFT-example.ipynb notebook.

Other Information

I have oriented myself to the suggestions in the Issue made by @hrzn

from darts.models import TFTModel
from darts.explainability import TFTExplainer

my_model = TFTModel(...)
my_mode.fit(...)

explainer = TFTExplainer(my_model, ...)

# get the explainability results
results = explainer.explain()

# plot variable selection weights
explainer.plot_variable_selection(results)

# plot the attention over time (three plot options)
explainer.plot_attention(results, plot_type="all")
explainer.plot_attention(results, plot_type="time")
explainer.plot_attention(results, plot_type="heatmap")

# get feature importance values
encoder_importance = results.get_encoder_importance()
decoder_importance = results.get_decoder_importance()
static_covariates_importance = results.get_static_covariates_importance()
# get attention `TimeSeries`
attention = results.get_attention()

The inital code on how to get the details from the TFT class was provided by @MagMueller in the Issue.

Edit (@dennisbader, 27-07-2023):

refactored ForecastingModelExplainer and ExplainabilityResult to simplify/unify implementation of new explainers
explain() now calls model.predict() with the passed foreground/background series. The attention and feature importances are actually dependent on the input to predict/forward (and not fixed after training).
plot methods are now relying on the explain() output
added static covariates importance
updated the attention plots to have xaxis 0 point where the prediction starts (start of output chunk)
added colorbar to attention heat map
fixed TFTModel attention mask when full_attention=True
fixed an issue with encoded covariates not being properly stored in the TorchForecastingModel in case of training on a single series
adapted tests to check on combinations of input series (univariate, multivariate, multiple multivariate) and covariates, encoders, and model creation parameter add_relative_index

…FTExplainer

…ner.py

review-notebook-app · 2022-11-27T20:40:57Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

… plot

codecov-commenter · 2022-11-29T10:53:39Z

Codecov Report

Patch coverage: 94.54% and no project coverage change.

Comparison is base (d30f163) 93.84% compared to head (7e80088) 93.85%.

❗ Your organization is not using the GitHub App Integration. As a result you may experience degraded service beginning May 15th. Please install the Github App Integration for your organization. Read more.

Additional details and impacted files

@@           Coverage Diff            @@
##           master    #1392    +/-   ##
========================================
  Coverage   93.84%   93.85%            
========================================
  Files         126      128     +2     
  Lines       12174    12416   +242     
========================================
+ Hits        11425    11653   +228     
- Misses        749      763    +14

Files Changed	Coverage Δ
darts/models/forecasting/forecasting_model.py	`95.01% <80.00%> (-0.14%)`	⬇️
darts/explainability/explainability_result.py	`92.13% <90.00%> (-2.87%)`	⬇️
darts/explainability/shap_explainer.py	`87.69% <91.66%> (-1.66%)`	⬇️
darts/explainability/tft_explainer.py	`93.15% <93.15%> (ø)`
darts/explainability/utils.py	`96.66% <96.66%> (ø)`
darts/explainability/__init__.py	`100.00% <100.00%> (ø)`
darts/explainability/explainability.py	`100.00% <100.00%> (+3.57%)`	⬆️
darts/models/forecasting/regression_model.py	`95.36% <100.00%> (+0.03%)`	⬆️
darts/models/forecasting/tft_model.py	`97.27% <100.00%> (+0.50%)`	⬆️
...arts/models/forecasting/torch_forecasting_model.py	`90.64% <100.00%> (-0.05%)`	⬇️
... and 1 more

... and 4 files with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

hrzn · 2022-12-22T16:03:48Z

Thanks a lot for this PR @Cattes ! Please bear with us... we are a bit slow to review at the moment (busy preparing the release of 0.23), but we'll get at it, and we feel very enthusiastic about adding TFT explainability!

hrzn

This looks very good @Cattes , I think it'll be a nice addition to Darts. Thanks a lot!
I have some comments, mainly related to the docstrings, but also one question about how we handle the horizons in the returned ExplainabilityResult - are you sure we should always consider horizon 0 only there? What about the case where the actual forecast horizon is larger?
When n > output_chunk_length, forward() will be called multiple times auto-regressively, so we have to be careful there. Maybe it'd make sense to disregard n in explain(), and return some result for each horizon in the output_chunk_length? I might also be missing something.

darts/timeseries.py

darts/explainability/tft_explainer.py

hrzn · 2023-01-18T15:39:02Z

darts/explainability/tft_explainer.py

+        self._model = model
+
+    @property
+    def encoder_importance(self):


Could you add docstrings explaining what this and decoder_importance are returning? They can be quite useful I think.

I have added docstrings to the module and properties. I wasn't 100% sure on the details of the model so if you could have a look at it that would be great? If everything is fine you can resolve this conversation.

Hey, I was trying to call this function and ran into an error. it says no attribute 'encoder_sparse_weights'. I went to the tft_model.py and uncommented this code chunk :
return self.to_network_output(
prediction=self.transform_output(out, target_scale=x["target_scale"]),
attention=attn_out_weights,
static_variables=static_covariate_var,
encoder_variables=encoder_sparse_weights,
decoder_variables=decoder_sparse_weights,
decoder_lengths=decoder_lengths,
encoder_lengths=encoder_lengths,
)

It now says TFTModule has no attribute called to_network_output.

Can I get some help regarding how to call the explainer and use it in my code?

hrzn · 2023-01-18T15:43:32Z

darts/explainability/tft_explainer.py

+            "decoder_importance": self.decoder_importance,
+        }
+
+    def explain(self, **kwargs) -> ExplainabilityResult:


How about taking some of the predict() parameters explicitly? At least series, past_covariates, future_covariates and n would make sense IMO. It will produce more comprehensible API documentation.

I am not sure if that is relevant here at all.

I do not understand why predict has to be called to get the proper attention heads of the time series. The learned autoregressive connections should depend on how predict is called. But if predict is not called at all the attention_heads saved in self._model.model._attn_out_weights do not have the right format. I assume they are still in a state of training and the predict() call changes that.

If that is the case I would rather remove the **kwargs completely from the explain method here and call predict once with self._model.model.output_chunk_length to get the correct attention heads.

Yes I agree with you we need to call predict() here. However predict() takes a lot more arguments than just n. It takes series (the series to predict), as well as covariates arguments and other arguments: see the API doc.
I think we should probably change the signature of explain() to something like

def explain(self, series, past_covariates, future_covariates, **kwargs) -> ExplainabilityResult

This way in the docstring you can list series, past_covariates and future_covariates, and explain that those are passed down to predict(). You can also say that n will always be set to output_chunk_length (unless I'm wrong I think that's always what's needed), and that **kwargs can contain extra arguments for the predict method and link to the API documentation of TFTModel.predict().
I hope it makes sense.

I think calling predict() is just a technicality to get to the correct attention weights. I don't think the way we call predict matters at all for the result, its just important that it was called (for whatever reason).
If I understand it correctly the attention weights are learned during training and are not impacted by the data used in the predict call.

They don't have a similar logic behind them like shapley values but are learned during the training and are a fixed part of the trained model.

Maybe I am wrong, but if I am right I would rather remove all parameter passed to explain() and have the predict() call happen without the user needing to know about it at all.

darts/explainability/tft_explainer.py

hrzn · 2023-01-18T15:48:10Z

darts/explainability/tft_explainer.py

+        # return the explainer result to be used in other methods
+        return ExplainabilityResult(
+            {
+                0: {


Is this always relating to horizon 0 only? How about the cases where predict() is called with n > 1 above?

I had to set the 0 here to be compatible with the ForecastingModelExplainer base class. To get the attention_heads the predict method of the TFT class has to be called or the attention_heads will not show the correct values. I am not sure why yet. Placing this logic into the explain() method as the ExplainabilityResult felt like a sensible choice.
We could deviate from the ForecastingModelExplainer class or add a note to the docstring that the 0 is irrelevant in this context.

So if I follow well, here the explanation is for all forecasted horizons at once, right?
I would then propose the following. We can adapt the class ExplainabilityResult in order to make it a little bit more flexible:

It could be used with one explanation per horizon (as now), or

with one single explanation for all horizons (as required in this case for the TFT).

To accommodate the second case, we could make it possible to build ExplainabilityResult with only a Dict[str, TimeSeries] (in addition to Dict[integer, Dict[str, TimeSeries]]), so we avoid specifying the horizon. We can also adapt ExplainabilityResult.get_explanation() to make specifying the horizon optional, and not supported if the underlying explanation is not split by horizon.

WDYT? I would find this cleaner than "hacking" the class by using a fake horizon 0.

@Cattes any thoughts on this? ^

I think its a good idea to change the class to handle the TFT explainer. I didn't want to do it before discussing it. Having the hack with horizon=0 was just to conform with the given api. It was not an intuitive solution.
I have added Dict[str, TimeSeries] to the valid type for class initialization and made the horizon optional.
I also added a few more validations to deal with the different input types explicitly.

darts/explainability/tft_explainer.py

darts/timeseries.py

hrzn · 2023-01-24T10:24:24Z

@Cattes it seems there's a linting issue preventing the tests from being run.

dennisbader · 2023-07-11T10:17:45Z

Yes, we did some refactoring of the Explainability module a couple of weeks back.
I also noticed some things while experimenting a bit with the TFTExplainer:

TFTModel's attention head is actually dependent on the input. This is why you got different results for the attention head when calling predict() for the first time. So in my opinion we indeed need the explain() foreground arguments.
we need to add support for interpretable static covariate variable selection / importance

I can implement these things :)

dennisbader · 2023-07-27T12:03:21Z

Hi @Cattes I finished the adaptions for TFTExplainer now. The refactor got quite big, because I took the time to refactor the ForecastingModelExplainer and ExplainabilityResult backbone to make it easier to implement new explainers in the future.

I also caught one or two bugs on the way that I fixed with it.

I updated the PR description with the points I added/adapted.

Let me know if you want to go over the changes and/or if you're okay with the new version :)

Thanks for this great PR and sorry again for the time it took us to get this through 🚀
This will be part of the next release which comes in one to two weeks 💯

madtoinou

Nice refactoring of the explainability and really cool feature (awaited by a lot of users)!

Some minors comments, this is almost ready for merge 🚀

darts/explainability/explainability.py

darts/explainability/explainability_result.py

darts/explainability/tft_explainer.py

darts/explainability/utils.py

darts/models/forecasting/tft_model.py

darts/tests/explainability/test_tft_explainer.py

darts/timeseries.py

Cattes · 2023-07-31T17:50:29Z

Thank you @dennisbader and @madtoinou for finishing up the PR! Sorry I could not have a look at it before the merge because I was on holidays.

Cattes added 6 commits November 2, 2022 09:35

unit8co#675 add first draft for tft_explainer

ce98db1

unit8co#675 add first working version of TFTExplainer class with tests

5678f7f

unit8co#675 allow passing of arguments to the explain method of the T…

c22e96d

…FTExplainer

unit8co#675 add test for multiple_covariates input to test_tft_explai…

598b134

…ner.py

unit8co#675 add correct feature names to vsv

f7387a4

unit8co#675 add TFTExplainer to 13-TFT-examples.ipynb

68e384d

Cattes requested review from hrzn and dennisbader as code owners November 27, 2022 20:40

Cattes mentioned this pull request Nov 27, 2022

Add support for interpretable outputs in TFTModel #675

Closed

Cattes added 2 commits November 27, 2022 21:59

Merge branch 'master' into feature/675_tft_explainer

bfffe87

unit8co#675 add CHANGELOG.md entry for the TFTExplainer class

160d196

Cattes mentioned this pull request Nov 28, 2022

#675 add first draft for tft_explainer Cattes/darts#1

Closed

Cattes and others added 2 commits November 28, 2022 20:49

Merge branch 'unit8co:master' into feature/675_tft_explainer

60eae66

unit8co#675 use @MagMueller's plot method for the variable importance…

42cfb92

… plot

Cattes added 2 commits December 19, 2022 17:24

Merge branch 'master' into feature/675_tft_explainer

57e64fb

unit8co#675 allow absolute tolerance of 1% in feature importance test

e59a7ba

hrzn and others added 2 commits January 18, 2023 09:20

Merge branch 'master' into feature/675_tft_explainer

49587c9

Update CHANGELOG.md

706e190

hrzn reviewed Jan 18, 2023

View reviewed changes

hrzn and others added 5 commits January 20, 2023 11:50

Merge branch 'master' into feature/675_tft_explainer

eb4bcfc

Merge branch 'master' into feature/675_tft_explainer

96a1c2b

Merge branch 'unit8co:master' into feature/675_tft_explainer

4556a44

unit8co#675 work in PR feedback

79b7755

Merge branch 'master' into feature/675_tft_explainer

18f9f65

hrzn reviewed Jan 24, 2023

View reviewed changes

darts/timeseries.py Outdated Show resolved Hide resolved

dennisbader added 17 commits July 12, 2023 09:02

refactor ForecastingModelExplainer.__init__

42c9c96

further explainability refactoring for input processing

9464c8f

Merge branch 'master' into feature/675_tft_explainer

71e1e81

refactor ForecastingModelExplainer p3

7ef185a

Merge branch 'master' into feature/675_tft_explainer

ce4e233

full refactor of ForecastingModelExplainer

8979bd8

update component naming

40c5aef

add static covariates importance

10fc42b

improved attention head plots

052680a

multiple time series support

29e5aa3

update explainability documnetation

090419b

update TFTModel full attention

72f8ded

remove optional horizon from HorizonBasedExplainabilityResult

8a3af53

update TFTModel example notebook

9ffcd5f

fix covariates issue when supplying covariates at predict time

04daed1

update unit tests

cd79722

update changelog

a66c86e

dennisbader requested a review from madtoinou July 30, 2023 11:23

Merge branch 'master' into feature/675_tft_explainer

7e80088

madtoinou reviewed Jul 31, 2023

View reviewed changes

dennisbader added 2 commits July 31, 2023 13:46

applied suggestions from PR review

d0c472c

Merge branch 'master' into feature/675_tft_explainer

92391a8

madtoinou approved these changes Jul 31, 2023

View reviewed changes

dennisbader approved these changes Jul 31, 2023

View reviewed changes

dennisbader merged commit 071c7e8 into unit8co:master Jul 31, 2023
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add TFTExplainer #1392

Add TFTExplainer #1392

Cattes commented Nov 27, 2022 •

edited by dennisbader

Loading

review-notebook-app bot commented Nov 27, 2022

codecov-commenter commented Nov 29, 2022 •

edited

Loading

hrzn commented Dec 22, 2022

hrzn left a comment

hrzn Jan 18, 2023

Cattes Feb 27, 2023

rahuldixit18 Apr 5, 2023

hrzn Jan 18, 2023

Cattes Feb 23, 2023

hrzn Feb 26, 2023 •

edited

Loading

Cattes Feb 27, 2023

hrzn Jan 18, 2023

Cattes Jan 23, 2023

hrzn Jan 24, 2023

hrzn Feb 10, 2023

Cattes Feb 23, 2023

hrzn commented Jan 24, 2023

dennisbader commented Jul 11, 2023 •

edited

Loading

dennisbader commented Jul 27, 2023 •

edited

Loading

madtoinou left a comment

Cattes commented Jul 31, 2023

Add TFTExplainer #1392

Add TFTExplainer #1392

Conversation

Cattes commented Nov 27, 2022 • edited by dennisbader Loading

Summary

Other Information

review-notebook-app bot commented Nov 27, 2022

codecov-commenter commented Nov 29, 2022 • edited Loading

Codecov Report

hrzn commented Dec 22, 2022

hrzn left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hrzn Feb 26, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hrzn commented Jan 24, 2023

dennisbader commented Jul 11, 2023 • edited Loading

dennisbader commented Jul 27, 2023 • edited Loading

madtoinou left a comment

Choose a reason for hiding this comment

Cattes commented Jul 31, 2023

Cattes commented Nov 27, 2022 •

edited by dennisbader

Loading

codecov-commenter commented Nov 29, 2022 •

edited

Loading

hrzn Feb 26, 2023 •

edited

Loading

dennisbader commented Jul 11, 2023 •

edited

Loading

dennisbader commented Jul 27, 2023 •

edited

Loading