Torchscript-compatible TabNet #2126

Merged: 6 commits into master from fix-ts-tabnet on Jun 10, 2022
Conversation

@geoffreyangus (Collaborator) commented on Jun 9, 2022:

This PR implements a Torchscript-compatible TabNet (addressing #2124). The issues in the original implementation were twofold:

  1. An incompatibility between the TorchScript compiler and weights shared between torch.nn.Module objects. This is a somewhat known issue (HuggingFace explicitly accounts for it through a torchscript flag). This PR works around it by first registering a module with the exact same properties as the shared module, then immediately overwriting that attribute with the shared module (see the first sketch after this list).
  2. Inability to script custom autograd.Function subclasses. This is also a known issue (see pytorch/pytorch#22329, "Autodiff for user script functions aka torch.jit.script for autograd.Function"). This PR works around it by decomposing the forward functions of the custom classes into standalone functions, which are scriptable (see the second sketch after this list).
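
To make the first workaround concrete, here is a minimal, hypothetical sketch (not the actual Ludwig code) of the pattern in item 1: a submodule with identical properties is registered first, and the attribute is then immediately overwritten with the shared module.

import torch

class SharedBlock(torch.nn.Module):
    """Hypothetical module illustrating the shared-weights workaround in item 1."""

    def __init__(self, shared_fc: torch.nn.Linear):
        super().__init__()
        # Register a module with the exact same properties as the shared one,
        # so the TorchScript compiler sees an ordinary registered submodule...
        self.fc = torch.nn.Linear(shared_fc.in_features, shared_fc.out_features)
        # ...then immediately overwrite the attribute with the shared module,
        # so the weights are actually shared at runtime.
        self.fc = shared_fc

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.fc(x)

And a similarly hypothetical sketch of the decomposition in item 2, using a made-up thresholding op rather than Ludwig's sparsemax: the forward math lives in a standalone, scriptable function; the custom autograd.Function delegates to it during training; and eval-time code calls the standalone function directly, which is the shape of the snippet discussed in the review below.

import torch

def _threshold_forward(x: torch.Tensor, threshold: float) -> torch.Tensor:
    # Standalone forward computation; this function is scriptable.
    return torch.clamp(x - threshold, min=0.0)

class ThresholdFunction(torch.autograd.Function):
    # Custom autograd.Function with a hand-written backward; not scriptable,
    # so it is only invoked on the training path.
    @staticmethod
    def forward(ctx, x, threshold: float):
        out = _threshold_forward(x, threshold)
        ctx.save_for_backward(out)
        return out

    @staticmethod
    def backward(ctx, grad_output):
        (out,) = ctx.saved_tensors
        return grad_output * (out > 0).to(grad_output.dtype), None

def threshold(x: torch.Tensor, t: float, training: bool) -> torch.Tensor:
    if not training:
        # Eval/scripting path: call the decomposed forward directly.
        return _threshold_forward(x, t)
    return ThresholdFunction.apply(x, t)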

The following validation was run in order to test the new changes:

  1. A new test was added in tests/integration_tests/test_torchscript.py which checks that a trained LudwigModel (with a TabNet combiner) produces the same outputs as its TorchScript equivalent (a generic sketch of this kind of check follows the config below).
  2. Existing tests were modified in order to ensure that the custom autograd classes remained unchanged.
  3. Two Ludwig models were trained on the Titanic dataset with a TabNet combiner: one on the master branch (commit dd026ca2fb9e7e9dc0ef7fb9bcc73ccaae01b8a7) and one on this branch (commit 34fe6da455d44edb8b6cb4947f4595303cf3511d). The two models performed comparably:
    (screenshot: tabnet performance comparison)
    Full config here:
input_features:
  - name: Pclass
    type: category
  - name: Sex
    type: category
  - name: Age
    type: number
    preprocessing:
      missing_value_strategy: fill_with_mean
  - name: SibSp
    type: number
  - name: Parch
    type: number
  - name: Fare
    type: number
    preprocessing:
      missing_value_strategy: fill_with_mean
  - name: Embarked
    type: category

output_features:
  - name: Survived
    type: binary

combiner:
  type: tabnet
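
For reference, here is a generic, hypothetical sketch of the kind of TorchScript parity check described in validation item 1. The assert_torchscript_parity helper is illustrative only, not the actual test in tests/integration_tests/test_torchscript.py, which runs against a trained LudwigModel with the TabNet combiner.

import torch

def assert_torchscript_parity(module: torch.nn.Module, example_input: torch.Tensor) -> None:
    # Script the eval-mode module and verify its outputs match eager execution.
    module.eval()
    scripted = torch.jit.script(module)
    with torch.no_grad():
        eager_out = module(example_input)
        scripted_out = scripted(example_input)
    torch.testing.assert_close(eager_out, scripted_out)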

@brightsparc (Contributor) left a comment:

LGTM

@tgaddair (Collaborator) left a comment:

Amazing! Thanks for doing the regression analysis.

ludwig/modules/tabnet_modules.py (resolved review thread)
# Avoids call to custom autograd.Function during eval to ensure torchscript compatibility
# custom autograd.Function is not scriptable: https://github.com/pytorch/pytorch/issues/22329#issuecomment-506608053
if not training:
    output, _ = _sparsemax_forward(X, dim, k)
Collaborator comment on the lines above:

I imagine this means we cannot use integrated gradients with TorchScript, since we lose the grad info at predict time with this approach. Is that right? Not the end of the world, but something we'll need to think about.

@geoffreyangus (Collaborator, PR author):

Yup, that's correct.

ludwig/utils/entmax/activations.py (resolved review thread)
@tgaddair merged commit 3967cc5 into master on Jun 10, 2022
@tgaddair deleted the fix-ts-tabnet branch on Jun 10, 2022