
Add parametrization for normalization #20

Open

y-prudent wants to merge 6 commits into develop from feat-parametrization-for-normalization

Conversation

@y-prudent y-prudent (Member) commented Nov 29, 2023

Prerequisite: Torch parametrization tutorial

Features

  • Rewrite bjorck, frobenius, and lconv normalizations so that they use torch parametrization instead of forward pre-hooks (see the sketch after the snippet below).
  • Faster inference in model.eval() mode: when model.training is False, spectral and bjorck normalizations reuse cached tensors, so the normalization comes for free.
  • Vanilla model conversion can now be done on any model (not only Sequential) with vanilla_model. Be careful: this is an in-place conversion!
  • The torch parametrize.cached() feature is now also usable on Lipschitz layers, which saves memory and compute when the same kernel is applied several times in a single inference step (very useful for RNNs, multi-level convolutions, etc.). Here is how to use it:
import torch.nn.utils.parametrize as P

with P.cached():
    y1 = lip_layer(x1)    # at the first call, compute normalization and save the reparametrized weights
    y2 = lip_layer(x2)    # at the second call, reuse saved reparametrized weights
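
For reference, here is a minimal sketch of how a normalization can be expressed as a torch parametrization instead of a forward pre-hook. This is not the torchlip implementation: the FrobeniusNormalize module and the plain nn.Linear layer are purely illustrative.

import torch
import torch.nn as nn
import torch.nn.utils.parametrize as parametrize

class FrobeniusNormalize(nn.Module):
    # Illustrative parametrization: divide the weight by its Frobenius norm
    def forward(self, weight):
        return weight / weight.norm()

layer = nn.Linear(16, 16)
# The parametrization is recomputed each time layer.weight is accessed,
# instead of being triggered by a forward pre-hook
parametrize.register_parametrization(layer, "weight", FrobeniusNormalize())

y = layer(torch.randn(4, 16))  # uses the weight computed from the original tensor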

⚠️ Important note:

Models using parametrized modules can only be serialized through state_dict(). As a result, torch.save(model, PATH) is no longer possible and will raise an error. Instead, save and load your models like this:

# save model
torch.save(
    {
        "model_state_dict": model.state_dict(),
        "model_kwargs": config,  # arguments used to build the model
    },
    PATH,
)

# load model
checkpoint = torch.load(PATH)
model = TheModelClass(**checkpoint["model_kwargs"])
model.load_state_dict(checkpoint["model_state_dict"])
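
This works because the parametrizations are re-registered when TheModelClass is instantiated, so load_state_dict only has to restore the underlying (unconstrained) tensors.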

For more information, check this torch tutorial.

TODO

  • Rewrite bjorck, frobenius, and lconv normalizations with parametrization
  • Manage the removal of individual parametrizations (torch's native functions only allow removing all the parametrizations of a tensor at once; see the sketch after this list)
  • Manage the parametrization of lconv_norm, which depends on the input shape (=> solution: forward pre-hooks + parametrization)
  • Also parametrize the global Lipschitz coefficient multiplication
  • Write the associated tests
  • Remove the deprecated hook-related files
  • Check that vanilla_export works well
  • Check the parametrize.cached() feature
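
For the items about removing parametrizations and vanilla_export, a minimal sketch of the torch-native building block is shown below. It is only an illustration (the FrobeniusNormalize module is a toy parametrization, not the torchlip one), and remove_parametrizations indeed drops all parametrizations of a given tensor at once.

import torch
import torch.nn as nn
import torch.nn.utils.parametrize as parametrize

class FrobeniusNormalize(nn.Module):
    # Illustrative parametrization: divide the weight by its Frobenius norm
    def forward(self, weight):
        return weight / weight.norm()

layer = nn.Linear(16, 16)
parametrize.register_parametrization(layer, "weight", FrobeniusNormalize())

# Bake the current normalized value into a plain nn.Parameter and drop the
# parametrization machinery; the layer then behaves like a vanilla module
parametrize.remove_parametrizations(layer, "weight", leave_parametrized=True)
print(parametrize.is_parametrized(layer))  # False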

@y-prudent y-prudent force-pushed the feat-parametrization-for-normalization branch from 7ee8f09 to a3b262f on November 29, 2023 18:04
@y-prudent y-prudent force-pushed the feat-parametrization-for-normalization branch from a3b262f to 80a75a0 on November 29, 2023 18:47
@y-prudent y-prudent marked this pull request as ready for review November 30, 2023 12:58
@cofri cofri (Collaborator) left a comment
Very interesting PR!
Just a small suggestion for the vanilla_export import.

@@ -8,7 +8,7 @@ jobs:
     strategy:
       max-parallel: 4
       matrix:
-        python-version: [3.6, 3.7, 3.8]
+        python-version: [3.7, 3.8, 3.9]
@cofri cofri (Collaborator) commented:

From the status of Python versions, Python 3.7 is already deprecated (end-of-life). Maybe we can use a broader test matrix with more recent Python versions (and also PyTorch versions?).
That is a bigger change, though, and it could be postponed to a future PR.

deel/torchlip/modules/module.py (resolved review thread)