
ENH Normalizing the last layer in all the models #520

Merged
merged 53 commits into from Sep 20, 2023

Conversation

brunaafl
Collaborator

@brunaafl brunaafl commented Sep 8, 2023

Changed the last layers' names to final_layer, cf. #457.

Fixed TIDNet's and HybridNet's inheritance (open issue).

@sliwy
Collaborator

sliwy commented Sep 8, 2023

@brunaafl You may want to take a look at #513 as there may be some conflicts, and try to coordinate / see what the common parts are :)

@brunaafl
Collaborator Author

brunaafl commented Sep 8, 2023

@brunaafl You may want to take a look at #513 as there may be some conflicts, and try to coordinate / see what the common parts are :)

Thanks! I will!

@brunaafl brunaafl marked this pull request as draft September 8, 2023 12:41
@codecov

codecov bot commented Sep 8, 2023

Codecov Report

Merging #520 (41d8ee0) into master (64b8e38) will increase coverage by 0.16%.
Report is 2 commits behind head on master.
The diff coverage is 93.91%.

@@            Coverage Diff             @@
##           master     #520      +/-   ##
==========================================
+ Coverage   84.55%   84.72%   +0.16%     
==========================================
  Files          63       63              
  Lines        4676     4741      +65     
==========================================
+ Hits         3954     4017      +63     
- Misses        722      724       +2     

@brunaafl
Collaborator Author

brunaafl commented Sep 8, 2023

TODO: Add a new way to handle returning features

During the renaming, I had to remove the option of returning the feature extractor's output where it existed (in the SleepStagerEldele2021, SleepStagerBlanco2020, SleepStagerChambon2018, and DeepSleepNet models). However, some tutorials rely on this, so they are now broken.

@PierreGtch
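
One possible way to keep exposing the feature extractor's output without a dedicated return flag is a forward hook on the sub-module preceding final_layer; this is only a sketch on a toy model, not necessarily what the PR settled on:

    import torch
    from torch import nn

    # Toy stand-in for a model that now ends in a module named `final_layer`
    # (names and shapes here are illustrative only).
    model = nn.Sequential()
    model.add_module(
        "feature_extractor",
        nn.Sequential(nn.Conv1d(2, 8, kernel_size=3), nn.ReLU(), nn.Flatten()),
    )
    model.add_module("final_layer", nn.Linear(8 * 98, 5))

    features = {}

    def save_features(module, inputs, output):
        # Called during the forward pass; stores the feature extractor's output.
        features["feats"] = output.detach()

    model.feature_extractor.register_forward_hook(save_features)

    x = torch.randn(4, 2, 100)  # (batch, channels, time)
    logits = model(x)           # a normal forward pass also fills `features`
    print(logits.shape, features["feats"].shape)  # (4, 5) and (4, 784)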

@brunaafl brunaafl marked this pull request as ready for review September 8, 2023 15:54
@PierreGtch
Collaborator

@brunaafl on it

@PierreGtch
Collaborator

can you update the whats_new?

@brunaafl
Collaborator Author

brunaafl commented Sep 9, 2023

can you update the whats_new?

Yes! I'll do that

@sliwy
Collaborator

sliwy commented Sep 9, 2023

thx for the PR @brunaafl

We discussed with @PierreGtch in more detail how this should be done to support the transfer learning approach and be easy to use. We identified a few cases in which we should act in a slightly different manner, so the PR will require some changes. We would be really happy if you could find some time to address them (a rough sketch of the intended layout follows at the end of this comment):

  1. The case in which we have an nn.Module -> we would like the last layer producing a tensor with one of its dimensions equal to n_outputs to be called final_layer, and all the modules after it should also be incorporated into an nn.Sequential together with it. Sometimes this requires splitting an nn.Sequential into two parts or merging an nn.Linear with some reshaping. The reshaping should be included in final_layer, because if we replace final_layer with nn.Identity the code will break due to a dimension mismatch.
  2. If the model is an nn.Sequential, we would likewise like the last layer producing a tensor with one of its dimensions equal to n_outputs to be called final_layer, with all the modules after it incorporated into the same module.
  3. LogSoftmax -> we would like to leave LogSoftmax untouched, as it is going to be renamed in another PR that solves a different problem. Let's assume that LogSoftmax does not change the shape of the tensor and should not be included in final_layer.

@brunaafl would you have time to tackle that? if you have any doubts I would be happy to help! 😄
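
As a rough illustration of cases 1 and 2, here is a minimal toy model (module names and shapes are made up for this sketch, they are not braindecode's) where the layer producing the n_outputs dimension and the reshaping after it are grouped into a single final_layer:

    import torch
    from torch import nn

    n_outputs = 4

    # Toy model: the classification conv and the reshaping that follows it are
    # grouped into one sub-module named `final_layer`, so that swapping
    # `final_layer` for nn.Identity() later yields the features directly.
    model = nn.Sequential()
    model.add_module(
        "feature_extractor",
        nn.Sequential(nn.Conv2d(1, 16, (3, 1)), nn.ELU()),
    )
    model.add_module(
        "final_layer",
        nn.Sequential(
            nn.Conv2d(16, n_outputs, (8, 1)),  # output has a dimension equal to n_outputs
            nn.Flatten(start_dim=1),           # reshaping stays inside final_layer
        ),
    )

    x = torch.randn(2, 1, 10, 1)  # (batch, 1, time, 1), toy input shape
    print(model(x).shape)         # torch.Size([2, 4])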

@brunaafl
Collaborator Author

brunaafl commented Sep 9, 2023

Hi! Yes, I can address those changes, although not today, only from tomorrow onwards. I'm happy to help!

@brunaafl brunaafl requested a review from sliwy September 9, 2023 10:25
@brunaafl
Collaborator Author

Hi! I was wondering, just to check that I really understood. So, you want the first layer that changes the shape of the tensor so that one of its dimensions equals n_outputs to be called final_layer, and all the other modules after it incorporated into the same Sequential with it. Is that right? And LogSoftmax should not be included in this.

My doubt is about the case where LogSoftmax comes between a conv that changes one of the dimensions to n_outputs and another layer (such as one that squeezes the final output), as in Deep4Net and EEGNet. Should I just leave LogSoftmax inside the final_layer module, since it will be changed in a later PR?

@PierreGtch
Collaborator

@brunaafl sorry for the confusion, there was some miscommunication at multiple levels.

So everything after and including the classification layer (sometimes it is an nn.Linear, sometimes an nn.Conv2d, ...) must go into this new final_layer.
Typically you will find a softmax, a reshape, a transpose, a squeeze, etc. They should all go inside final_layer. For example, Deep4Net will change from:

        self.add_module(
            "conv_classifier",
            nn.Conv2d(
                self.n_filters_4,
                self.n_outputs,
                (self.final_conv_length, 1),
                bias=True,
             ),
        )
        self.add_module("softmax", nn.LogSoftmax(dim=1))
        self.add_module("squeeze", Expression(squeeze_final_output))

TO:

        self.add_module("final_layer", nn.Sequential())
        self.final_layer.add_module(
            "conv_classifier",
            nn.Conv2d(
                self.n_filters_4,
                self.n_outputs,
                (self.final_conv_length, 1),
                bias=True,
             ),
        )
        self.final_layer.add_module("softmax", nn.LogSoftmax(dim=1))
        self.final_layer.add_module("squeeze", Expression(squeeze_final_output))

If it's still not clear, please tell me
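
The point of keeping the reshaping inside final_layer is that the whole head can then be swapped out in one step. A minimal sketch on a toy model (not braindecode code) of what that enables:

    import torch
    from torch import nn

    # Toy model following the same feature-extractor + final_layer pattern.
    model = nn.Sequential()
    model.add_module(
        "features",
        nn.Sequential(nn.Conv1d(3, 16, kernel_size=5), nn.ReLU()),
    )
    model.add_module(
        "final_layer",
        nn.Sequential(nn.Conv1d(16, 4, kernel_size=6), nn.Flatten(start_dim=1)),
    )

    x = torch.randn(8, 3, 10)

    # Drop the whole head in one step to get the features ...
    model.final_layer = nn.Identity()
    feats = model(x)   # shape (8, 16, 6): the raw feature-extractor output

    # ... or swap in a new head for a different number of classes.
    model.final_layer = nn.Sequential(
        nn.Conv1d(16, 2, kernel_size=6), nn.Flatten(start_dim=1)
    )
    logits = model(x)  # shape (8, 2)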

@brunaafl
Collaborator Author

Okay, it's clear now, thanks!

brunaafl and others added 4 commits September 18, 2023 10:05
Co-authored-by: PierreGtch <25532709+PierreGtch@users.noreply.github.com>
Co-authored-by: PierreGtch <25532709+PierreGtch@users.noreply.github.com>
sliwy
sliwy previously requested changes Sep 18, 2023
Collaborator

@sliwy sliwy left a comment


Thanks @brunaafl for the new changes! :)

I added a few more comments. Most importantly, in eegconformer.py there are 4 " characters in two docstrings, so we have to fix that before merging (I have added suggestions). Then, a few details about naming attributes or removing old attributes.

@bruAristimunha do you have any idea why the tests didn't run on this PR after the last commit?

Files with review comments:
braindecode/models/eegconformer.py
braindecode/models/hybrid.py
braindecode/models/shallow_fbcsp.py
braindecode/models/tcn.py
braindecode/models/eegitnet.py
brunaafl and others added 15 commits September 19, 2023 11:35
Co-authored-by: Maciej Sliwowski <macieksliwowski@gmail.com>
Co-authored-by: Maciej Sliwowski <macieksliwowski@gmail.com>
Co-authored-by: Maciej Sliwowski <macieksliwowski@gmail.com>
Co-authored-by: Maciej Sliwowski <macieksliwowski@gmail.com>
Co-authored-by: Maciej Sliwowski <macieksliwowski@gmail.com>
Co-authored-by: Maciej Sliwowski <macieksliwowski@gmail.com>
Co-authored-by: Maciej Sliwowski <macieksliwowski@gmail.com>
Co-authored-by: Maciej Sliwowski <macieksliwowski@gmail.com>
Co-authored-by: Maciej Sliwowski <macieksliwowski@gmail.com>
@bruAristimunha bruAristimunha enabled auto-merge (squash) September 19, 2023 15:00
@bruAristimunha bruAristimunha enabled auto-merge (squash) September 20, 2023 12:58
@bruAristimunha bruAristimunha merged commit 3247744 into braindecode:master Sep 20, 2023
17 checks passed
@bruAristimunha
Collaborator

Thank you @brunaafl, @sliwy and @PierreGtch
