Implements two front-ends for acoustic encoders #17

christophmluscher · 2023-05-31T09:18:43Z

This is not specific for the Conformer so maybe parts/conformer is not the right place?

Atticus1806 · 2023-05-31T09:28:12Z

Some tests regarding the downsampling shapes (similar to #4) would be nice :)

i6_models/parts/conformer/frontend.py

albertz

I think the main problem is that the VGG is applied in the wrong way. Or at least inconsistent to how it usually is done. I don't think this is intended this way.

i6_models/parts/conformer/frontend.py

tests/test_conformer.py

i6_models/parts/conformer/frontend.py

tests/test_conformer.py

i6_models/parts/conformer/frontend.py

tests/test_conformer.py

i6_models/parts/conformer/frontend.py

tests/test_conformer.py

Judyxujj · 2023-06-21T18:17:31Z

I have two questions here,

the param stride should be added to the Config, since usually we set stride in Conv2d to apply the time subsampling
after the front end convolution models, we need to apply one additional linear layer to project the dimension back to the model size. Should the linear layer be included in the front end, or should it be included in the ConformerBlock? I would prefer the former

christophmluscher · 2023-06-21T18:21:10Z

I have two questions here,

the param stride should be added to the Config, since usually we set stride in Conv2d to apply the time subsampling

already added but not pushed yet :)

after the front end convolution models, we need to apply one additional linear layer to project the dimension back to the model size. Should the linear layer be included in the front end, or should it be included in the ConformerBlock? I would prefer the former

I was thinking about putting this as an option into the frontend: with param int N -> add linear layer with N outputs, None -> no linear layer

@Atticus1806

christophmluscher · 2023-06-21T19:12:38Z

tests fail due to the seq mask update missing. I am not 100% sure how to perform the update in a clean fashion. Is there a PyTorch way to do this?

albertz · 2023-06-21T20:13:44Z

tests fail due to the seq mask update missing. I am not 100% sure how to perform the update in a clean fashion. Is there a PyTorch way to do this?

I'm not exactly sure what you refer to. What update do you mean? You mean how to compute the seq mask which is supposed to be returned by the frontend? As I mentioned, you could apply maxpooling with the same striding and kernel size just as the other operations.

christophmluscher · 2023-06-21T20:37:09Z

you could apply maxpooling with the same striding and kernel size just as the other operations.

this would also work for conv layers?

tests/test_conformer.py

i6_models/parts/frontend/vgg.py

tests/test_conformer.py

i6_models/parts/frontend/protocol.py

i6_models/parts/frontend/vgg.py

i6_models/parts/frontend/common.py

christophmluscher · 2023-07-19T10:41:24Z

@JackTemaki @albertz

JackTemaki

I would change the code to only support tuples instead of both tuples and integer, this makes the code more readable and more consistent.

albertz · 2023-07-19T14:55:51Z

I would change the code to only support tuples instead of both tuples and integer, this makes the code more readable and more consistent.

I'm not sure I agree. PyTorch itself also supports both. And the more common use case it that the user provides just a single int.

i6_models/parts/frontend/vgg_act.py

i6_models/parts/frontend/common.py

michelwi · 2023-08-24T12:21:44Z

I have two questions here,

the param stride should be added to the Config, since usually we set stride in Conv2d to apply the time subsampling

already added but not pushed yet :)

Was this added in the final version of the PR, I don't seem to find it..

christophmluscher requested a review from Atticus1806 May 31, 2023 09:18

christophmluscher self-assigned this May 31, 2023

christophmluscher force-pushed the chris-frontend branch from 4aaa978 to ba3fb06 Compare May 31, 2023 09:25

christophmluscher marked this pull request as ready for review June 1, 2023 15:13

christophmluscher requested review from albertz, curufinwe, JackTemaki, mmz33, michelwi, Judyxujj, vieting and kuacakuaca June 1, 2023 15:14

kuacakuaca reviewed Jun 1, 2023

View reviewed changes

i6_models/parts/conformer/frontend.py Outdated Show resolved Hide resolved

i6_models/parts/conformer/frontend.py Outdated Show resolved Hide resolved

albertz requested changes Jun 1, 2023

View reviewed changes

albertz reviewed Jun 2, 2023

View reviewed changes

tests/test_conformer.py Outdated Show resolved Hide resolved

i6_models/parts/conformer/frontend.py Outdated Show resolved Hide resolved

albertz reviewed Jun 2, 2023

View reviewed changes

i6_models/parts/conformer/frontend.py Outdated Show resolved Hide resolved

i6_models/parts/conformer/frontend.py Outdated Show resolved Hide resolved

Atticus1806 reviewed Jun 5, 2023

View reviewed changes

christophmluscher requested a review from albertz June 15, 2023 08:38

albertz reviewed Jun 21, 2023

View reviewed changes

albertz reviewed Jun 22, 2023

View reviewed changes

i6_models/parts/frontend/vgg.py Outdated Show resolved Hide resolved

albertz reviewed Jun 22, 2023

View reviewed changes

tests/test_conformer.py Outdated Show resolved Hide resolved

albertz reviewed Jun 22, 2023

View reviewed changes

i6_models/parts/frontend/vgg.py Outdated Show resolved Hide resolved

christophmluscher added 2 commits July 17, 2023 15:52

remove implicit assumption for dim update

56b27bf

smaller namespace

4810f8b

albertz reviewed Jul 17, 2023

View reviewed changes

i6_models/parts/frontend/common.py Outdated Show resolved Hide resolved

albertz reviewed Jul 17, 2023

View reviewed changes

i6_models/parts/frontend/common.py Outdated Show resolved Hide resolved

cleanup

3cf4413

christophmluscher requested a review from albertz July 19, 2023 10:40

JackTemaki requested changes Jul 19, 2023

View reviewed changes

christophmluscher added 2 commits July 19, 2023 16:22

force usage of tuple, no int

c80790c

missing conversion int to tuple

3c059eb

JackTemaki reviewed Jul 19, 2023

View reviewed changes

i6_models/parts/frontend/vgg_act.py Outdated Show resolved Hide resolved

albertz reviewed Jul 19, 2023

View reviewed changes

i6_models/parts/frontend/common.py Outdated Show resolved Hide resolved

albertz reviewed Jul 19, 2023

View reviewed changes

i6_models/parts/frontend/common.py Outdated Show resolved Hide resolved

albertz reviewed Jul 19, 2023

View reviewed changes

i6_models/parts/frontend/common.py Outdated Show resolved Hide resolved

christophmluscher added 2 commits July 19, 2023 17:44

better type hinting

b4c3cb4

add padding var

81e1488

albertz reviewed Jul 19, 2023

View reviewed changes

i6_models/parts/frontend/common.py Outdated Show resolved Hide resolved

albertz reviewed Jul 19, 2023

View reviewed changes

i6_models/parts/frontend/common.py Outdated Show resolved Hide resolved

christophmluscher added 3 commits July 19, 2023 20:23

turn into kwargs

7ff0589

more

e61d9e9

fix padding kernel check for mask pool

9ee15dd

christophmluscher requested review from albertz and JackTemaki July 20, 2023 09:14

JackTemaki approved these changes Jul 20, 2023

View reviewed changes

albertz approved these changes Jul 21, 2023

View reviewed changes

christophmluscher merged commit 18d3f1a into main Jul 21, 2023
2 checks passed

christophmluscher deleted the chris-frontend branch July 21, 2023 17:02

michelwi mentioned this pull request Aug 24, 2023

add conv4_stride to VGG4LayerActFrontendV1 #33

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implements two front-ends for acoustic encoders #17

Implements two front-ends for acoustic encoders #17

christophmluscher commented May 31, 2023

Atticus1806 commented May 31, 2023

albertz left a comment

Judyxujj commented Jun 21, 2023

christophmluscher commented Jun 21, 2023

christophmluscher commented Jun 21, 2023

albertz commented Jun 21, 2023

christophmluscher commented Jun 21, 2023

christophmluscher commented Jul 19, 2023

JackTemaki left a comment

albertz commented Jul 19, 2023

michelwi commented Aug 24, 2023

Implements two front-ends for acoustic encoders #17

Implements two front-ends for acoustic encoders #17

Conversation

christophmluscher commented May 31, 2023

Atticus1806 commented May 31, 2023

albertz left a comment

Choose a reason for hiding this comment

Judyxujj commented Jun 21, 2023

christophmluscher commented Jun 21, 2023

christophmluscher commented Jun 21, 2023

albertz commented Jun 21, 2023

christophmluscher commented Jun 21, 2023

christophmluscher commented Jul 19, 2023

JackTemaki left a comment

Choose a reason for hiding this comment

albertz commented Jul 19, 2023

michelwi commented Aug 24, 2023