Add Initial support for ContextNet Encoder and CTC Decoder by titu1994 · Pull Request #630 · NVIDIA-NeMo/NeMo

titu1994 · 2020-05-13T20:16:38Z

Changelog

Added

Add ContextNetEncoder, ContextNetDecoderForCTC neural modules to ASR collection
Add stride_last flag which allows stride and repeat flags to be used simultaneously. It will perform the strided convolution at the final Conv-BN-ReLU sub-block.
Add swish as optional activation function
Add zero_infinity flag to CTCLoss, default to False.
Adds integration test for ContextNetEncoder and ContextNetDecoderForCTC

Modified

Update Squeeze and Excitation sub-module to support different context sizes, support different activation
- Change default se_reduction_ratio to 8 instead of 16.
SpecAugment now supports either an integer or floating point value for time_width.
- If float is passed, adaptively uses it as percentage of current timesteps that should be cut.

Note: Currently, examples/asr/contextnet.py uses JasperDecoderForCTC instead of ContextNetDecoderForCTC. This will be updated in a future PR once full support is present.

Signed-off-by: smajumdar <titu1994@gmail.com>

okuchaiev

few small comments

okuchaiev · 2020-05-13T21:43:41Z

+logging = nemo.logging
+
+
+class ContextNetEncoder(TrainableNM):


Should this inherit from JasperEncoder ?

On second thought, it probably should not inherit JasperEncoder. While yes currently they share exactly same functionality, in the future they will not. In that case, the __init__ call will instantiate multiple JasperBlocks before ContextNetEncoder starts to instantiate its own values.

While there is duplication for now, it is cleaner to separate the two modules

Signed-off-by: smajumdar <titu1994@gmail.com>

lgtm-com · 2020-05-14T21:45:49Z

This pull request introduces 1 alert when merging 8c81303 into a22d325 - view on LGTM.com

new alerts:

1 for Unused import

Signed-off-by: smajumdar <titu1994@gmail.com>

blisc · 2020-05-14T21:51:16Z

+
+    # (ContextNet uses the Jasper baseline encoder and decoder)
+    encoder = nemo_asr.ContextNetEncoder(
+        feat_in=contextnet_params["AudioToMelSpectrogramPreprocessor"]["features"],


Just a note that you can add this inside the yaml itself.
See https://confluence.atlassian.com/bitbucket/yaml-anchors-960154027.html

Thanks for the hint !

Signed-off-by: smajumdar <titu1994@gmail.com>

lgtm-com · 2020-05-14T23:31:41Z

This pull request introduces 1 alert when merging 81330ba into a22d325 - view on LGTM.com

new alerts:

1 for Unused import

Signed-off-by: smajumdar <titu1994@gmail.com>

…Mo#630) * Add SE + context SE support Signed-off-by: smajumdar <titu1994@gmail.com> * Add contextnet components Signed-off-by: smajumdar <titu1994@gmail.com> * Add ContextNet support Signed-off-by: smajumdar <titu1994@gmail.com> * Add config files Signed-off-by: smajumdar <titu1994@gmail.com> * Correct configs Signed-off-by: smajumdar <titu1994@gmail.com> * Add streaming speech command Signed-off-by: smajumdar <titu1994@gmail.com> * Add kernel size factor argument Signed-off-by: smajumdar <titu1994@gmail.com> * Add docstrings Signed-off-by: smajumdar <titu1994@gmail.com> * Update CHANGELOG.md Signed-off-by: smajumdar <titu1994@gmail.com> * Add integration tests Signed-off-by: smajumdar <titu1994@gmail.com> * Style fixes and add docstrings for se_reduction_ratio Signed-off-by: smajumdar <titu1994@gmail.com> * Style fixes in tests Signed-off-by: smajumdar <titu1994@gmail.com> * Correct CHANGELOG.md Signed-off-by: smajumdar <titu1994@gmail.com> * Correctios to docstrings Signed-off-by: smajumdar <titu1994@gmail.com> * Add WandB support to contextnet.py Signed-off-by: smajumdar <titu1994@gmail.com> * Style fixes Signed-off-by: smajumdar <titu1994@gmail.com> * Remove unused import Signed-off-by: smajumdar <titu1994@gmail.com> * Refactor ContextNetEncoder to subclass JasperEncoder Signed-off-by: smajumdar <titu1994@gmail.com> * Remove unused imports Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: ZeroCool <alejandrogilelias940711@gmail.com>

Use a single jinja template for the prompts with and without a document. Also remove the conditionals checking for te presence of a document. Fixes NVIDIA-NeMo#629 Signed-off-by: Derek Higgins <derekh@redhat.com>

titu1994 added 13 commits May 13, 2020 12:14

Add SE + context SE support

5f17859

Signed-off-by: smajumdar <titu1994@gmail.com>

Add contextnet components

aa28939

Signed-off-by: smajumdar <titu1994@gmail.com>

Add ContextNet support

86e3dbd

Signed-off-by: smajumdar <titu1994@gmail.com>

Add config files

8a5c6de

Signed-off-by: smajumdar <titu1994@gmail.com>

Correct configs

39c1a91

Signed-off-by: smajumdar <titu1994@gmail.com>

Add streaming speech command

28a4cb1

Signed-off-by: smajumdar <titu1994@gmail.com>

Add kernel size factor argument

646dd8f

Signed-off-by: smajumdar <titu1994@gmail.com>

Add docstrings

d990c5c

Signed-off-by: smajumdar <titu1994@gmail.com>

Update CHANGELOG.md

e14d5d5

Signed-off-by: smajumdar <titu1994@gmail.com>

Add integration tests

d526488

Signed-off-by: smajumdar <titu1994@gmail.com>

Style fixes and add docstrings for se_reduction_ratio

851350e

Signed-off-by: smajumdar <titu1994@gmail.com>

Style fixes in tests

2e208f6

Signed-off-by: smajumdar <titu1994@gmail.com>

Correct CHANGELOG.md

46cc5c7

Signed-off-by: smajumdar <titu1994@gmail.com>

okuchaiev requested review from blisc and okuchaiev May 13, 2020 21:19

okuchaiev requested changes May 13, 2020

View reviewed changes

titu1994 added 3 commits May 13, 2020 15:35

Correctios to docstrings

a8d7f4c

Signed-off-by: smajumdar <titu1994@gmail.com>

Add WandB support to contextnet.py

6d3e4ca

Signed-off-by: smajumdar <titu1994@gmail.com>

Style fixes

8c81303

Signed-off-by: smajumdar <titu1994@gmail.com>

Remove unused import

66924b6

Signed-off-by: smajumdar <titu1994@gmail.com>

blisc previously approved these changes May 14, 2020

View reviewed changes

Refactor ContextNetEncoder to subclass JasperEncoder

81330ba

Signed-off-by: smajumdar <titu1994@gmail.com>

titu1994 dismissed blisc’s stale review via 81330ba May 14, 2020 23:23

Remove unused imports

7ea9183

Signed-off-by: smajumdar <titu1994@gmail.com>

okuchaiev approved these changes May 16, 2020

View reviewed changes

titu1994 merged commit 99ef493 into NVIDIA-NeMo:master May 16, 2020

titu1994 deleted the se_context_support branch May 16, 2020 05:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Initial support for ContextNet Encoder and CTC Decoder#630

Add Initial support for ContextNet Encoder and CTC Decoder#630
titu1994 merged 19 commits intoNVIDIA-NeMo:masterfrom
titu1994:se_context_support

titu1994 commented May 13, 2020 •

edited

Loading

Uh oh!

okuchaiev left a comment

Uh oh!

okuchaiev May 13, 2020

Uh oh!

titu1994 May 13, 2020

Uh oh!

titu1994 May 13, 2020 •

edited

Loading

Uh oh!

titu1994 May 15, 2020

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lgtm-com Bot commented May 14, 2020

Uh oh!

blisc May 14, 2020

Uh oh!

titu1994 May 14, 2020

Uh oh!

lgtm-com Bot commented May 14, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		logging = nemo.logging


		class ContextNetEncoder(TrainableNM):

Conversation

titu1994 commented May 13, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changelog

Added

Modified

Uh oh!

okuchaiev left a comment

Choose a reason for hiding this comment

Uh oh!

okuchaiev May 13, 2020

Choose a reason for hiding this comment

Uh oh!

titu1994 May 13, 2020

Choose a reason for hiding this comment

Uh oh!

titu1994 May 13, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

titu1994 May 15, 2020

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lgtm-com Bot commented May 14, 2020

Uh oh!

blisc May 14, 2020

Choose a reason for hiding this comment

Uh oh!

titu1994 May 14, 2020

Choose a reason for hiding this comment

Uh oh!

lgtm-com Bot commented May 14, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

titu1994 commented May 13, 2020 •

edited

Loading

titu1994 May 13, 2020 •

edited

Loading