Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[TTS] bugfix IPAG2P and refactor to remove duplicate process. #5304

Merged
merged 6 commits into from Nov 2, 2022

Conversation

XuesongYang
Copy link
Collaborator

@XuesongYang XuesongYang commented Nov 1, 2022

Signed-off-by: Xuesong Yang 1646669+XuesongYang@users.noreply.github.com

What does this PR do ?

  1. fixed a bug for IPAG2P: the returned self.phoneme_dict mismatches between the one obtained from self.parse_as_cmu_dict and the one directly updated from a dict obj. The former will return something like {..., "WIRE": [["ˈ", "w", "a", "ɪ", "ɚ"], ["ˈ", "w", "a", "ɪ", "ɹ"]], ...}, while the latter will return like {..., "WIRE": ["ˈ", "w", "a", "ɪ", "ɚ"], ...}.
  2. refactored to remove duplicate processes for the case when the phoneme dict input is a dict object and for the case when the phoneme dict is a file.
  3. added type hints and docstrings for the whole class IPAG2P.

Collection: [Note which collection this PR will affect]

Changelog

  • Add specific line by line info of high level changes in this PR.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

nemo_text_processing/g2p/modules.py Outdated Show resolved Hide resolved
nemo_text_processing/g2p/modules.py Outdated Show resolved Hide resolved
nemo_text_processing/g2p/modules.py Outdated Show resolved Hide resolved
nemo_text_processing/g2p/modules.py Outdated Show resolved Hide resolved
nemo_text_processing/g2p/modules.py Outdated Show resolved Hide resolved
nemo_text_processing/g2p/modules.py Outdated Show resolved Hide resolved
nemo_text_processing/g2p/modules.py Outdated Show resolved Hide resolved
nemo_text_processing/g2p/modules.py Outdated Show resolved Hide resolved
nemo_text_processing/g2p/modules.py Outdated Show resolved Hide resolved
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
…mentation.

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Copy link
Collaborator

@redoctopus redoctopus left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Discussed on Slack, will have a separate PR for extending support for different dict formats.

@XuesongYang XuesongYang merged commit 8755bf8 into main Nov 2, 2022
@XuesongYang XuesongYang deleted the refactor-g2p branch November 2, 2022 20:53
1-800-BAD-CODE pushed a commit to 1-800-BAD-CODE/NeMo that referenced this pull request Nov 13, 2022
…#5304)

* [TTS] bugfix IPAG2P and refactor to remove duplicate process.
* added type hints and rename func.
* unify str and list(str) as list(str).
* revise logging message when phoneme_dict_obj is empty

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: 1-800-bad-code <shane.carroll@utsa.edu>
Davood-M added a commit that referenced this pull request Nov 15, 2022
* remove stage wrapper from parallel

Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>

* fixed the onnx bug in conformer for non-streaming models. (#5242)

Signed-off-by: Vahid <vnoroozi@nvidia.com>

Signed-off-by: Vahid <vnoroozi@nvidia.com>

* [Tools][ASR] Tool for generating data using simulated RIRs (#5158)

[Tools][ASR] Tool for generating data using simulated RIRs

Signed-off-by: Ante Jukić <ajukic@nvidia.com>

* Add fully torch.jit.script-able speaker clustering module (#5191)

* Add files for commit

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Added parallelism on p-value search

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Changed speaker clustering to accept torch.tensor

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Cleaned up the code and tested to have identical output

Signed-off-by: Taejin Park <tango4j@gmail.com>

* update on Notebook demo

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Added eigvalsh for faster eig val calculation:

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Remove NMESC_JitScriptedModule.ipynb

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Cleaned code and style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Modified MSDD framework to fit torch-scripted clustering

Signed-off-by: Taejin Park <tango4j@gmail.com>

* LGTM fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* removed all string based timestamps

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Removed unnecessary lines

Signed-off-by: Taejin Park <tango4j@gmail.com>

* removed redundant lines

Signed-off-by: Taejin Park <tango4j@gmail.com>

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Update perturb.py (#5231)

* Update perturb.py

Add checking for channels mismatch for audio and noise data, throw an exception if they have different number of channels. Also fixed `perturb_with_foreground_noise` as done in `perturb_with_input_noise` 

Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update check and teest

Signed-off-by: stevehuang52 <heh@nvidia.com>

* fix test

Signed-off-by: stevehuang52 <heh@nvidia.com>

Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* remove CV requirements. (#5233)

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* Fix link to inference notebook (#5247) (#5251)

Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>

Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>

Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>

* checks for accepted adapter type at module level (#5194)

* add accepted adapter functionality into transformer, mlp and attention

Signed-off-by: arendu <adithya.r@gmail.com>

* fix to t5 adapter and ia3 evals due to predict_step dictionary key changes

Signed-off-by: arendu <adithya.r@gmail.com>

* use mixin logic for adapters in ParallelAttention and ParallelMLP classes

Signed-off-by: arendu <adithya.r@gmail.com>

* typo fix

Signed-off-by: arendu <adithya.r@gmail.com>

* updates

Signed-off-by: arendu <adithya.r@gmail.com>

* moved adapter tools

Signed-off-by: arendu <adithya.r@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix error with t5 adapter

Signed-off-by: arendu <adithya.r@gmail.com>

* updates'

Signed-off-by: arendu <adithya.r@gmail.com>

* replace ColumnParallelLinear with nn.Linear in export_utils

Signed-off-by: arendu <adithya.r@gmail.com>

* remove ColumnLinear

Signed-off-by: arendu <adithya.r@gmail.com>

* typo fix

Signed-off-by: arendu <adithya.r@gmail.com>

* update to check config targets

Signed-off-by: arendu <adithya.r@gmail.com>

* updates

Signed-off-by: arendu <adithya.r@gmail.com>

* refactor so that mixin is adapter name agnostic

Signed-off-by: arendu <adithya.r@gmail.com>

* fix merge conflict

Signed-off-by: arendu <adithya.r@gmail.com>

* minor

Signed-off-by: arendu <adithya.r@gmail.com>

* minor

Signed-off-by: arendu <adithya.r@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* using class comparison instead of string match

Signed-off-by: arendu <adithya.r@gmail.com>

* fix test fail

Signed-off-by: arendu <adithya.r@gmail.com>

* fixed checks for add_adapter

Signed-off-by: arendu <adithya.r@gmail.com>

* fixed checks for add_adapter

Signed-off-by: arendu <adithya.r@gmail.com>

Signed-off-by: arendu <adithya.r@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>

* fix groovy syntax

Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>

* fix hypotheses return (#5253)

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* Update ASR scores table (#5254) (#5255)

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>

* Support for inserting additional subsampling in conformer encoder (#5224)

* Change the default position of the reduction position to null and rename subsampling reduction to striding

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

* Put the caching logic outside the conformer encoder

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add description of the reduction parameters in the configs

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update test_asr_exportables with correct reduction position value

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* asr and nmt tests in parallel

Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>

* add more users who can trigger blossom-ci

Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>

* path fix

Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>

* fix paths, remove redundant test

Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>

* add symlink

Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>

* Modernize RNNT ONNX export and add TS export (#5248)

* Upgrade rnnt export for CUDA/CPU/TRT

Signed-off-by: smajumdar <titu1994@gmail.com>

* Update runtime script for onnx exported model to modern API

Signed-off-by: smajumdar <titu1994@gmail.com>

* Finalize code

Signed-off-by: smajumdar <titu1994@gmail.com>

* Remove comments

Signed-off-by: smajumdar <titu1994@gmail.com>

* Remove redundant stuff from tests

Signed-off-by: smajumdar <titu1994@gmail.com>

* Update test

Signed-off-by: smajumdar <titu1994@gmail.com>

* Remove onnx rnnt export test due to lack of onnxruntime install

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>

* update tutorials to use meeting config as default and VAD (#5237)

* update tutorials to use meeting config as default and VAD

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* update model path

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* Fix links to speaker identification notebook (#5260) (#5261)

Signed-off-by: SeanNaren <snarenthiran@nvidia.com>

Signed-off-by: SeanNaren <snarenthiran@nvidia.com>

Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>

* add shm-size

Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>

* [TTS] Fastpitch energy condition and refactoring (#5218)

* Incorporating Energy conditioning in FastPitch

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* Minor fixes in Energy conditioning in FastPitch

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* Add Energy conditioning in FastPitch to infer method

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* adding fn to function names

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* Incorporating Energy conditioning in FastPitch

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* Minor fixes in Energy conditioning in FastPitch

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* Add Energy conditioning in FastPitch to infer method

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* adding fn to function names

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* remove ifelse from batching, minor refactoring changes in energy code

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Refactor based on PR comments.

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* Added support for not learning alignment in energy

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* Fix typo in assert statemetn

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* Renaming average_pitch to average_features

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* Renaming len variable name as it is a keyword

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

* Renaming len variable name as it is a keyword

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* [TTS] HiFi-TTS Download Script (#5241)

* Hifi tts download script

Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>

* comment and remove imports

Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>

Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* Specifying audio signal dropout separately for the Conformer Encoder (#5263)

* Fixed bug in transcribe_speech.py where decoding strategy was not being updated.

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add option to specify audio dropout separately for conformer encoders

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

* Add audio dropout option to test_asr_exportables

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

* Rename dropout_audio to dropout_pre_encode

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

* Update the comments in squeezeformer configs referring to conformer modules

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* created (#5268)

* created

* bug

Signed-off-by: Dima Rekesh <drekesh@nvidia.com>

Signed-off-by: Dima Rekesh <drekesh@nvidia.com>
Co-authored-by: Dima Rekesh <drekesh@nvidia.com>

* [TTS] Add Mandarin/English Bilingual Recipe for Training Fastpitch Models (#5208)

* Add Chinese TTS tokenizer and G2P.
* Add data process script.
* Add tutorial.

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

* Minor typo fixes in TTS tutorial (#5266) (#5272)

Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>

Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>

Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>

* Fix failing speaker counting for short audio samples (#5267)

* Add files for commit

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Added parallelism on p-value search

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Changed speaker clustering to accept torch.tensor

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Cleaned up the code and tested to have identical output

Signed-off-by: Taejin Park <tango4j@gmail.com>

* update on Notebook demo

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Added eigvalsh for faster eig val calculation:

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Remove NMESC_JitScriptedModule.ipynb

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Cleaned code and style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Modified MSDD framework to fit torch-scripted clustering

Signed-off-by: Taejin Park <tango4j@gmail.com>

* LGTM fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* removed all string based timestamps

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Removed unnecessary lines

Signed-off-by: Taejin Park <tango4j@gmail.com>

* removed redundant lines

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Add enhanced speaker count back

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fixed minor docstrings

Signed-off-by: Taejin Park <tango4j@gmail.com>

* removed import Counter

Signed-off-by: Taejin Park <tango4j@gmail.com>

Signed-off-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* Pcla tutorial fixes (#5271) (#5273)

* Fixed typos

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* Fixed cell type and tatoeba reference

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* Fixed typo

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* Fixed branch variable

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Co-authored-by: Matvei Novikov <mattyson.so@gmail.com>

* Add Gradio App to ASR Docs (#5270)

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>

* Fix bug into Dialogue tutorial (#5277) (#5280)

Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com>

* [TTS] fixed type of filepath and rename openslr. (#5276)

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* O2bert + apex pipeline functions (#5221)

* Global batch size support for validation

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Global batch size support for bert

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* bert batch support

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* bert batch size support

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* O2 support for bert

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update megatron_bert_pretraining.py

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* Update megatron_bert_model.py

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update megatron_bert_config.yaml

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update megatron_bert_model.py

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* Bug fix

* Bug fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Bug fix

* Bug fix

* Bug fix

* Update megatron_bert_config.yaml

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* Addressed Sandeeps comments

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* Update Jenkinsfile

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* Update Jenkinsfile

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* Update megatron_bert_model.py

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* Update Jenkinsfile

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* Typo fix (#5288) (#5291)

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Co-authored-by: Matvei Novikov <mattyson.so@gmail.com>

* Upperbound PTL (#5302)

* Upperbound PTL

Signed-off-by: smajumdar <titu1994@gmail.com>

* Upperbound PTL

Signed-off-by: smajumdar <titu1994@gmail.com>

* Upperbound PTL

Signed-off-by: smajumdar <titu1994@gmail.com>

* Upperbound PTL

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>

* Add support for Sampled Softmax for RNNT Joint (#5216)

* Initial prototype of SampldRNNTJoint

Signed-off-by: smajumdar <titu1994@gmail.com>

* Implement randperm based noise selection algo

Signed-off-by: smajumdar <titu1994@gmail.com>

* First working prototype of sampled rnnt !

Signed-off-by: smajumdar <titu1994@gmail.com>

* Add note for why we need this remap before i forget

Signed-off-by: smajumdar <titu1994@gmail.com>

* Finalize version that works with sampling

Signed-off-by: smajumdar <titu1994@gmail.com>

* Update docs for rnnt decoder and joint

Signed-off-by: smajumdar <titu1994@gmail.com>

* Remove the adjustment_val for softmax

Signed-off-by: smajumdar <titu1994@gmail.com>

* Update config and docs for Sampled Softmax

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>

* Update Interface(s) phonetic entry (#5212)

* change interface(s) phone

Signed-off-by: Jason <jasoli@nvidia.com>

* push version

Signed-off-by: Jason <jasoli@nvidia.com>

* update dict path

Signed-off-by: Jason <jasoli@nvidia.com>

Signed-off-by: Jason <jasoli@nvidia.com>

* [TTS] remove obsolete torch_tts unit test marker and replace with run_only_on('CPU') (#5307)

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* Fixes for Conformer-xl export (#5309)

* Fixing runtime check for ONNX > 2G

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

* Fixing ONNX export

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

* add label inference support to EncDecSpeakerLabel class (#5278)

* add label inference support to EncDecSpeakerLabel class

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* add necessary tests

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* reflect on comments

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* grammatical correction

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* minor doc string changes

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>

* [TTS] bugfix IPAG2P and refactor to remove duplicate process. (#5304)

* [TTS] bugfix IPAG2P and refactor to remove duplicate process.
* added type hints and rename func.
* unify str and list(str) as list(str).
* revise logging message when phoneme_dict_obj is empty

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* Update path to get_data.py in TTS tutorial (#5311)

Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>

Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>

* Add italian model checkpoints (#5315)

Signed-off-by: Igor Gitman <igitman@nvidia.com>

Signed-off-by: Igor Gitman <igitman@nvidia.com>

* Text Memmap Parsing Improvements (#5265)

* 1. Fixed text-memmap issue when boundary (new-line) is missing from end of file).

Signed-off-by: Micha Livne <mlivne@nvidia.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* 1. Fixed style.

Signed-off-by: Micha Livne <mlivne@nvidia.com>

* 1. Added support in paratial sample loading and alternative decoding.

Signed-off-by: Micha Livne <mlivne@nvidia.com>

* 1. Fixed syntax issues.

Signed-off-by: Micha Livne <mlivne@nvidia.com>

* 1. Minor change.

Signed-off-by: Micha Livne <mlivne@nvidia.com>

* 1. Extended flexibility of mapping indices.

Signed-off-by: Micha Livne <mlivne@nvidia.com>

* 1. Added validation ofdtype of indexing function.

Signed-off-by: Micha Livne <mlivne@nvidia.com>

Signed-off-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>

* [TTS] Replace IPA lambda arguments with locale string (#5298)

* [TTS] Replace IPA lambda arguments with locale string
* [TTS] Add locale validation
* Fixed typos
* Return punctuation as sorted list

Signed-off-by: Ryan <rlangman@nvidia.com>

* Remove onnx graphsurgery from Dockerfile (#5320)

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>

* Update refspec (#5321)

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>

* Force wav file format for audio_filepath (#5323)

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>

* Updates to T0 Dataset and Model (#5201)

* Fix

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fixes

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Update config

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Restore function needed for NMT

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix config

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Change output file format from JSON to JSONL

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Add T0 data preproc scripts

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Merge and multiprocessing

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fix for is_correct

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Refactor T0 dataset

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Add script to merge train folder into individual training files to minimize number of blends

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Minor changes

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Remove bin compat

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix header lines

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* [DOC] add sphinx-copybutton requirement to copy button on code snippets. (#5326)

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* [TTS] expand to support flexible dictionary entry formats in IPAG2P. (#5318)

* expand to support flexible dictionary entry formats in IPAG2P.
* removed unused imports in test.collections.tts
* removed unused imports in nemo.collections.tts.modules
* removed unused imports in nemo_text_processing.text_normalization.zh
* updated unit tests with new cases
* renamed test function names because we only test IPAG2P rather than all classes in the modules.py.
* revise current test dict with a single space between word and pronunications.

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* small bugfix for r1.13.0 (#5310) (#5325)

* typo fix
* udpate transcribe

Signed-off-by: fayejf <fayejf07@gmail.com>
Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>

* Option to pad the last validation input sequence if its smaller than the encoder sequence length for MegatronGPT (#5243)

* Option to pad the last input sequence of validation dataset if its smaller than the encoder sequence length for MegatronGPT

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* added default value for drop last argument

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Anmol Gupta <anmolg@nvidia.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>

* Add support for Hydra multirun to NeMo (#5159)

* Update execution doc and remove old snippet

Signed-off-by: smajumdar <titu1994@gmail.com>

* Fix types

Signed-off-by: smajumdar <titu1994@gmail.com>

* Fix defaults

Signed-off-by: smajumdar <titu1994@gmail.com>

* Fix types for ParallelAdapterConfig

Signed-off-by: smajumdar <titu1994@gmail.com>

* Add hash for config cache

Signed-off-by: smajumdar <titu1994@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add support to delete redundant ckpt files for HP search

Signed-off-by: smajumdar <titu1994@gmail.com>

* Correct config for IA3

Signed-off-by: smajumdar <titu1994@gmail.com>

* Fix check to <= 0

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* typo fix (#5328)

* Speed up HF data processing script for ASR (#5330)

* Correct hydra issue with relative filepaths

Signed-off-by: smajumdar <titu1994@gmail.com>

* Improve speed of dataset processing

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>

* add precommit hood to automatic sort entries in requirements. (#5333)

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* [TTS] update organization of model checkpoints and their pointers. (#5327)

* [TTS] update orgnization of model checkpoints and their pointers.

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* move model name column to the 2nd col and correct model names as
predefined_model_name.

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* Add speaker clustering arguments to forward function (#5306)

* Move arguments to forward function

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Resolved type issue

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Signed-off-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) (#5341)

* [STT] Add stt_ru_conformer_ctc_large

Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>

* [STT] Add stt_ru_conformer_transducer_large

Add stt_ru_conformer_transducer_large

Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>
Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* [TTS] bugfix for the script of generating mels. (#5344)

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* Fixing de-autocast (#5319)

* Fixing de-autocast

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

* Cleanup

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

* Refining export with max_dim/batch

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

* Moving cast utils to its own module

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

* Pcla tutorial fixes (#5313) (#5347)

* fixes

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* fixes

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* moved `create_text_and_labels` to token_classification_utils.py

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Co-authored-by: Matvei Novikov <mattyson.so@gmail.com>

* bug (#5348)

Co-authored-by: Dima Rekesh <drekesh@nvidia.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>

* [Bugfix] Added rm -f / wget- nc command to avoid bash error in multispeaker sim notebook (#5292)

* Added rm -f command to avoid error message

Signed-off-by: Taejin Park <tango4j@gmail.com>

* removed unnecessary changes

Signed-off-by: Taejin Park <tango4j@gmail.com>

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [DOC] added ipython dependency to support IPython.sphinxext extension (#5345)

* [DOC] added ipython dependency to support IPython.sphinxext extension

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* revert ipython extension in the doc and replace ipython block with
shell-session.

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* Fix dialogue tutorial bug (#5297) (#5303)

* set add_pooling_layer=False for huggingface bert model

* remove add_pooling_layer=False and set find_unused_parameters=True

* set num_prompt_tokens to 0 for huggingface

Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com>
Co-authored-by: Eric Harper <complex451@gmail.com>

* Fix issue with HF Model upload tutorial (#5359) (#5360)

* Add Gradio App to ASR Docs (#5270)

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>
(cherry picked from commit e4b6a387e3b3d9cdf511f7b9bbb5e94925e48cc2)

* Fix issue with normalized config for dataset name

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>

* Bug fix (removing old compute consumed samples) (#5355)

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* removed uninstall nemo_cv and nemo_simple_gan and relax numba version… (#5332)

* Update reinstall.sh and requirements.

* removed nemo_cv and nemo_simple_gan in reinstall.sh.
* relaxed numba version limits.
* added tensorboard requirement to avoid any incpmpatible issue.

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* revert changes for numba

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* Pipeline paralleism in Bert (#5293)

* Global batch size support for validation

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Global batch size support for bert

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* bert batch support

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* bert batch size support

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* O2 support for bert

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update megatron_bert_pretraining.py

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* Update megatron_bert_model.py

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update megatron_bert_config.yaml

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update megatron_bert_model.py

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* Bug fix

* Bug fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Bug fix

* Bug fix

* Bug fix

* Update megatron_bert_config.yaml

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* PPBert

* PPBert

* PPBert

* PPBert

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update megatron_bert_config.yaml

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>

* bug fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* bug fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* bug fix

* bug fix

* bug fix

* bug fix

Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>

* tutorial fixes (#5354) (#5361)

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Co-authored-by: Matvei Novikov <mattyson.so@gmail.com>

* Enable mlflow logger (#4893)

* Enable mlflow logger

Signed-off-by: whrichd <trabeitwrq@gmail.com>

* fix style

Signed-off-by: whrichd <trabeitwrq@gmail.com>

* Add doc lines.

Signed-off-by: whrichd <trabeitwrq@gmail.com>

* change default value

Signed-off-by: whrichd <trabeitwrq@gmail.com>

* fix doc

Signed-off-by: whrichd <trabeitwrq@gmail.com>

* addressed comments, added dataclass

Signed-off-by: whrichd <trabeitwrq@gmail.com>

* fix style

Signed-off-by: whrichd <trabeitwrq@gmail.com>

* fix doc

Signed-off-by: whrichd <trabeitwrq@gmail.com>

Signed-off-by: whrichd <trabeitwrq@gmail.com>

* Add SDP documentation (#5274) (#5376)

* Add details to SDP README.md

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Add docstring to WriteManifest processor

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Add docstring to CreateInitialManifestMLS

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Add ModifyManifestTextProcessor docstring

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Add ASRInference docstring

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Add base_processor docstrings

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Add minimal SDP docs page

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Update tools/speech_dataset_processor/README.md

Co-authored-by: Igor Gitman <igitman@nvidia.com>
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>

* Write simple README for SDP and move complex explanations to docs

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Remove incorrect type hints

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Make config example less confusing

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Fix typo

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Clarify that YAML file is config file in README

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Remove unused imports

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Remove SDP docs for now

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

* Remove links to docs in SDP README

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: Igor Gitman <igitman@nvidia.com>

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: Igor Gitman <igitman@nvidia.com>

* Rename Speech Dataset Processor to Speech Data Processor (#5378) (#5381)

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>

Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>

* fix for num worker 0 causing issues in losses after 1 epoch (#5379) (#5384)

Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com>

* [TTS] Add Spanish model documentation (#5390)

Signed-off-by: Ryan <rlangman@nvidia.com>

* [TTS] Add Spanish FastPitch training configs (#5383)

* [TTS] Add Spanish FastPitch training configs
* [TTS] Add single speaker Spanish configs

Signed-off-by: Ryan <rlangman@nvidia.com>

* Fix Python type hints according to Python Docs (#5370)

* Remove duplicated type annotations

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Fix tuple annotations in function return types

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Add necessary imports

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Add necessary imports

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Fix types in obvious places

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Fix types in obvious places

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Fix unused import (avoid quotes in type annotations)

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Revert "Fix unused import (avoid quotes in type annotations)"

This reverts commit ea433efcd9916abf8944879e791484a0a1437f83.

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Remove problematic import

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Fix list_available_models method type

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Revert some changes

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Revert quotes in list_available_models

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Force MHA QKV onto fp32 (#5391) (#5395)

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>

Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>

* Add cpWER for evaluation of ASR with diarization (#5279)

* Add cpWER calculation feature

Signed-off-by: Taejin Park <tango4j@gmail.com>

* added notebook

Signed-off-by: Taejin Park <tango4j@gmail.com>

* updated notebook and diarization_utils

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Minor update on tutorial notebook

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update on missing docstrings

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Fixed an unfinished docstring

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Removed unused variables

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Fixed dict input to list input

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* fixed LGTM issues

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Fixed error in cpWER cal

Signed-off-by: Taejin Park <tango4j@gmail.com>

* fixed docstrings

Signed-off-by: Taejin Park <tango4j@gmail.com>

* fixed docstrings

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Fix some of the typing issues, lower case names

Signed-off-by: SeanNaren <snarenthiran@nvidia.com>

* Replaced bruteforce with LSA alg for cpWER

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Reflected PR comments

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Cleaned notebook

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* updated notebook

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Fixed LGTM warnings

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* added test_diar_metrics.py

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fixed typos

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Fixed wrong type annotations

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Added bruteforce mode and its unit-test

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* LGTM issues fixed

Signed-off-by: Taejin Park <tango4j@gmail.com>

* reolve LGTM issues

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* unified speaker key in trans_dict

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Removed unused variable and imports

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Update nemo/collections/asr/parts/utils/diarization_utils.py

Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>

* Update nemo/collections/asr/parts/utils/diarization_utils.py

Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>

* moved all the diarization eval to der.py

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Update tests/collections/asr/test_diar_metrics.py

Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>

* der.py update on tests

Signed-off-by: Taejin Park <tango4j@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* unused imports and style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* unused import

Signed-off-by: Taejin Park <tango4j@gmail.com>

* reflected review comments

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Fixed an import bug in tutorial notebook

Signed-off-by: Taejin Park <tango4j@gmail.com>

Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: SeanNaren <snarenthiran@nvidia.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>

* Added cast

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: arendu <adithya.r@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>
Signed-off-by: Dima Rekesh <drekesh@nvidia.com>
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Igor Gitman <igitman@nvidia.com>
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Ryan <rlangman@nvidia.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: fayejf <fayejf07@gmail.com>
Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>
Signed-off-by: whrichd <trabeitwrq@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
Signed-off-by: David <amosalla@asu.edu>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com>
Co-authored-by: anteju <108555623+anteju@users.noreply.github.com>
Co-authored-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>
Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: Shantanu Acharya <shantanua@nvidia.com>
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Co-authored-by: Subhankar Ghosh <subhankar2321@gmail.com>
Co-authored-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>
Co-authored-by: Dima Rekesh <bmwshop@gmail.com>
Co-authored-by: Dima Rekesh <drekesh@nvidia.com>
Co-authored-by: Yuekai Zhang <zhangyuekai@foxmail.com>
Co-authored-by: Matvei Novikov <mattyson.so@gmail.com>
Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com>
Co-authored-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: Jason <jasoli@nvidia.com>
Co-authored-by: Igor Gitman <igitman@nvidia.com>
Co-authored-by: Micha Livne <michalivne@users.noreply.github.com>
Co-authored-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Ryan Langman <rlangman@nvidia.com>
Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>
Co-authored-by: anmolgupt <14880251+anmolgupt@users.noreply.github.com>
Co-authored-by: Anmol Gupta <anmolg@nvidia.com>
Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Co-authored-by: Riqiang Wang <43883260+whrichd@users.noreply.github.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: Vladimir Bataev <vbataev@nvidia.com>
Co-authored-by: David <amosalla@asu.edu>
hainan-xv pushed a commit to hainan-xv/NeMo that referenced this pull request Nov 29, 2022
…#5304)

* [TTS] bugfix IPAG2P and refactor to remove duplicate process.
* added type hints and rename func.
* unify str and list(str) as list(str).
* revise logging message when phoneme_dict_obj is empty

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Hainan Xu <hainanx@nvidia.com>
hainan-xv pushed a commit to hainan-xv/NeMo that referenced this pull request Nov 29, 2022
…#5304)

* [TTS] bugfix IPAG2P and refactor to remove duplicate process.
* added type hints and rename func.
* unify str and list(str) as list(str).
* revise logging message when phoneme_dict_obj is empty

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Hainan Xu <hainanx@nvidia.com>
JimmyZhang12 pushed a commit to JimmyZhang12/NeMo that referenced this pull request Dec 14, 2022
…#5304)

* [TTS] bugfix IPAG2P and refactor to remove duplicate process.
* added type hints and rename func.
* unify str and list(str) as list(str).
* revise logging message when phoneme_dict_obj is empty

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
andrusenkoau pushed a commit to andrusenkoau/NeMo that referenced this pull request Jan 5, 2023
…#5304)

* [TTS] bugfix IPAG2P and refactor to remove duplicate process.
* added type hints and rename func.
* unify str and list(str) as list(str).
* revise logging message when phoneme_dict_obj is empty

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants