main to ssl synthesis #9

Merged
merged 57 commits into from
Jul 28, 2022
Conversation

paarthneekhara
Owner

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Collection: [Note which collection this PR will affect]

Changelog

  • Add specific line by line info of high level changes in this PR.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

michalivne and others added 30 commits July 6, 2022 10:37
* 1. Debugging.

Signed-off-by: Micha Livne <mlivne@cs.toronto.edu>

* 1. BART dataset fixes missing <EOS> for decoder output.

Signed-off-by: Micha Livne <mlivne@cs.toronto.edu>

* 1. Debugging.

Signed-off-by: Micha Livne <mlivne@cs.toronto.edu>

* 1. Debugging.

Signed-off-by: Micha Livne <mlivne@cs.toronto.edu>

* 1. Removed extra padding from BARTDataset.

Signed-off-by: Micha Livne <mlivne@cs.toronto.edu>
* Updated pipeline parallel code to speed up training

Signed-off-by: Virginia Adams <vadams@nvidia.com>

* Load global batch size not local mini batch size

Signed-off-by: Virginia Adams <vadams@nvidia.com>

* Python reformatting

Signed-off-by: Virginia Adams <vadams@nvidia.com>
* Temp

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Add megatron dataset

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Update config and fix global batch fetcher

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Add dataset class

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Update comments

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Update yaml

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix duplicate yaml key

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Translate method and preprocess script for raw text

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Remove pdb

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix arg name

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix other arg

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Change sampler back

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Move back to global batch fetcher to use distributed sampler

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Add text memmap data

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Update monitor

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fixes for PP

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Remove unused import

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Truncate examples in text memmap

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* NMT training batch interpolation key

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* tarred data fix

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Change dataset type check

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix sampler

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Pass dataset cfg to determine type

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Log global step on validation step as well

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix NMT model saving with artifacts

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Initialize DDP in decode if not initialized. Needed for inference only mode

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Megatron NMT inference script

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Inference config file

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* hardcode max delta temporarily

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* detokenizer if processor is not none

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Sampler config

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Compat with configs without sampler arg

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Comment for validation dataset type

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix tokenizer building

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* CI test for megatron nmt

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix tokenizer in restore

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* O2 restore from fix

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Remove print

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Change tokenizer model name in config

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Logging

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Set seed for distributed sampler

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Cluster debugging messages

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix max generation delta

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* No LM Init

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Use nlp save restore connector

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Remove useless infer args

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Typo

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* UTF8 safe print of translation result

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Add save restore connector back with comment

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Refactor

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix CI test

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Add missing args

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Address comments

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Empty to restart

* Fix CI test

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Check for test ds

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* set fusion to false

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Initial perceiver encoder

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Perceiver with PP=1

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Remove init cross attn

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* CI test and remove init cross attn arg

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Remove init cross attn layers from file

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Clean up

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* update branch

Signed-off-by: ericharper <complex451@gmail.com>

* Set headscale false (NVIDIA#4364)

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Add wandb as dependency (NVIDIA#4365)

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Raise trainer error (NVIDIA#4356)

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

Co-authored-by: Micha Livne <michalivne@users.noreply.github.com>

* Set headscale false (NVIDIA#4364) (NVIDIA#4366)

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Finetuning changes for BART (NVIDIA#4003)

* Temp

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Checkpoint converter to nemo for bart

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

Co-authored-by: Micha Livne <michalivne@users.noreply.github.com>

* Make position embedding expansion specific to a batch to avoid checkpoint size mismatches (NVIDIA#4357)

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix logging warning

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

Co-authored-by: Micha Livne <michalivne@users.noreply.github.com>

* Refactor bias act fusion

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Update NMT config

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix electronic bug, new time ITN rule (NVIDIA#4355)

* fix electronic bug

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* add new itn time rule

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* revert domain changes

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* remove repetition

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* Update ci tests

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Correct support for dataclasses in default module dim (NVIDIA#4372)

* Correct support for dataclasses in default module dim

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix path for save of results

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* fix pad id bug (NVIDIA#4377)

Signed-off-by: Yi Dong <yidong@nvidia.com>

* Question answering bug fix (NVIDIA#4381)

* refactor dialogue state tracking for modelling/dataset interoperability

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix style changes

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix typo

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix style raised by lgtm

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix style formatting

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update template to include description of intent

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* changes based on requests in review

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add compatibility with assistant dataset

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove dialogue_state_tracking

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update huggingface utils for dialogue

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix style

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix typo

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add docstrings for assistant data processor

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins for SGDGEN local checkpoint

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update style

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* use local vocab file for Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* patch for Jenkins CI using local file

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add slot filling prediction and metrics

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused code

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* refactor metrics code out of Dialogue GPT Model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate backward compatible support for IntentSlotClassificationModel (bert model)

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* save prediction file for IntentSlotClassification

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update dialogue gpt model training for megatron gpt

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove batch generate for HF GPT2, which causes lower performance

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add few shot capability to dialogue gpt model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile and remove unused import

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update code description and clarity

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address PR comments

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate compatibility with ZeroShotIntentModel

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* rename folder to dialogue due to increased scope and further refactor for clarity

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* added dialogue GPT for sequence generation task (e.g. answer extender)

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add CI test for DialogueGPTGenerationModel

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate DialogueS2SGenerationModel for generation task (e.g. answer extender)

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* modify huggingface utils to support HF t5/BART models

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused imports

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update bleu metric

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix bleu metric style

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* debug bleu metric

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* debug bleu metric

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update based on PR NVIDIA#3893

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update 2 based on PR NVIDIA#3893

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update 3 based on PR NVIDIA#3893

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate sgd generation based on user utterance and system slot-values to generate system utterance

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add validation model saving capabilities

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* cleaned up code for SGD Based Answer extender

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Dialogue Generation CI

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix Jenkins CI issue

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add support for design dataset

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unnecessary imports

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* support megatron for dialogue_s2s_generation_model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* reduce loaded samples in MSMarcoDataProcessor to 64 when cfg.model.dataset.debug_mode=True

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update CI

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update checkpoint and predictions filename to include epoch number

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate HF BART MNLI into zero shot intent model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate Dialogue Nearest Neighbour Model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* refactor Dialogue SGD Data Processor to make interface for models cleaner

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Dialogue S2S Generation model for DialogueSGDDataProcessor interface

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* support sgd and drive thru datasets by zero shot model and nearest neighbour model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add prediction saving code to nearest neighbour and zero shot intent models

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix typo in sgd data processor

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate Dialogue Mellon QA Data Processor

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update mellon qa

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update dialogue.py to remove outdated info

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update dialogue_config.yaml

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update dialogue_config.yaml

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add dialogue docs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address review comments

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix for cfg

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* make dependency on apex optional

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* change NLPDDPPlugin calling logic to make it possible to run without apex

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add first draft of tutorial

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* reduce ms marco size by removing lines without wellFormedAnswers

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address pr comments

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update colab tutorial link in dialogue docs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* include unit test and some refactor to facilitate unit test

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address pr issues

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove typos in dialogue tutorial

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* support larger files for question answering

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unnecessary artifacts to reduce memory use

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* put 0 tensor to device

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update link within dialogue tutorial

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* restore previously deleted files

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error handling when loss = nan

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update nan handling

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update spanning loss func

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update spanning loss

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix type error raised in qa_dataset.py

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add error checking message

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* revert back to float32

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* revert back to float32

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update exp logging

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update loading of large file from pickle to json

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update loading of large file from pickle to json

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* limit number of negative samples

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* revert post processing

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* revert post processing

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused methods and style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add more documentation

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused imports

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* changes based on PR review

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* set wandb logger false by default

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

* style fix

* correct typo

* style fix

* style fix

Co-authored-by: Zhilin Wang <zhilinw@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>

* Fix ASR Typos in tutorials (NVIDIA#4384)

* Fix typos

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Quick wav2vec fix. In-place operation adding convolutional positions to encoder was overwriting leaf history. Wasn't caught on previous torch versions. (NVIDIA#4383)

Signed-off-by: tbartley94 <tbartley@nvidia.com>

Co-authored-by: tbartley94 <tbartley@nvidia.com>
(cherry picked from commit 0322b15)

Co-authored-by: Travis Bartley <Travismbartley@gmail.com>

* Add Docs for NeMo Adapters (NVIDIA#4369)

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update NeMo docs (NVIDIA#4397)

Signed-off-by: smajumdar <smajumdar@nvidia.com>

Co-authored-by: Eric Harper <complex451@gmail.com>

* Punctuation and capitalization tests race condition (NVIDIA#4399)

* Add draft of race condition fixes

Signed-off-by: PeganovAnton <peganoff2@mail.ru>

* Minor improvements

Signed-off-by: PeganovAnton <peganoff2@mail.ru>

* More race condition fixes

Signed-off-by: PeganovAnton <peganoff2@mail.ru>

* Improve error message

Signed-off-by: PeganovAnton <peganoff2@mail.ru>

* Improve error message

Signed-off-by: PeganovAnton <peganoff2@mail.ru>

* Improve error message

Signed-off-by: PeganovAnton <peganoff2@mail.ru>

* bias act fusion changes

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Address comments

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix geglu without fusion

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Reset files to main

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Remove hidden blocks

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

Co-authored-by: Micha Livne <michalivne@users.noreply.github.com>
Co-authored-by: Abhinav Khattar <aklife97@gmail.com>
Co-authored-by: ericharper <complex451@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com>
Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com>
Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com>
Co-authored-by: Zhilin Wang <zhilinw@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>
Co-authored-by: Travis Bartley <Travismbartley@gmail.com>
Co-authored-by: PeganovAnton <peganoff2@mail.ru>
* initial commit

Signed-off-by: Taejin Park <tango4j@gmail.com>

* style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Default maj_vote = False, max_rp=0.25

Signed-off-by: Taejin Park <tango4j@gmail.com>

* doc strings and style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Docstring minor edit

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Default False in the functions

Signed-off-by: Taejin Park <tango4j@gmail.com>

* fixed repeated variable

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Default as maj_vote=False

Signed-off-by: Taejin Park <tango4j@gmail.com>

* removed redundant part in write_rttm func

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Removed unused function

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Updated and tested silence and very short samples

Signed-off-by: Taejin Park <tango4j@gmail.com>

* style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Style fix and removing unnecessary parts

Signed-off-by: Taejin Park <tango4j@gmail.com>

* unused variables are removed

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Fixed commented torch.jit.script

Signed-off-by: Taejin Park <tango4j@gmail.com>

* majority voting update

Signed-off-by: Taejin Park <tango4j@gmail.com>

* cancelling the update on speaker_utils and clus_diarizer

Signed-off-by: Taejin Park <tango4j@gmail.com>

* style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* bug fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Added fp32 converting for torch.mm

Signed-off-by: Taejin Park <tango4j@gmail.com>

Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Signed-off-by: saarus72 <saarus72@gmail.com>

Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Adrian Lancucki <alancucki@users.noreply.github.com>

Co-authored-by: Adrian Lancucki <alancucki@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Dataloader, collector, loss and metric for multiscale diarization decoder  (NVIDIA#4187)

* First commit

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Checked functionality and imports

Signed-off-by: Taejin Park <tango4j@gmail.com>

* fixed import issues

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Removed the changes made by mistake

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Fixed LGTM errors 001

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Fixed LGTM and style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Changed docstrings

Signed-off-by: Taejin Park <tango4j@gmail.com>

* LGTM again

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Removed unnecessary torch setting lines

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Style fix and isort

Signed-off-by: Taejin Park <tango4j@gmail.com>

* jbalam-nv comments reflected

Signed-off-by: Taejin Park <tango4j@gmail.com>

* style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Reflected comments and created _diar_label.py

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Typo fix and style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Fixed target_spks[0] index error

Signed-off-by: Taejin Park <tango4j@gmail.com>

* style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* LGTM unused import IterDataset

Signed-off-by: Taejin Park <tango4j@gmail.com>

* revert collection doc year

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Code format error in collections.py

Signed-off-by: Taejin Park <tango4j@gmail.com>

* fix collections space format error

Signed-off-by: Taejin Park <tango4j@gmail.com>

* merged main correctly

Signed-off-by: Taejin Park <tango4j@gmail.com>

* style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Reflected all comments and tested

Signed-off-by: Taejin Park <tango4j@gmail.com>

* style fix and LGTM

Signed-off-by: Taejin Park <tango4j@gmail.com>

* rttm_filepath to rttm_file and removed self included funcs, tested

Signed-off-by: Taejin Park <tango4j@gmail.com>

Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* removed references to data_dir

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* added missing parameters to data preparation script

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* removed unnecessary file extension check

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* Add ASR CTC Decoding module (NVIDIA#4342)

* Initial commit

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Full support for decoding strategy

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Temp

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix labels of y_sequence

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Set support for sentencepiece subword merging

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix char and word based token merge alignment

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Revert incorrect change

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update docstring

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Improve compatibility with greedy tokens and log probs

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update scripts to use decoding strategy

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add tests and docs

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add tests and docs

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix speaker decoder timestamps

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix speaker decoder timestamps

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix decoding of ctc models

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Address reviewer comments

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Address reviewer comments

Signed-off-by: smajumdar <smajumdar@nvidia.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* Option to disable mp in VAD via num_workers=1 (NVIDIA#4317)

* Option to disable mp in VAD via num_workers=1

In certain environments Python multiprocessing can deadlock. This adds a convenient way to disable it by setting num_workers to 1.

Signed-off-by: Georg Kucsko <gkucsko@gmail.com>

* add none handling

Signed-off-by: Georg Kucsko <gkucsko@gmail.com>

* additional none handling

Signed-off-by: Georg Kucsko <gkucsko@gmail.com>

Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* remove redundant bias expand (NVIDIA#4382)

* remove redundant bias expand

Signed-off-by: Xiaowei Ren <xren@nvidia.com>

* delete redundant code

Signed-off-by: Xiaowei Ren <xren@nvidia.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* fixed style

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* Add option for specifying wandb save_dir from config (NVIDIA#4379)

* give option to user to specify wandb save dir via config

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

* create save_dir directory for wandb logger if not exists

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

* update save_dir get method with a default value

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* Quick wav2vec fix. In-place operation adding convolutional positions to encoder was overwriting leaf history. Wasn't caught on previous torch versions. (NVIDIA#4383)

Signed-off-by: tbartley94 <tbartley@nvidia.com>

Co-authored-by: tbartley94 <tbartley@nvidia.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* [Bugfix][TTS] wrong order of returned tuple for general_collate_fn. (NVIDIA#4388)

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* Merge r1.10.0 main (NVIDIA#4398)

* update branch

Signed-off-by: ericharper <complex451@gmail.com>

* Set headscale false (NVIDIA#4364)

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Add wandb as dependency (NVIDIA#4365)

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Raise trainer error (NVIDIA#4356)

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

Co-authored-by: Micha Livne <michalivne@users.noreply.github.com>

* Set headscale false (NVIDIA#4364) (NVIDIA#4366)

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Finetuning changes for BART (NVIDIA#4003)

* Temp

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Checkpoint converter to nemo for bart

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

Co-authored-by: Micha Livne <michalivne@users.noreply.github.com>

* Make position embedding expansion specific to a batch to avoid checkpoint size mismatches (NVIDIA#4357)

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix logging warning

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

Co-authored-by: Micha Livne <michalivne@users.noreply.github.com>

* Fix electronic bug, new time ITN rule (NVIDIA#4355)

* fix electronic bug

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* add new itn time rule

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* revert domain changes

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* remove repetition

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* Correct support for dataclasses in default module dim (NVIDIA#4372)

* Correct support for dataclasses in default module dim

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix path for save of results

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* fix pad id bug (NVIDIA#4377)

Signed-off-by: Yi Dong <yidong@nvidia.com>

* Question answering bug fix (NVIDIA#4381)

* refactor dialogue state tracking for modelling/dataset interoperability

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix style changes

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix typo

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix style raised by lgtm

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix style formatting

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update template to include description of intent

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* changes based on requests in review

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add compatibility with assistant dataset

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove dialogue_state_tracking

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update huggingface utils for dialogue

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix style

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix typo

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add docstrings for assistant data processor

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins for SGDGEN local checkpoint

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update style

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* use local vocab file for Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* patch for Jenkins CI using local file

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add slot filling prediction and metrics

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused code

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* refactor metrics code out of Dialogue GPT Model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate backward compatible support for IntentSlotClassificationModel (bert model)

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* save prediction file for IntentSlotClassification

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update dialogue gpt model training for megatron gpt

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove batch generate for HF GPT2, which causes lower performance

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add few shot capability to dialogue gpt model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile and remove unused import

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update code description and clarity

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address PR comments

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate compatibility with ZeroShotIntentModel

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* rename folder to dialogue due to increased scope and further refactor for clarity

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* added dialogue GPT for sequence generation task (e.g. answer extender)

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add CI test for DialogueGPTGenerationModel

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate DialogueS2SGenerationModel for generation task (e.g. answer extender)

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* modify huggingface utils to support HF t5/BART models

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused imports

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update bleu metric

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix bleu metric style

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* debug bleu metric

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* debug bleu metric

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update based on PR NVIDIA#3893

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update 2 based on PR NVIDIA#3893

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update 3 based on PR NVIDIA#3893

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate sgd generation based on user utterance and system slot-values to generate system utterance

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add validation model saving capabilities

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* cleaned up code for SGD Based Answer extender

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Dialogue Generation CI

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix Jenkins CI issue

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add support for design dataset

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unnecessary imports

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* support megatron for dialogue_s2s_generation_model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* reduce loaded samples in MSMarcoDataProcessor to 64 when cfg.model.dataset.debug_mode=True

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update CI

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update checkpoint and predictions filename to include epoch number

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate HF BART MNLI into zero shot intent model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate Dialogue Nearest Neighbour Model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* refactor Dialogue SGD Data Processor to make interface for models cleaner

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Dialogue S2S Generation model for DialogueSGDDataProcessor interface

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* support sgd and drive thru datasets by zero shot model and nearest neighbour model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add prediction saving code to nearest neighbour and zero shot intent models

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix typo in sgd data processor

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate Dialogue Mellon QA Data Processor

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update mellon qa

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update dialogue.py to remove outdated info

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update dialogue_config.yaml

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update dialogue_config.yaml

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add dialogue docs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address review comments

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix for cfg

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* make dependency on apex optional

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* change NLPDDPPlugin calling logic to make it possible to run without apex

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add first draft of tutorial

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* reduce ms marco size by removing lines without wellFormedAnswers

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address pr comments

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update colab tutorial link in dialogue docs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* include unit test and some refactor to facilitate unit test

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address pr issues

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove typos in dialogue tutorial

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* support larger files for question answering

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unnecessary artifacts to reduce memory use

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* put 0 tensor to device

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update link within dialogue tutorial

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* restore previously deleted files

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error handling when loss = nan

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update nan handling

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update spanning loss func

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update spanning loss

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix type error raised in qa_dataset.py

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add error checking message

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* revert back to float32

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* revert back to float32

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update exp logging

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update loading of large file from pickle to json

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update loading of large file from pickle to json

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* limit number of negative samples

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* revert post processing

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* revert post processing

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused methods and style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add more documentation

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused imports

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* changes based on PR review

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* set wandb logger false by default

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

* style fix

* correct typo

* style fix

* style fix

Co-authored-by: Zhilin Wang <zhilinw@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>

* Fix ASR Typos in tutorials (NVIDIA#4384)

* Fix typos

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Quick wav2vec fix. In-place operation adding convolutional positions to encoder was overwriting leaf history. Wasn't caught on previous torch versions. (NVIDIA#4383)

Signed-off-by: tbartley94 <tbartley@nvidia.com>

Co-authored-by: tbartley94 <tbartley@nvidia.com>
(cherry picked from commit 0322b15)

Co-authored-by: Travis Bartley <Travismbartley@gmail.com>

* Add Docs for NeMo Adapters (NVIDIA#4369)

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update NeMo docs (NVIDIA#4397)

Signed-off-by: smajumdar <smajumdar@nvidia.com>

Co-authored-by: Eric Harper <complex451@gmail.com>

* update branch

Signed-off-by: ericharper <complex451@gmail.com>

* remove Copy of

Signed-off-by: ericharper <complex451@gmail.com>

Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: Micha Livne <michalivne@users.noreply.github.com>
Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com>
Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com>
Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com>
Co-authored-by: Zhilin Wang <zhilinw@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>
Co-authored-by: Travis Bartley <Travismbartley@gmail.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* [bugfix][TTS] pitch, voiced_mask, prob_voiced have the same values. (NVIDIA#4392)

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* Fixing import error in some cases (NVIDIA#4401)

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* Fixing bugs in calling method ctc_decoder_predictions_tensor. (NVIDIA#4414)

* updated ctc decoding calls.

Signed-off-by: Vahid <vnoroozi@nvidia.com>

* fixed the ones for timestamp_utils.py

Signed-off-by: Vahid <vnoroozi@nvidia.com>

* fixed the ones for timestamp_utils.py

Signed-off-by: Vahid <vnoroozi@nvidia.com>

* fixed the ones for timestamp_utils.py

Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* Update with new conformer checkpoints. (NVIDIA#4417)

Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

* [TTS] add static method decorator. (NVIDIA#4443)

* [TTS] add static method decorator.

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* remove protect prefix

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* fixed style error

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>

Co-authored-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: Georg Kucsko <gkucsko@users.noreply.github.com>
Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>
Co-authored-by: Xiaowei Ren <103958965+xrennvidia@users.noreply.github.com>
Co-authored-by: Shantanu Acharya <shantanua@nvidia.com>
Co-authored-by: Travis Bartley <Travismbartley@gmail.com>
Co-authored-by: tbartley94 <tbartley@nvidia.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Micha Livne <michalivne@users.noreply.github.com>
Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com>
Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com>
Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com>
Co-authored-by: Zhilin Wang <zhilinw@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com>
Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com>
…oading mel

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>
* Add ITN pt

Signed-off-by: Guilherme Steinmann <guist@linse.ufsc.br>

* Fix style

Signed-off-by: Guilherme Steinmann <guist@linse.ufsc.br>

* Fix style

Signed-off-by: Guilherme Steinmann <guist@linse.ufsc.br>

* Update copyright year to 2022 on ITN pt rules and tests

Signed-off-by: Guilherme Steinmann <guist@linse.ufsc.br>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>

Co-authored-by: Ante Jukić <ajukic@nvidia.com>
phoneme IY1 -> IH1 in NVIDIA
Added phonemes for CUSTOMIZABLE

Update cmudict file revision and its reference.

Signed-off-by: Jason Roche <jroche@nvidia.com>

Co-authored-by: Jason Roche <jroche@nvidia.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* add support for param groups

Signed-off-by: stevehuang52 <heh@nvidia.com>

* make config more general

Signed-off-by: stevehuang52 <heh@nvidia.com>

Co-authored-by: Eric Harper <complex451@gmail.com>
* initial commit

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* fixed silence wav file issue causing clustering to evaluate on null embeddings

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* fixed zero duration issue

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* updated with comments

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* minor doc change

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* update log

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* Runtime check refinements

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

* Added fp32 casting for ASR nets export

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

* style

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

* Used torch.float32 for clarity

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

* Fixing parameters passing

Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* initial_script

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* move old script

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* remove finetune func from label models

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* style clean

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* updated config

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* update tutorial

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* lgtm fixes

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* updated based on comments

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* update doc

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
…DIA#4511)

* Preserve logprobs when preserving alignments

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update tests for rnnt greedy and beam search

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update all dependents of alignments

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update docs

Signed-off-by: smajumdar <smajumdar@nvidia.com>
…_whitespace (NVIDIA#4525)

* Fix geglu without fusion

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Add extra args

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Reset transformer

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix spm arg

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix help string

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Add support for access mixin registry of custom losses

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* add support for asr custom losses

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update for l2 loss

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add unittests

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add unittests

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add unittests

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update registration of tensors to reset after finishing step

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Remove comment

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Remove comment

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update SSL models

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add support for validation step properly registering tensors

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Move reset of registry outside

Signed-off-by: smajumdar <smajumdar@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
* fix duplex inference with grammars

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* add ci test for duplex, fix electronic last sym bug

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* test fix

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* fix jenkins

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* update jenkins grammars

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* add pt to the docs

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* fix jenkins

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* disable test

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* fix jenkins

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* jenkins refactor

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* fix jenkins

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* fix jenkins

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* fix jenkins

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* jenkins

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* jenkins

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* jenkins

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* jenkins

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* jenkins

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* jenkins

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* test

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* test

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* test

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* test

Signed-off-by: ekmb <ebakhturina@nvidia.com>

Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>
…DIA#4465)

* Add Bucketing support to TarredAudioToClassificationLabelDataset

Signed-off-by: Ewald Enzinger <ewald.enzinger@entn.at>
* add FT support

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* rm preproc

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* review changes

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* add CI

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* newline fix

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* CI fix

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* clean up

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* post training cleanup

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* test

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* revert

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* CI test

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* revert CI changes

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* original CI

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
* add nsys profiling

Signed-off-by: ericharper <complex451@gmail.com>

* only access omegaconf in setup

Signed-off-by: ericharper <complex451@gmail.com>

* use robust get_rank function

Signed-off-by: ericharper <complex451@gmail.com>

* simplify

Signed-off-by: ericharper <complex451@gmail.com>
* refactor dialogue state tracking for modelling/dataset interoperability

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix style changes

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix typo

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix style raised by lgtm

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix style formatting

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update template to include description of intent

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* changes based on requests in review

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add compatibility with assistant dataset

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove dialogue_state_tracking

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update huggingface utils for dialogue

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix style

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile for SGDGEN

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix typo

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add docstrings for assistant data processor

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins for SGDGEN local checkpoint

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update style

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* use local vocab file for Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* patch for Jenkins CI using local file

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add slot filling prediction and metrics

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused code

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* refactor metrics code out of Dialogue GPT Model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate backward compatible support for IntentSlotClassificationModel (bert model)

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* save prediction file for IntentSlotClassification

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update dialogue gpt model training for megatron gpt

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove batch generate for HF GPT2, which causes lower performance

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add few shot capability to dialogue gpt model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile and remove unused import

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update code description and clarity

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address PR comments

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate compatibility with ZeroShotIntentModel

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* rename folder to dialogue due to increased scope and further refactor for clarity

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* added dialogue GPT for sequence generation task (e.g. answer extender)

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add CI test for DialogueGPTGenerationModel

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate DialogueS2SGenerationModel for generation task (e.g. answer extender)

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* modify huggingface utils to support HF t5/BART models

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused imports

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update bleu metric

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix bleu metric style

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* debug bleu metric

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* debug bleu metric

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update based on PR NVIDIA#3893

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update 2 based on PR NVIDIA#3893

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update 3 based on PR NVIDIA#3893

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate sgd generation based on user utterance and system slot-values to generate system utterance

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add validation model saving capabilities

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* cleaned up code for SGD Based Answer extender

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Dialogue Generation CI

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkinsfile

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix Jenkins CI issue

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add support for design dataset

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unnecessary imports

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* support megatron for dialogue_s2s_generation_model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* reduce loaded samples in MSMarcoDataProcessor to 64 when cfg.model.dataset.debug_mode=True

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update CI

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update checkpoint and predictions filename to include epoch number

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate HF BART MNLI into zero shot intent model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate Dialogue Nearest Neighbour Model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* refactor Dialogue SGD Data Processor to make interface for models cleaner

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update Dialogue S2S Generation model for DialogueSGDDataProcessor interface

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update jenkins

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* support sgd and drive thru datasets by zero shot model and nearest neighbour model

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add prediction saving code to nearest neighbour and zero shot intent models

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix typo in sgd data processor

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* integrate Dialogue Mellon QA Data Processor

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update mellon qa

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update dialogue.py to remove outdated info

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update dialogue_config.yaml

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update dialogue_config.yaml

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add dialogue docs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address review comments

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix for cfg

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* make dependency on apex optional

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* change NLPDDPPlugin calling logic to make it possible to run without apex

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add first draft of tutorial

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* reduce ms marco size by removing lines without wellFormedAnswers

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address pr comments

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update colab tutorial link in dialogue docs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* include unit test and some refactor to facilitate unit test

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address pr issues

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove typos in dialogue tutorial

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* support larger files for question answering

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unnecessary artifacts to reduce memory use

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* put 0 tensor to device

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update link within dialogue tutorial

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* restore previously deleted files

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error handling when loss = nan

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update nan handling

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update spanning loss func

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update spanning loss

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix type error raised in qa_dataset.py

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add error checking message

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* revert back to float32

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* revert back to float32

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update exp logging

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msgs

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update loading of large file from pickle to json

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update loading of large file from pickle to json

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* limit number of negative samples

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* revert post processing

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* revert post processing

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused methods and style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* add more documentation

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused imports

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* changes based on PR review

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* set wandb logger false by default

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update interface with megatron gpt prompt learning

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update inline documentation

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* style fix

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update prompt_ids

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update error msg

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update config

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update config

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* set inference = False for dialogue prompt learning during training

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* set inference = False for dialogue prompt learning during training

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused code

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* update config yaml

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* fix bug for megatron gpt prompt learning

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* remove unused import

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address comments in PR

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address comments in PR

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

* address typo

Signed-off-by: Zhilin Wang <zhilinw@nvidia.com>

Co-authored-by: Zhilin Wang <zhilinw@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* update fastpitch to add export controls

Signed-off-by: Jason <jasoli@nvidia.com>

* final touchups

Signed-off-by: Jason <jasoli@nvidia.com>

* more final touchups

Signed-off-by: Jason <jasoli@nvidia.com>
… models (NVIDIA#4550)

Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
yzhang123 and others added 27 commits July 15, 2022 15:00
* added MLM Scoring

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix header

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* refactor

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix bug that made normalization options set

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix style

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix discrepancy of space versus no space to previous version e.g. < sixteen > and <sixteen>

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* remove and from cardinal when lm is used to reduce number of options

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix grammar

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix masked input for [MASK] token before mlm scoring

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* mask out everything apart from one semiotic token

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* reverted masking change and added roman to lm

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix slash, expand measure

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* fix masked scoring

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* audio based set fix for --lm

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* fix bug

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* added jenkins test

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* update jenkins

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix header

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix lgtm

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* add dependency

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* moved mlmscore file

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* moved hybrid to nemo_text_processing folder

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* update jenkins

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix path

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix test

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

* fix dataset license

Signed-off-by: Yang Zhang <yangzhang@nvidia.com>

Co-authored-by: ekmb <ebakhturina@nvidia.com>
* Removed NLPDDPPlugin Import check

Signed-off-by: Virginia Adams <vadams@nvidia.com>

* Python formatting fix

Signed-off-by: Virginia Adams <vadams@nvidia.com>

* changed app to app_state

Signed-off-by: Virginia Adams <vadams@nvidia.com>

* moved num workers check back to bottom

Signed-off-by: Virginia Adams <vadams@nvidia.com>

* Python code reformat

Signed-off-by: Virginia Adams <vadams@nvidia.com>
* Add length ratio filtering script

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix example

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Remove extra quotes

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

Co-authored-by: Abhinav Khattar <aklife97@gmail.com>
…DIA#4557)

* add script

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>

* style fix

Signed-off-by: Abhinav Khattar <aklife97@gmail.com>
…ng rate

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>
… length

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>

Co-authored-by: Ante Jukić <ajukic@nvidia.com>
* add ngc catalan model resource

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update docs

Signed-off-by: stevehuang52 <heh@nvidia.com>
* fix

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Change max sequence length computation

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Add args and refactor to support bart, ul2

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Add to CI test

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Comments

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Attempt fix

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style fix

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Revert max seq length change

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix syntax error

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Revert BART change

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Remove comment

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* fix

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Remove unused import

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* added class for qa related metrics

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* removed BLEU code from QA metrics

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added classes for data handling and loading for BERT/T5/BART/GPT

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* removed unnecessary main function

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added classes for BERT, S2S(T5/BART), GPT question answering models

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* created separate modules for model specific input features

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* moved non-model methods to QAMetrics and refactored method names to be more intuitive

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* changed classmethods to staticmethods

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* removed unnecessary copyright

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* removed deprecated input features file

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* abstracted cache filename, feature loading, feature dumping to QADataset

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* removed unused imports and added dataclass decorator

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* removed unused imports and refactored method name

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added base class for QA models and abstracted out common methods

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* moved non-model eval code and predictions file dump to metrics class

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added combined example of train/eval/test/inference for all qa models

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* renamed qa example file

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* fixed trailing whitespaces

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added type casting to float for logger warning

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* removed unused import

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* converted cached filename creation to class method

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* moved common code in dataset classes to base class, renamed Features class to Example

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* converted base QA example class to dataclass

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* reduced code repetition in prediction evaluation

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* converted prediction output files to jsonl

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added flag for checking if ground truth present in context spans

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* converted predictions dump to jsonl from json

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* converted nbest predictions dump to jsonl from json

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* removed unused argument to no pad loss method

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added unit tests for qa metrics and dataset utilities

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* applied style fix on new files

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added integration tests

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* restored default values in qa config

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* renamed stage to avoid duplicate

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added init files for new modules

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* applied style fix for module init files

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added inline comments to make concise

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* specified class as abstract

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* specified .json format for output prediction files

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* created separate variable for answer in context check for readability

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* shifted stages to parallel

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* applied style fix

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* restored file modified by linter

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added transformers offline flag to true and moved all stages to parallel

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* moved inference code inside test_ds check

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added script for converting msmarco dataset to squad format

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added tutorial for question answering with generative models

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added copyright header

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* renamed old qa docs with _squad postfix and added docs for new qa modules

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* added generative qa architecture diagram

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* modified tutorial with colab testing changes, improved documentation

Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

* changed branch name to main in tutorial

* deprecated old QA tutorial

* deprecated old QA docs

* deprecated old QA example

* removed deprecated ci test for old qa example

* removed additional deprecated ci tests
* add kw asr models, add itn ru checkpoint (tagger-based)

Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com>

* add rw results to docs

Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com>

Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com>
Signed-off-by: Anas Abou Allaban <aabouallaban@pm.me>

Co-authored-by: Samuel Kriman <samuelkriman@gmail.com>
Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com>

Co-authored-by: Eric Harper <complex451@gmail.com>
…VIDIA#4553)

* fix tarred dataset len when num shards is not divisible by workers

Signed-off-by: Iztok Lebar Bajec <ilb@fri.uni-lj.si>

* update error reporting on invalid `shard_strategy`

* update NLP/PC tarred dataset docstring

* add `shard_strategy` to NLP/PC `@dataclass`

* update NLP/PC tarred dataset docstring

* add `shard_strategy` to NLP/PC docs

* revert test with Dataloader returning the actual data length

* make dataloader return actual num of samples, set `limit_train_batches` on `setup_*`

* update `shard_strategy` docstrings

Signed-off-by: Iztok Lebar Bajec <ilb@fri.uni-lj.si>

* update `tarred_dataset` documentation

Signed-off-by: Iztok Lebar Bajec <ilb@fri.uni-lj.si>

* fix style

* update documentation

Signed-off-by: Iztok Lebar Bajec <ilb@fri.uni-lj.si>

* updated docstrings

Signed-off-by: Iztok Lebar Bajec <ilb@fri.uni-lj.si>

Co-authored-by: PeganovAnton <peganoff2@mail.ru>
…nce (NVIDIA#4582)

[TTS][ASR] enabled overriding arguments for trimming the leading and trailing silence using librosa.effects.trim

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
…VIDIA#4607)

* auto switch conformer encoder adapter in_features

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

* update the norm and warmup default values in the adapters config file

Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>

Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
…ckpointing (rebased) (NVIDIA#4380)

* update gpt config and add docstring to parallel_lm_logits

Signed-off-by: ericharper <complex451@gmail.com>

* update parallel_lm_logits

Signed-off-by: ericharper <complex451@gmail.com>

* add CoreAttention and start updating ParallelAttention

Signed-off-by: ericharper <complex451@gmail.com>

* in progress

Signed-off-by: ericharper <complex451@gmail.com>

* fix args

Signed-off-by: ericharper <complex451@gmail.com>

* update ParallelTransformerLayer_

Signed-off-by: ericharper <complex451@gmail.com>

* update ParallelTransformer

Signed-off-by: ericharper <complex451@gmail.com>

* remove test_from_pretrained

Signed-off-by: ericharper <complex451@gmail.com>

* update args

Signed-off-by: ericharper <complex451@gmail.com>

* propagate args

Signed-off-by: ericharper <complex451@gmail.com>

* add transposes to GPTModel

Signed-off-by: ericharper <complex451@gmail.com>

* update matmul_input_buffer dynamically

Signed-off-by: ericharper <complex451@gmail.com>

* add sequence_parallel arg to post_language_model_processing

Signed-off-by: ericharper <complex451@gmail.com>

* allreduce sequence parallel layernorm

Signed-off-by: ericharper <complex451@gmail.com>

* typo

Signed-off-by: ericharper <complex451@gmail.com>

* flag is sequence_parallel_enabled

Signed-off-by: ericharper <complex451@gmail.com>

* add sequence parallel args

Signed-off-by: ericharper <complex451@gmail.com>

* add seq parallel arg to fwd/bwd func

Signed-off-by: ericharper <complex451@gmail.com>

* don't wrap model with ddp when using O2

Signed-off-by: ericharper <complex451@gmail.com>

* raise error when using method or num_layers with selective

Signed-off-by: ericharper <complex451@gmail.com>

* add sequence parallel for MixedFusedLayerNorm

Signed-off-by: ericharper <complex451@gmail.com>

* make sure checkpointing is set correctly

Signed-off-by: ericharper <complex451@gmail.com>

* make sure checkpointing is set correctly

Signed-off-by: ericharper <complex451@gmail.com>

* make sure checkpointing is set correctly

Signed-off-by: ericharper <complex451@gmail.com>

* check the right attribute

Signed-off-by: ericharper <complex451@gmail.com>

* fix args

Signed-off-by: ericharper <complex451@gmail.com>

* style

Signed-off-by: ericharper <complex451@gmail.com>

* don't sync after fwd/bwd if using seq par

Signed-off-by: ericharper <complex451@gmail.com>

* use base model for allreduce_grads

Signed-off-by: ericharper <complex451@gmail.com>

* remove extra layernorm

Signed-off-by: ericharper <complex451@gmail.com>

* revert norm_former_norm deletion

Signed-off-by: ericharper <complex451@gmail.com>

* move sync for allreduce grad to optimizer wrapper

Signed-off-by: ericharper <complex451@gmail.com>

* auto configure grad div ar fusion

Signed-off-by: ericharper <complex451@gmail.com>

* Initial rpe refactor

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Refactor RPE

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* add transposes to t5 forward

Signed-off-by: ericharper <complex451@gmail.com>

* Gradient Accumulation fusion to Linear layer weight gradient computation (NVIDIA#4494)

* Gradient Accumulation fusion to Linear layer weight gradient computation

* fix typo

* disable async when using gradient accumulation fusion

Signed-off-by: ericharper <complex451@gmail.com>

* add comment

Signed-off-by: ericharper <complex451@gmail.com>

* skip H2D copies of inputs (NVIDIA#4502)

add assert

* add rpe to core attention

Signed-off-by: ericharper <complex451@gmail.com>

* set sp to false if tp > 1. make cuda_device_max_connections configurable

Signed-off-by: ericharper <complex451@gmail.com>

* Refactor

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix key

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* convert to string

Signed-off-by: ericharper <complex451@gmail.com>

* revert

Signed-off-by: ericharper <complex451@gmail.com>

* revert

Signed-off-by: ericharper <complex451@gmail.com>

* determine if no_async for ColumnLinear

Signed-off-by: ericharper <complex451@gmail.com>

* add async_grad_allreduce to parallel_lm_logits for gpt

Signed-off-by: ericharper <complex451@gmail.com>

* add async_grad_allreduce to parallel_lm_logits for bert and t5

Signed-off-by: ericharper <complex451@gmail.com>

* disable gradient accumulation fusion when not using pipeline parallelism

Signed-off-by: ericharper <complex451@gmail.com>

* style

Signed-off-by: ericharper <complex451@gmail.com>

* Sequence parallel rebase with bugfixes (NVIDIA#4529)

* Support for class labels as strings

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix for micro/macro average

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Metric fix

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix geglu without fusion

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* style

Signed-off-by: ericharper <complex451@gmail.com>

* disable grad accumulation fusion with O1

Signed-off-by: ericharper <complex451@gmail.com>

* transpose prompt learning encoder input

Signed-off-by: ericharper <complex451@gmail.com>

* update container in jenkins

Signed-off-by: ericharper <complex451@gmail.com>

* add pleasefixme to retrieval tests

Signed-off-by: ericharper <complex451@gmail.com>

* comment retro test from jenkins

Signed-off-by: ericharper <complex451@gmail.com>

* Make RETRO SP compatible (NVIDIA#4565)

* fix sp for retro

Signed-off-by: Yi Dong <yidong@nvidia.com>

* more tests fixed

Signed-off-by: Yi Dong <yidong@nvidia.com>

* make it sp compatible

Signed-off-by: Yi Dong <yidong@nvidia.com>

* add transpose for bert

Signed-off-by: ericharper <complex451@gmail.com>

* fix bug in pooler

Signed-off-by: ericharper <complex451@gmail.com>

* remove unused import

Signed-off-by: ericharper <complex451@gmail.com>

* remove unused import

Signed-off-by: ericharper <complex451@gmail.com>

* fix output

Signed-off-by: ericharper <complex451@gmail.com>

* revert normformer delete

Signed-off-by: ericharper <complex451@gmail.com>

* use 22.07, comment broken jenkins test

Signed-off-by: ericharper <complex451@gmail.com>

* remove unused import

Signed-off-by: ericharper <complex451@gmail.com>

* revert test comment

Signed-off-by: ericharper <complex451@gmail.com>

* add model.optim.capturable=True

Signed-off-by: ericharper <complex451@gmail.com>

* add model.optim.capturable=True

Signed-off-by: ericharper <complex451@gmail.com>

* set num_workers=0

Signed-off-by: ericharper <complex451@gmail.com>

Co-authored-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Sangkug Lym <slym@nvidia.com>
Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com>
* Fix word boundaries

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Remove prints

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update notebook

Signed-off-by: smajumdar <smajumdar@nvidia.com>
Signed-off-by: Anas Abou Allaban <aabouallaban@pm.me>
* Support listing Hugging Face model info

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add documentation about usage

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add documentation about usage

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update name of method, support list of model filters

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Improve docstring

Signed-off-by: smajumdar <smajumdar@nvidia.com>
Signed-off-by: Ryan <rlangman@nvidia.com>
* Update audio_to_diar_label to train meeting data

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Style fix with --scope=nemo

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Style fix problem, re-run style fix

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Removed remaining commented lines

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Remove an unused variable

Signed-off-by: Taejin Park <tango4j@gmail.com>

* Reflected comments

Signed-off-by: Taejin Park <tango4j@gmail.com>

* style fixed

Signed-off-by: Taejin Park <tango4j@gmail.com>

* style fix for no reason

Signed-off-by: Taejin Park <tango4j@gmail.com>

Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
* Initial squeezeformer impl

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Start time reduce and recovery

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Working commit of time reduction and time recovery modules

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix issue with number of params being incorrect

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add initializations to the model

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix scheduler

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Remove float()

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Correct order of operations

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Correct order of operations

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update time reduce PE to only update PE and nothing else

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix initialization

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix PE usage

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Comment out k2 for now

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add usage comments to buffered ctc script

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update docs

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add squeezeformer configs for CTC

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Mark squeezeformer as experimental

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add Jenkinsfile test

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add Jenkinsfile test

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix style

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Replace all with /content/

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Try Jenkinsfile Fix with closure

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update ctc config

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update ctc config

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update ctc config

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add squeezeformer

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add squeezeformer

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix Jenkinsfile

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Fix Jenkinsfile

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Try closure

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Remove test

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Add back squeezeformer test

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Remove script tag

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Update for review comments

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Remove experimental

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Correct an issue with RNNT alignments

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Correct an issue with RNNT metrics

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Code formatting

Signed-off-by: smajumdar <smajumdar@nvidia.com>

* Correct offset calculation for no look ahead

Signed-off-by: smajumdar <smajumdar@nvidia.com>
paarthneekhara merged commit 1a57a1a into ssl_synthesis Jul 28, 2022
paarthneekhara pushed a commit that referenced this pull request Aug 4, 2022
paarthneekhara added a commit that referenced this pull request Dec 5, 2023
* multitask dataset and phoneme tts

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* rebased with Jason's branch

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* vocab size in sampling params issue

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* added 8 to prompt len to check

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* Bring back Attention mask

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* flash attention attention map config

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* attention prior false handling, and hardcoded attention mask to be used from dataset

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* loss mask was excluding the first label, so adjusted that

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* some comments for debugging eos id

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* inference related changes

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* custom inference wip

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* wip for phoneme tokenizer on tarred dataset

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* more batches in inference

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* handle attention prior not coming from the dataloader

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* some changes and minor bug fix

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

* option to use learned position embeddings

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>

---------

Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>
Co-authored-by: Jason <jasoli@nvidia.com>