-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'main' into dialogue_state_tracking_refactor
- Loading branch information
Showing
416 changed files
with
57,420 additions
and
40,695 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -2,6 +2,9 @@ name: CI-Import-Check | |
|
||
on: | ||
push: | ||
pull_request: | ||
paths: | ||
- "**" | ||
|
||
jobs: | ||
ci-import-check: | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,4 @@ | ||
Model,Model Base Class,Model Card | ||
stt_zh_citrinet_512,EncDecCTCModel,"https://ngc.nvidia.com/catalog/models/nvidia:nemo:stt_zh_citrinet_512" | ||
stt_zh_citrinet_1024_gamma_0_25,EncDecCTCModel,"https://ngc.nvidia.com/catalog/models/nvidia:nemo:stt_zh_citrinet_1024_gamma_0_25" | ||
stt_zh_conformer_transducer_large,EncDecCTCModel,"https://ngc.nvidia.com/catalog/models/nvidia:nemo:stt_zh_conformer_transducer_large" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,17 @@ | ||
Resources and Documentation | ||
--------------------------- | ||
|
||
Hands-on speech recognition tutorial notebooks can be found under `the ASR tutorials folder <https://github.com/NVIDIA/NeMo/tree/v1.0.2/tutorials/asr/>`_. | ||
If you are a beginner to NeMo, consider trying out the `ASR with NeMo <https://github.com/NVIDIA/NeMo/tree/v1.0.2/tutorials/asr/ASR_with_NeMo.ipynb>`_ tutorial. | ||
This and most other tutorials can be run on Google Colab by specifying the link to the notebooks' GitHub pages on Colab. | ||
|
||
If you are looking for information about a particular ASR model, or would like to find out more about the model | ||
architectures available in the `nemo_asr` collection, refer to the :doc:`Models <./models>` section. | ||
|
||
NeMo includes preprocessing scripts for several common ASR datasets. The :doc:`Datasets <./datasets>` section contains instructions on | ||
running those scripts. It also includes guidance for creating your own NeMo-compatible dataset, if you have your own data. | ||
|
||
Information about how to load model checkpoints (either local files or pretrained ones from NGC), as well as a list of the checkpoints | ||
available on NGC are located on the :doc:`Checkpoints <./results>` section. | ||
|
||
Documentation regarding the configuration files specific to the ``nemo_asr`` models can be found on the :doc:`Configuration Files <./configs>` section. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,24 @@ | ||
|
||
Resource and Documentation Guide | ||
-------------------------------- | ||
|
||
Hands-on speaker diarization tutorial notebooks can be found under ``<NeMo_git_root>/tutorials/speaker_tasks/``. | ||
|
||
There are tutorials for performing inference using :ref:`MarbleNet_model` and :ref:`TitaNet_model`, | ||
and how one can get ASR transcriptions combined with Speaker labels along with voice activity time stamps with NeMo asr collections. | ||
|
||
Most of the tutorials can be run on Google Colab by specifying the link to the notebooks' GitHub pages on Colab. | ||
|
||
If you are looking for information about a particular model used for speaker diarization inference, or would like to find out more about the model | ||
architectures available in the `nemo_asr` collection, check out the :doc:`Models <./models>` page. | ||
|
||
Documentation on dataset preprocessing can be found on the :doc:`Datasets <./datasets>` page. | ||
NeMo includes preprocessing scripts for several common ASR datasets, and this page contains instructions on running | ||
those scripts. | ||
It also includes guidance for creating your own NeMo-compatible dataset, if you have your own data. | ||
|
||
Information about how to load model checkpoints (either local files or pretrained ones from NGC), perform inference, as well as a list | ||
of the checkpoints available on NGC are located on the :doc:`Checkpoints <./results>` page. | ||
|
||
Documentation for configuration files specific to the ``nemo_asr`` models can be found on the | ||
:doc:`Configuration Files <./configs>` page. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
|
||
Resource and Documentation Guide | ||
-------------------------------- | ||
|
||
Hands-on speaker recognition tutorial notebooks can be found under | ||
`the speaker recognition tutorials folder <https://github.com/NVIDIA/NeMo/tree/main/tutorials/speaker_tasks/>`_. This and most other tutorials can be run on Google Colab by specifying the link to the notebooks' GitHub pages on Colab. | ||
|
||
If you are looking for information about a particular SpeakerNet model, or would like to find out more about the model | ||
architectures available in the ``nemo_asr`` collection, check out the :doc:`Models <./models>` page. | ||
|
||
Documentation on dataset preprocessing can be found on the :doc:`Datasets <./datasets>` page. | ||
NeMo includes preprocessing and other scripts for speaker_recognition in <nemo/scripts/speaker_tasks/> folder, and this page contains instructions on running | ||
those scripts. It also includes guidance for creating your own NeMo-compatible dataset, if you have your own data. | ||
|
||
Information about how to load model checkpoints (either local files or pretrained ones from NGC), perform inference, as well as a list | ||
of the checkpoints available on NGC are located on the :doc:`Checkpoints <./results>` page. | ||
|
||
Documentation for configuration files specific to the ``nemo_asr`` models can be found on the | ||
:doc:`Configuration Files <./configs>` page. | ||
|
||
|
||
For a clear step-by-step tutorial we advise you to refer to the tutorials found in `folder <https://github.com/NVIDIA/NeMo/tree/main/tutorials/speaker_tasks/>`_. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
Resource and Documentation Guide | ||
-------------------------------- | ||
|
||
Hands-on speech classification tutorial notebooks can be found under ``<NeMo_git_repo>/tutorials/asr/``. | ||
There are training and offline & online microphone inference tutorials for Speech Command Detection and Voice Activity Detection tasks. | ||
This and most other tutorials can be run on Google Colab by specifying the link to the notebooks' GitHub pages on Colab. | ||
|
||
If you are looking for information about a particular Speech Classification model or would like to find out more about the model | ||
architectures available in the `nemo_asr` collection, check out the :doc:`Models <./models>` page. | ||
|
||
Documentation on dataset preprocessing can be found on the :doc:`Datasets <./datasets>` page. | ||
NeMo includes preprocessing scripts for several common ASR datasets, and this page contains instructions on running | ||
those scripts. | ||
It also includes guidance for creating your own NeMo-compatible dataset, if you have your own data. | ||
|
||
Information about how to load model checkpoints (either local files or pretrained ones from NGC), perform inference, as well as a list | ||
of the checkpoints available on NGC are located on the :doc:`Checkpoints <./results>` page. | ||
|
||
Documentation for configuration files specific to the ``nemo_asr`` models can be found on the | ||
:doc:`Configuration Files <./configs>` page. |
Oops, something went wrong.