Skip to content

NVIDIA Neural Modules 1.6.0

Compare
Choose a tag to compare
@ericharper ericharper released this 29 Jan 04:53
· 2330 commits to main since this release

ASR

  • Add new features to ASR with diarization with modified tutorial and README. by @tango4j :: PR: #3007
  • Enable stateful decoding of RNNT over multiple transcribe calls by @titu1994 :: PR: #3037
  • Move vocabs from asr to common by @Oktai15 :: PR: #3084
  • Adding parallel transcribe for ASR models - suppports multi-gpu/multi-node by @VahidooX :: PR: #3017
  • CTC Conformer fixes for ONNX/TS export by @borisfom :: PR: #3072
  • Adding pretrained French ASR models to ctc_bpe and rnnt_bpe listings by @tbartley94 :: PR: #3225
  • adding german conformer ctc and rnnt by @yzhang123 :: PR: #3242
  • Add aishell and fisher dataset processing scripts for ASR by @jbalam-nv :: PR: #3203
  • Better default for RNNT greedy decoding by @titu1994 :: PR: #3332
  • Add uniform ASR evaluation script for all models by @titu1994 :: PR: #3334
  • CTC Segmentation-Citrinet support by @ekmb :: PR: #3279
  • Updates on ASR with diarization util files by @tango4j :: PR: #3359
  • Asr fr by @tbartley94 :: PR: #3404
  • Refactor ASR Examples Directory by @titu1994 :: PR: #3392
  • Asr patches by @titu1994 :: PR: #3443
  • Properly support -1 for labels in ctc char models by @titu1994 :: PR: #3487

TTS

  • MixerTTS, MixerTTSDataset and small updates in tts tokenizers by @Oktai15 :: PR: #2859
  • ONNX and TorchScript support for Mixer-TTS by @Oktai15 :: PR: #3082
  • Update name of files to one style in TTS folder by @Oktai15 :: PR: #3189
  • Update TTS Dataset, FastPitch with TTS dataset and small improvements in HiFiGAN by @Oktai15 :: PR: #3205
  • Add Beta-binomial Interpolator to TTSDataset by @Oktai15 :: PR: #3230
  • Normalizer to TTS models, TTS tokenizer updates, AxisKind updates by @Oktai15 :: PR: #3271
  • Update Mixer-TTS, FastPitch and TTSDataset by @Oktai15 :: PR: #3366
  • Minor Updates to TTS Finetuning by @blisc :: PR: #3455

NLP / NMT

Text Normalization / Inverse Text Normalization

NeMo Tools

  • CTC Segmentation-Citrinet support by @ekmb :: PR: #3279
  • Updated NumPy SDE requirement by @vsl9 :: PR: #3442

Export

Documentation

Bugfixes

Improvements: