
Gradscaler flags #2281

Merged
merged 9 commits into from Dec 13, 2023
Conversation

Adel-Moumen
Collaborator

What does this PR do?

This PR adds GradScaler flags to ease the customisation of fp16 training. Furthermore, it adds back check_loss_isfinite so that training can be automatically interrupted when the model gets stuck on NaN/inf losses.
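For context, a minimal sketch of how such flags could be forwarded to torch's GradScaler (only gradscaler_init_scale appears in the diff below; the other names are hypothetical and simply mirror GradScaler's own arguments):

```python
# Sketch only, not the actual core.py change. ``gradscaler_init_scale`` comes from
# this PR's diff; the remaining flag names are hypothetical and mirror the
# corresponding torch.cuda.amp.GradScaler arguments.
import torch

def make_grad_scaler(
    precision="fp16",
    gradscaler_init_scale=2.0**16,
    gradscaler_growth_factor=2.0,
    gradscaler_backoff_factor=0.5,
    gradscaler_growth_interval=2000,
):
    return torch.cuda.amp.GradScaler(
        init_scale=gradscaler_init_scale,
        growth_factor=gradscaler_growth_factor,
        backoff_factor=gradscaler_backoff_factor,
        growth_interval=gradscaler_growth_interval,
        enabled=(precision == "fp16"),
    )
```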

Before submitting
  • Did you read the contributor guideline?
  • Did you make sure your PR does only one thing, instead of bundling different changes together?
  • Did you make sure to update the documentation with your changes? (if necessary)
  • Did you write any new necessary tests? (not for typos and docs)
  • Did you verify new and existing tests pass locally with your changes?
  • Did you list all the breaking changes introduced by this pull request?
  • Does your code adhere to project-specific code style and conventions?

PR review

Reviewer checklist
  • Is this pull request ready for review? (if not, please submit in draft mode)
  • Check that all items from Before submitting are resolved
  • Make sure the title is self-explanatory and the description concisely explains the PR
  • Add labels and milestones (and optionally projects) to the PR so it can be classified
  • Confirm that the changes adhere to compatibility requirements (e.g., Python version, platform)
  • Review the self-review checklist to ensure the code is ready for review

@Adel-Moumen Adel-Moumen changed the base branch from develop to unstable-v0.6 November 30, 2023 16:42
@Adel-Moumen
Collaborator Author

Hey @asumagic, could you please have a look at this PR? Do you feel it helps to further customise fp16 training?

@mravanelli mravanelli self-requested a review November 30, 2023 16:58
@mravanelli mravanelli added the enhancement (New feature or request) label Nov 30, 2023
@asumagic asumagic self-requested a review November 30, 2023 17:02
@@ -555,6 +582,14 @@ class and override any methods for which the default behavior does not
The location for performing computations.
precision (str)
One of ``fp32``, ``fp16``, ``bf16``.
gradscaler_init_scale (float)
Collaborator

I am not in favor of adding these parameters as basic Brain parameters. They will never be used by 99.99% of our users.

@@ -1133,6 +1174,7 @@ def fit_batch(self, batch):
scaled_loss = self.scaler.scale(
loss / self.grad_accumulation_factor
)
self.check_loss_isfinite(scaled_loss)
Collaborator

Checking that all parameters are finite is redundant with the GradScaler; it already does this. It can also be crazy expensive for very large models. Checking whether the loss is finite makes sense, not the parameters.

Collaborator
@asumagic asumagic Dec 1, 2023

The grad scaler does this, but it does not care how long the parameters have been non-finite. An idea I suggested yesterday was to occasionally check the gradscaler scale for insane values with a patience mechanism, as I've sometimes seen the scale vanish or explode when issues occurred.
That does induce a CPU-GPU sync, though I'm not sure how often we have one in the first place.
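A rough sketch of that patience idea (thresholds and names are illustrative, not SpeechBrain code):

```python
# Illustrative only: periodically inspect the GradScaler's scale and give up once
# it has looked implausible for ``patience`` consecutive checks. The thresholds
# are made up; scaler.get_scale() is the call that triggers the CPU-GPU sync.
def scale_is_sane(scaler, state, patience=10, low=1e-6, high=1e12):
    scale = scaler.get_scale()
    if scale < low or scale > high:
        state["bad_checks"] = state.get("bad_checks", 0) + 1
    else:
        state["bad_checks"] = 0
    return state["bad_checks"] <= patience
```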

Collaborator Author

I agree with @TParcollet. I will remove the part where I'm checking for NaNs/infs in the weights and will only cover the loss part. BTW, the function was also intended for non-GradScaler use cases like fp32.
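For reference, a plausible shape for the loss-only check (the actual check_loss_isfinite in this PR may differ; nonfinite_patience is illustrative):

```python
import torch

def check_loss_isfinite(self, loss, nonfinite_patience=3):
    """Stop training after several consecutive non-finite losses (sketch)."""
    if not torch.isfinite(loss):
        self.nonfinite_count = getattr(self, "nonfinite_count", 0) + 1
        if self.nonfinite_count > nonfinite_patience:
            raise ValueError(
                "Loss is not finite over several consecutive steps; "
                "interrupting training."
            )
    else:
        self.nonfinite_count = 0
```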

@mravanelli
Collaborator

Personally, I'm not against adding these hparams. @Adel-Moumen, what is the typical use case for them?

@asumagic
Collaborator

asumagic commented Dec 1, 2023

I agree it might be a bit weird to have those flags exposed directly as CLI parameters in the core, especially since it is not obvious that those relate to the gradient scaler from their name alone.

Maybe it could be specified as an optional brain parameter of some sort? Not sure how feasible that is and how ergonomic it would be to use in the CLI.

@Adel-Moumen
Collaborator Author

it is not obvious that those relate to the gradient scaler from their name alone.

They all start with gradscaler as the root of their name, which suggests that they are related to the gradscaler :p

TBH, if you think no one will use them, I can remove them. However, I'd like to understand a bit better how to deal with these situations in the future. For instance, if I have some parameters like the gradscaler init_scale, how should I give users a way to modify them, without having to modify core.py manually AND without adding new flags to the Brain class?

@asumagic
Collaborator

asumagic commented Dec 1, 2023

They all start with gradscaler as the root of their name, which suggests that they are related to the gradscaler :p

Whoops, I didn't double-check the code this morning. It wasn't the case when I checked yesterday :P

@mravanelli
Collaborator

I think @Adel-Moumen is raising a good point here. Even if we think that most users won't change these options, what are users supposed to do if they want to change them? Change core.py directly?

@asumagic
Collaborator

I think @Adel-Moumen is raising a good point here. Even if we think that most users won't change these options, what are users supposed to do if they want to change them? Change core.py directly?

You can hijack the Brain constructor in your training script to override the grad scaler defined by the core, but it's not very pretty...
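Something like this sketch, assuming the scaler is stored as self.scaler as in the diff above (not an officially supported knob):

```python
import torch
import speechbrain as sb

class MyBrain(sb.Brain):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        # Override the scaler built by core.py with a customised one.
        self.scaler = torch.cuda.amp.GradScaler(init_scale=2.0**10)
```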

@mravanelli
Collaborator

As discussed privately, we removed all the gradscaler options but kept the check on the loss. Thank you @Adel-Moumen

@mravanelli mravanelli merged commit 30ca4b2 into unstable-v0.6 Dec 13, 2023
5 checks passed
@mravanelli mravanelli deleted the gradscaler_flags branch December 13, 2023 15:55
mravanelli added a commit that referenced this pull request Jan 7, 2024
* Refactor HF interface, adapt recipes (this message repeats for dozens of consecutive commits in the squashed history)

* rename HF's files

* fix docstrings

* fix args docstrings

* fix docstrings

* change classes' names

* Refactor HF interface, adapt recipes (again repeated for dozens of consecutive commits)

* Fix docstrings

* commonvoice

* switchboard

* update readme

* update readme

* update lionk in test file

* remove unused space token

* update torchaudio

* remove deprecated language model path

* fix merge

* fix vocab

* fix switchboard

* commit

* fix test

* fix style

* remove unsued hparam

* fix consistancy blank_skip_threshold

* text frames

* CTCPrefixBeamSearcher timestamps

* pre-commit

* test

* test 2

* fix prints

* update ctcprefixbeamsearch timestamps

* remove frames from prefix bs

* Revert "remove frames from prefix bs"

This reverts commit 30900d9.

* remove prefix bs

* Revert "remove prefix bs"

This reverts commit 2f0c3cd.

* Revert "update ctcprefixbeamsearch timestamps"

This reverts commit ce09e19.

* Revert "fix prints"

This reverts commit bf36037.

* Revert "test 2"

This reverts commit 84cda94.

* Revert "test"

This reverts commit f17349f.

* Revert "pre-commit"

This reverts commit 4e1cf0d.

* Revert "CTCPrefixBeamSearcher timestamps"

This reverts commit c3d3cf7.

* Revert "text frames"

This reverts commit e67c761.

* Revert "fix consistancy blank_skip_threshold"

This reverts commit f97a391.

* Update ctc.py

* arg / timestamps

* precommit

* timesteps -> text_frames

* ls seq2seq

* transformer ls

* fix naming

* librispeech

* aishell

* fix linter

* precommit

* switchboard

* timit

* Dynamic batching fixed

* authors

* fix conformer large

* indent

* Revert "Fix dynamic batching" (#2173)

* update doctest skip

* Fix dynamic batching (#2174)

* Revert "Revert "Fix dynamic batching" (#2173)"

This reverts commit faa5e76.

* Update interfaces.py

* Update interfaces.py

* Update text_to_sequence.py

* fix w2v

* aishell

* cv

* ls transformer

* ls ssl

* switchboard

* timit

* precommit

* fix indent

* fix arg

* unit test sorting

* unittests

* remove if main

* Small fixes in averaging checkpoints (#2181)

* add ckpt avg unittest

* avoid hard-coding number of averages

* last fixes

* fix recipe test

* fix recipe test

* convert print into logger

* fix transducer recipe

* remove typing

* fix merge

* precommit

* Update LibriSpeech.csv

* update to new dynamic batching args

* Update unstable branch with new commits  (#2196)

* hyper branch/conf -former fixes

* remove ctc.py from doctest

* get back ctc.py

* remove doctest for torchaudio

* adapt gpt recipe

* adapt gpt recipe

* small follow up fix on openrir

* remove doc test (for now)

* fix issue greedy search

* docstring

* pre-commit

* Fix issues unstable (#2216)

Thank you @Adel-Moumen! I did the tests again and everything works now. As for your points on the recipe tests, I agree. We can eventually do that in another PR.

* Fix missing file / import in huggingface_transformers (#2224)

* init/imports

* comment

* add partial import

* wav2vec -> wav2vec2

* fix ci

* Text based HF (#2214)

* add mbart

* Add tristage scheduler

* Add mbart beam search

* Add IWLST recipes

* Add new models' inteference interface

* Add info of new models

* Add nllb scores

* Add new models' info

* Add test info IWSLT recipe

* Add test info IWSLT recipe

* add docstrings for S2STransformerBeamSearcher

* Update IWSLT recipes

* Update IWSLT recipes

* fix doctest

* add requirements

* add protobuf

* fix doctest

* small fixes

* Add protobuf install

* Minor reform

* Remove protobuf

* Fix docstings

* Fix docstrings

* minor reform

* remove labse

* change authorship

* remove comments

* minor changes

* change authorship

* Fix recipe test

* add info

* Update README.md

* Update README.md

* change recipe structure

---------

Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>
Co-authored-by: Adel Moumen <88119391+Adel-Moumen@users.noreply.github.com>

* Neural LM Rescoring (#2187)

* baserescorerinterface

* add rescorers

* first attempt

* update code

* 1.57 wer

* update

* update code

* update code

* docstring example rnn

* updata loader

* docstring example

* tests

* docstring example

* update

* tmpdir

* change path

* update doc

* docstring

* docstring args

* doctest

* fix docstring example

* unnittest

* interface

* yamls update

* full_infernece tests

* model link

* readme

* yaml/inference tests

* update res

* fix wav2vec with wav2vec2

---------

Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* Add wrappers for Encodec and Vocos vocoders (#2231)

* Add wrappers for Encodec and Vocos from Huggingface

* Encodec: Add a comment

* Encodec/Vocos: Add examples, restructure, fix masks

* Vocos: Add a comment about the open pull request

* Encodec/Vocos: Add the ability to customize save_path, fix a log message

* Encodec/Vocos: Cosmetic changes

* Vocos: Cosmetic changes

* Encodec/Vocos: Remove the mandatory Vocos requirement

* Vocos: Remove vocos from __init__.py

* fix init

* Vocos: Add a check for vocos in conftest.py

* Vocos/Encodec: Update documentation, add bandwidth control

* Fix old path in conftest.py

* Cosmetic changes

* Encodec/Vocos: Add support for embedding vectors

* Encodec: Update example

* Encodec/Vocodec: Add automatic reshaping, minor cosmetic changes

---------

Co-authored-by: flexthink <flexthink@users.noreply.github.com>
Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* Semantically-Aligned Multimodal Utterance-level (SAMU) pre-training (#2223)

* add mbart

* Add tristage scheduler

* Add mbart beam search

* Add IWLST recipes

* Add new models' inteference interface

* Add info of new models

* Add nllb scores

* Add new models' info

* Add test info IWSLT recipe

* Add test info IWSLT recipe

* add docstrings for S2STransformerBeamSearcher

* Update IWSLT recipes

* Update IWSLT recipes

* fix doctest

* add requirements

* add protobuf

* fix doctest

* small fixes

* Add protobuf install

* Minor reform

* Remove protobuf

* Fix docstings

* Fix docstrings

* minor reform

* remove labse

* Add attention pooling

* Add labse

* Add info about SAMU

* add iwslt recipes with samu

* fix recipe test

* fix comments

* fix recipe test

* change recipe structure

* fix test recipe

* Add new recipes

* minor doctest change

* minor doctest change

* small changes

* add dropbox links

---------

Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* fix norm (#2237)

* Discrete SSL (#2233)

* clustering training recipies for LibriSpeech for different SSL model

* add Discrete Hubert Model

* load from HF, fix minor issues

* fix hyper-param value

* fix precommit

* fix flake8

* fix batch_size and n_clus values in hyperparams

* fix typos

* fix typo and some cleaning

* fix precommit

* fix device incompatibility and memroty issue

* use fit instead of partial fit

* add README file

* add test recipies

* remove unused fields from hparams

* fix precommmit-yamllint - extra whitespace

* add docstring for load_kmeans for Discrete_hubert.py

* add discrete wavlm, wav2vec

* avoid docstring testing for discrete_ssl models

* fix docstring failed issue

* add discrete_interface to conftest.py

* fix precommit

* Fixes for Encodec (#2240)

* Add wrappers for Encodec and Vocos from Huggingface

* Encodec: Add a comment

* Encodec/Vocos: Add examples, restructure, fix masks

* Vocos: Add a comment about the open pull request

* Encodec/Vocos: Add the ability to customize save_path, fix a log message

* Encodec/Vocos: Cosmetic changes

* Vocos: Cosmetic changes

* Encodec/Vocos: Remove the mandatory Vocos requirement

* Vocos: Remove vocos from __init__.py

* fix init

* Vocos: Add a check for vocos in conftest.py

* Vocos/Encodec: Update documentation, add bandwidth control

* Fix old path in conftest.py

* Cosmetic changes

* Encodec/Vocos: Add support for embedding vectors

* Encodec: Update example

* Encodec/Vocodec: Add automatic reshaping, minor cosmetic changes

* Encodec: Decoupled token extraction, fixed CPU/GPU issues

* Encodec: Add renormalization

---------

Co-authored-by: flexthink <flexthink@users.noreply.github.com>
Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* Refactoring of the 'fit_batch' function (#2010)

* add dataclass

* turn False

* remove valid_step

* update core.py

* update core.py

* update core.py

* precommit

* self.autocast + GradScaler enabled

* freeze opt

* naming

* update core.py

* comments

* example transducer conformer

* update core.py

* small changes

* naming + skip_grad_nans

* doc

* check

* support cpu training

* precision + doctrsting

* name

* change w2v

* restore ckpt

* remove file

* remove casting

* tests

* whisper + fix tests

* seq2seq ls

* update transducer / transformer

* remove on_optimizers_step_end + comments

* update check yaml

* remove default arg

* add precision in yamls

* add precision inside of the yamls

* ckpt and scaler

* run_opt outside brain + test

* several recipe updates

* improve w2v fit_batch fn

* add arg

* update name

* timit

* context manager

* on_fit_batch_start

* update CV

* should_step with noam

* add flag precision

* naming

* aishell

* aishell

* update recipes

* so many recipes 0.0

* update recipes

* last recipes

* zero_grad

* fix grad_accumulation_factor

* update recipes

* update auto_mix_prec flag

* remove opt flag test

* librispeech

* cv ssl

* audio mnist / realm

* voicebank

* fix rescuespeech

* fix lr annealing

* libritts

* multiwoz

* slurp nlu

* should_step

* update yamls

* update yaml

* update batch smpler tedlium

* remove fit batch

* precision flag

* update sampler

* add precision inside of the yamls

* run_opt outside brain + test

* fix auto_mix_prec flag

* docstring

* grad acc

* failing test

* update unittests

* update jarod's pr

* fix removed avg_checkpoint param

* update path

* fix some recipe tests

* update samu recipe

* fix hifigan/IWSLT

* tedlium

---------

Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* Refactor Augmentation (#2206)

* update

* update

* change folder

* remove unnecesary file

* update folder structure

* add noise, add rev

* augmenter refactor

* refactor augment + example in templace

* fix tests + linters

* address comments

* supporting variable-length augmentations in augmenter (e.g., speed change)

* lib refactor (splitting time and freq augmentations)

* fine tune freq drop

* refactor of specaugment (freq-domain) - part 1

* converted specaument (freq domain)

* refactor random shift

* implemented cutcat, swap, and random selection

* extended unittests + small fixes

* improvements and fixes in augment

* plugged feature augmentation + various fixes and improvements

* add sum_batch noise (similat to babble) + various fixes

* add drop bit resolution

* added coded augmentation

* added more unittests

* restore all augmentations

* making AddReveb more similar to AddNoise

* fix device mismatch + fix last batch management

* add workes to speed up AddNoise and AddRev

* improve comments in template yaml

* speed up template (sorting dev and test)

* extend augmenter by adding activation provability

* implemented enable augmentation flag (useful of hparam tuning) + other improvements

* plugged coded augment

* fixed coded augment

* remove old files

* fix integration test

* remove knowledge distill TIMIT reicpes. Too many yaml files to maintain

* convert TIMIT

* fix recipe

* converted templates using EnvCorr

* converted voxceleb

* converted GSC + fixes on voxceleb

* convrted UrbanSound8k

* converted voicebank

* converted other recipes

* converted CommonLanguage, VoxLingua, timers-and-such

* converted all recipes using envcorr

* CommonVoice

* REAL-M

* Aishell1Mix

* LibriMix

* converted all recipes!

* fix linters - part1

* fix linters - part2

* add a note in the template regarding augmentation

* fix docstring tests

* fix yamls

* remove coded tests from docstring

* revised coded tests

* fix identation in codec.py

* try to fix doc issue

* revise lib header in codec.oy

* fix doc

* fix doc attempt

* rename sections

* fix doc

* fix (most) recipe tests

* fix other recipe tests

* address comments

* fix yaml

* fix

* convert recipe

* fix recipes

* fix aug in rescoring recipes

* Delete tmpdir_vocoder directory

* Refactor Inference (files and folders) (#2252)

* refactor inference files and folders

* fix some tests

* fix some tests

* fix doctest

* import lib

* small fixes

* Fix beam search (#2253)

* fix starting pos prefix_length

* block path ctc + fix default value to the old one

* fix issue with score being -inf

* remoev print

* precommit

* Fix ctc beam search (#2263)

* fix logprobs / space_token / warnings

* fix space_token

* pre-commit

* space_token

* simplify parameters

* simplify yamls

* remove comma

* update beam search

* fix vocab/str (#2265)

* Fix blank index ctc (#2266)

* update blank_index

* whisper

* revert change

* mistake

* Cv unstable merge (#2254)

* add fr preproccesing to Common_voice_prepare.py

* add CV , CTC, new languages

* fix precommit and test

* add transducer recipie

* add transformer recipies

* update augmentation of CTC recipies

* update seq-to-seq recipies

* fix whisper HF interface bug. (return str insted of list)

* fix recipe tests

* add fr preproccesing to Common_voice_prepare.py

* add CV , CTC, new languages

* fix precommit and test

* add transducer recipie

* add transformer recipies

* update augmentation of CTC recipies

* update seq-to-seq recipies

* fix whisper HF interface bug. (return str insted of list)

* fix recipe tests

* modify beamsearch for CTC: ar.es.pt and zh-CN

* fix interface conflict

* fix transducer interface bug

---------

Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* Add warnings and fix numba (#2271)

* upperbound torch/tochaudio + remove opt dependancy

* add back automix/bf flags

* linters

* oops

* transformers back

* test requirements

* Fix Bug: CommonVoice Transformer Bug loading correct optimizer (#2278)

* fix trnsfrm bug to load correct opt:adam vs sgd

* add  data_root to the path of common_voice_prepare.py

* add epoch/_counter pretrainer to fr and it recepie

* revert releative path change

* fix opt bug without the need to add epoch_ckpt

* add log and delete launch file

* update the log message

* update WeightedSSLModel (#2272)

* update WeightedSSLModel

* requirements.txt

* fix pre-commit

* Sg/dac (#2246)

* introducing DAC

* lint errors

* black

* documenttion

* remove unused init file

* Fixing tests

* More doc strings

* More doc strings

* PR review

* PR review

* PR review

* Update dac.py

* Update dac.py

* Update dac.py

* make doctests smaller to avoid memory issues in CI

* even smaller tests

---------

Co-authored-by: Shubham Gupta <shubhamgupta@Shubhams-MacBook-Pro-2.local>
Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* add quantization recipies fro IEMCAP, CV, LibriSpeech and LJSpeech (#2255)

* add quantization recipies fro IEMCAP, CV, LibriSpeech and LJSpeech

* update discrete_ssl models

* add iemocap_prepare to main folder + add test

* ix test for iemocap

* fik typos

* fix test recepies,  minor dormat editting

* fix typo in coomonvoice.csv

* fix typo in yaml file

* fix doctests (those that we do not run in the CI)

---------

Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* change emdedding type from long to float to vaoid getting al zeros embedding (#2292)

* Update CVSS (#2285)

* Update CVSS

* Update train_fr-en.yaml

* Update train_fr-en.yaml

* Update HF interface (#2293)

* RNN Tranducer Numba Loss: Add FP16 and BF16 support (code from Samsung AI Cambridge) (#2296)

* Make lobes use fp32 when AMP is active (#2295)

* Added utils.autocast with a fwd_default_precision function

* Decorate all lobes to require float32 precision in AMP

* Fix trailing space in docstring

* Less confusing doc for fwd_default_precision

* Be explicit that only fp inputs are affected by fwd_default_precision

* Typo in docstring

* Remove dtype annotation that is broken for some reason

* Precommit checks will be the end of me

* Fix tests

* Add docstring to precision wrapper function

* Fix style check again..

* adding support for fp16 transducer loss numba

* adding support for fp16 transducer loss numba

* fix fp16 transducer recipe

* add note on half precision

---------

Co-authored-by: asu <sdelang@sdelang.fr>
Co-authored-by: Titouan Parcollet/Embedded AI /SRUK/Engineer/Samsung Electronics <t.parcollet@sruk-ccn4.eu.corp.samsungelectronics.net>
Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* Fix recipe tests for TransformerASR (#2282)

* fix position embedding (#2283)

* fix position embedding

* use speechbrain internal postional encoding and generate mask from sequence lengths

* call mask function from core for tacotron

* minor fix

* fix device

* reduce training epochs

* update links

---------

Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* Gradscaler flags (#2281)

* add flags for gradscaler

* add check_loss_isfinite

* update dict

* typo

* remove default

* better message

* fix pre-commit

* remove checks

* remove new arguments

---------

Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* add llama2 recipies (#2299)

* add llama2 recipies

* fix symbolic links

* fix  bug

* remove unneccary input in docstring

* fix typo

* cleaning llama2 recepies

* update readme

* update interface and add licence to readme

* fic doc string

* fix precommit

* fix extra-dependency

* remove  commented lines

* inter epoch checkpoint

* minor fixes

* add extra req info in llama.py

* fix linters

---------

Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* small fixes

* make all recipes cpu-compliant + make recipe tests passing on both cpu and gpu

* fix some broken links

* remove link to private HF repo

* remove link to private HF repo

* fix libritts recipe test

* fix ljspeech recipe test

* Streamable Conformer-Transducer ASR model for LibriSpeech (#2140)

* Introduce DCT+DCConv logic

* DDP fix?

* Batch of changes and things brought back

* Streaming fixes (successfully trains)

* WIP streaming code

* WIP functional streaming code

* Fix left context

* Fix formatting

* Cleanups and docs in streaming utils

* Better comment hparams, change seed back to orig, improve naming

* uncomment averaging stuff; it was some ipython issue

* Remove pin_memory as it was not beneficial

* More cleanups, comments on context stuff

* More comments and TODOs

* encode_streaming docstring

* Dirty TransducerBeamSearcher change for streaming GS

* Fix precommit

* Fix encoders that do not support chunk_size

* Pre-commit again

* Make chunk_size type consistent

* Fix formatting of doctest in split_wav_lens

* Remove outdated TODO

* Add hasattr streaming to retain model backcompat

* Cleanup doc and naming for transducer_greedy_decode

* Cite paper for chunked attention

* Remove lost comment

* Update comment in self-attention

* Don't apply masked fill fix in the non-bool mask case

* Added TODO README update

* Revert change to custom_tgt_module; patching model instead

* Remove added entry in README

* Fix streaming conformer conv mismatch

* More conformer conv adjustments

* Adjust context size

* Remove outdated comment

* Fixed causal conformer decoder

* Fix linting

* Gate `custom_tgt_module` creation behind the presence of decoder layers

* Re-enable checkpoint averaging

* Change averaged ckpt count to 10

* Add new model results to README

* WIP refactor: Introduce DCTConfig dataclass

* Improved notice in README

* Formatting and linting fixes

* Attempt at fixing circular import?

* utils can't depend on core it seems; move dct

* Whoops, missed file

* Add DCT test, fix issues

* Remove now obsolete yaml variables for streaming

* Formatting

* Add dummy dct_config parameter to keep unsupported encoders working

* Linting fix

* Fix typo

* Add note on runtime autocast accuracy

* Fix very bad typo from refactor in YAML

* Fix hasattr streaming check

* Remove legacy comment

* Fix left context size calculation in new mask code

* Fix causal models in TransformerASR

* Remove comment on high-level inference code

* YAML formatting + commenting dynchunktrain stuff

* Remove outdated comment about DCConv left contexts

* Remove commented out debug prints from TransformerASR

* Move DCT into utils again

* Rename all(?) mentions of DCT to explicit dynamic chunk training

* Clarify padding logic

* Remove now-useless _do_conv, fix horrible formatting

* Slightly fix formatting further

* Add docstrings to forward_streaming methods

* Add a reference on Dynamic Chunk Training

* Rework conformer docstring docs

* Update conformer author list, fix doc formatting for authors

* Fix trailing whitespace in conformer

* Improved comments in Conformer.forward

* Added random dynchunktrain sampler example

* More explicit names for mask functions in TransformerASR

* Added docstring example on encode_streaming

* Pre-commit fix

* Fix typo in conformer

* Initial streaming integration test

* Precommit fix

* Fix indent in YAML

* More consistent spelling in streaming integration test

* Update CommonVoice.csv

* Add KenLM n-gram training recepie (#2304)

* add kenlm training

* fix precommit

* update readmefile with new result

* fix pre-commit

* fix typo

* fix commit reviews

* fix bug in testing

* add docstring and fix indentation

* fix bug in ASR interface

* change encoderasr interface to support ctc beam

* add suppourt fro kenlm in enoderasr interface

* fix typo

* little changes in REAMDE files to improve clarity)

* use binaries sources in bashrc

* fix trailing-whitespace

---------

Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* Create Performance file (automatically) (#2314)

* add performance readme builder

* update recipe csv files

* update README files

* add not in prerelease test

* added performance.md

* fix linters

* update info in README

* Llama2 interface bug (#2318)

* fix llama2 interface bug

* fix minor bug

* update multiwox.csv with correct db and HF link

* New README file (#2315)

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Optimize masked Dynamic Chunk Convolution (#2308)

* Reorganized some conformer convolution module to be faster

* Completely get rid of the list of slices in the conformer conv module

* Fix linter check

* Remove unused variable

* More unused variables..

* Remove unused import

* Add conformer streaming code path test

* Fix test formatting

* small fixes in tests

* Update RNNLM.yaml

* BayesSpeech (#2326)

* Create train_bayesspeech.py

* Create bayesspeech.yaml

* Update README.md

* Update LibriSpeech.csv

* add extra-req

---------

Co-authored-by: Mirco Ravanelli <mirco.ravanelli@gmail.com>

* adding new controllable exp scheduler

* adding new controllable exp scheduler

* update performance file

* Update PERFORMANCE.md

* Update README.md

---------

Co-authored-by: mhn226 <mhn.22692@gmail.com>
Co-authored-by: Adel Moumen <88119391+Adel-Moumen@users.noreply.github.com>
Co-authored-by: Adel Moumen <adelmoumen.pro@gmail.com>
Co-authored-by: Ha Nguyen <43038599+mhn226@users.noreply.github.com>
Co-authored-by: flexthink <1496671+flexthink@users.noreply.github.com>
Co-authored-by: flexthink <flexthink@users.noreply.github.com>
Co-authored-by: Pooneh Mousavi <moosavi.pooneh@gmail.com>
Co-authored-by: shubham-gupta-30 <127571426+shubham-gupta-30@users.noreply.github.com>
Co-authored-by: Shubham Gupta <shubhamgupta@Shubhams-MacBook-Pro-2.local>
Co-authored-by: Parcollet Titouan <parcollet.titouan@gmail.com>
Co-authored-by: asu <sdelang@sdelang.fr>
Co-authored-by: Titouan Parcollet/Embedded AI /SRUK/Engineer/Samsung Electronics <t.parcollet@sruk-ccn4.eu.corp.samsungelectronics.net>
Co-authored-by: Luca Della Libera <34525085+lucadellalib@users.noreply.github.com>
Co-authored-by: Yingzhi WANG <41187612+BenoitWang@users.noreply.github.com>
Co-authored-by: BenoitWang <wangyingzhi666@gmail.com>