fix save_best missing chpt bug, update for setup_tokenizer() changes #3932

Merged
ericharper merged 2 commits into r1.8.0 from duplex_fix
Apr 5, 2022

Conversation

ekmb
Collaborator

@ekmb ekmb commented Apr 5, 2022

What does this PR do?

[NLP] bug fixes for duplex model

Collection: NLP

Changelog

  • Fix the save_best missing checkpoint (chpt) bug.
  • Update the duplex model for the setup_tokenizer() changes.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • [X] Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

ekmb added 2 commits April 4, 2022 21:13
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Collaborator

@ericharper ericharper left a comment

LGTM. Thanks!

@ericharper ericharper merged commit c2ec807 into r1.8.0 Apr 5, 2022
@ericharper ericharper deleted the duplex_fix branch April 5, 2022 15:47
ericharper pushed a commit that referenced this pull request Apr 8, 2022
…3932)

* fix save_best missing chpt bug, update for setup_tokenizer() changes

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* style fix

Signed-off-by: ekmb <ebakhturina@nvidia.com>
titu1994 added a commit that referenced this pull request Apr 9, 2022
* update version

Signed-off-by: ericharper <complex451@gmail.com>

* Stateless timer fix for PTL 1.6 (#3925)

* Stateless timer fix for PTL 1.6

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Stateless timer PTL test

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix year

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Remove unused imports

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* GPU test

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* clean import

Signed-off-by: ericharper <complex451@gmail.com>

Co-authored-by: ericharper <complex451@gmail.com>

* fix save_best missing chpt bug, update for setup_tokenizer() changes (#3932)

* fix save_best missing chpt bug, update for setup_tokenizer() changes

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* style fix

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* Fix divide by world size (#3941)

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* remove old doc (#3946)

Signed-off-by: ekmb <ebakhturina@nvidia.com>

* Fix issues with librosa deprecations (#3950)

Signed-off-by: smajumdar <titu1994@gmail.com>

* Fix issue with Segfault in ASR models (#3956)

* Fix issue with Segfault in ASR models

Signed-off-by: smajumdar <titu1994@gmail.com>

* Add docstring

Signed-off-by: smajumdar <titu1994@gmail.com>

* Fix notebook bugs for branch r1.8.0 (#3948)

* load the model from ngc

Signed-off-by: Yi Dong <doyend@gmail.com>

* fix all biomegatron notebook

Signed-off-by: Yi Dong <doyend@gmail.com>

* fix the typos

Signed-off-by: Yi Dong <doyend@gmail.com>

* remove output

Signed-off-by: Yi Dong <doyend@gmail.com>

* fix isort

Signed-off-by: Yi Dong <doyend@gmail.com>

* fix merge error

Signed-off-by: Yi Dong <doyend@gmail.com>

* change ntpath for isort workaround

Signed-off-by: Yi Dong <doyend@gmail.com>

* fix unit test

Signed-off-by: Yi Dong <doyend@gmail.com>

* fix ci

Signed-off-by: Yi Dong <doyend@gmail.com>

* fix ci bert pretraining

Signed-off-by: Yi Dong <doyend@gmail.com>

* make it compatible with main

Signed-off-by: Yi Dong <doyend@gmail.com>

* add the test for biomegatron ner

Signed-off-by: Yi Dong <doyend@gmail.com>

* fix argument

Signed-off-by: Yi Dong <doyend@gmail.com>

* fix usability issue

Signed-off-by: Yi Dong <doyend@gmail.com>

* work around

Signed-off-by: Yi Dong <doyend@gmail.com>

Co-authored-by: Yi Dong <doyend@gmail.com>
Co-authored-by: Eric Harper <complex451@gmail.com>

* Fix global batch fit loop (#3936)

* add lightning module hooks for global batch

Signed-off-by: ericharper <complex451@gmail.com>

* clean scripts

Signed-off-by: ericharper <complex451@gmail.com>

* style

Signed-off-by: ericharper <complex451@gmail.com>

* remove unused import

Signed-off-by: ericharper <complex451@gmail.com>

* DP=1 fix

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* set num dataset workers to 2

Signed-off-by: ericharper <complex451@gmail.com>

* update validation_loop with GlobalDataFetcher

Signed-off-by: ericharper <complex451@gmail.com>

* add test global data fetcher

Signed-off-by: ericharper <complex451@gmail.com>

* Drop last for test ds

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix test epoch end

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Style

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix eval

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix reconfigure microbatch in the complete method

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* add comments

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Set init consumed samples

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* fix shuffle

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* add save_restore_connector arg

Signed-off-by: ericharper <complex451@gmail.com>

* Fix padding for labels and loss mask

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* GLUE/XNLI CI tests

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* limit val batches in hydra fix

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Restart CI

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Fix unittest

Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

Co-authored-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>

* Update max_epochs on megatron configs (#3958)

* update config

Signed-off-by: ericharper <complex451@gmail.com>

* update config

Signed-off-by: ericharper <complex451@gmail.com>

* update branch

Signed-off-by: ericharper <complex451@gmail.com>

* update version

Signed-off-by: ericharper <complex451@gmail.com>

Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com>
Co-authored-by: Yi Dong <doyend@gmail.com>