-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
LM adapted T5 dataset #3654
LM adapted T5 dataset #3654
Conversation
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
This pull request introduces 2 alerts when merging b255e58 into 058fa38 - view on LGTM.com new alerts:
|
This pull request introduces 2 alerts when merging 99fabbb into 461a866 - view on LGTM.com new alerts:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Very useful PR! Looks good. See minor comments below.
nemo/collections/nlp/data/language_modeling/megatron/lm_adapted_t5_dataset.py
Show resolved
Hide resolved
nemo/collections/nlp/data/language_modeling/megatron/lm_adapted_t5_dataset.py
Show resolved
Hide resolved
nemo/collections/nlp/data/language_modeling/megatron/lm_adapted_t5_dataset.py
Show resolved
Hide resolved
nemo/collections/nlp/data/language_modeling/megatron/lm_adapted_t5_dataset.py
Outdated
Show resolved
Hide resolved
nemo/collections/nlp/data/language_modeling/megatron/misc_utils.py
Outdated
Show resolved
Hide resolved
nemo/collections/nlp/data/language_modeling/megatron/misc_utils.py
Outdated
Show resolved
Hide resolved
nemo/collections/nlp/data/language_modeling/megatron/misc_utils.py
Outdated
Show resolved
Hide resolved
nemo/collections/nlp/data/language_modeling/megatron/misc_utils.py
Outdated
Show resolved
Hide resolved
nemo/collections/nlp/models/language_modeling/megatron_t5_model.py
Outdated
Show resolved
Hide resolved
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
This pull request introduces 2 alerts when merging 377342c into 6a517f0 - view on LGTM.com new alerts:
|
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
This pull request introduces 2 alerts when merging a826ac7 into 6a517f0 - view on LGTM.com new alerts:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for all the updates. LGTM!
This pull request introduces 2 alerts when merging 0f1579d into fbbfb08 - view on LGTM.com new alerts:
|
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
This pull request introduces 2 alerts when merging 9d4c3aa into aeeb0d2 - view on LGTM.com new alerts:
|
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
This pull request introduces 2 alerts when merging fd8c672 into 277b088 - view on LGTM.com new alerts:
|
This pull request introduces 2 alerts when merging 69acc37 into b466ebc - view on LGTM.com new alerts:
|
This pull request introduces 2 alerts when merging 7d1626f into b5012d0 - view on LGTM.com new alerts:
|
This pull request introduces 2 alerts when merging 5da9724 into 7231aca - view on LGTM.com new alerts:
|
@@ -106,6 +106,7 @@ model: | |||
dataloader_type: single # cyclic | |||
masked_lm_prob: 0.15 | |||
short_seq_prob: 0.1 | |||
dataset_type: 't5' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe add possible types here in a comment like # t5, ...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thanks!
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
This pull request introduces 2 alerts when merging acf4909 into 2ebca22 - view on LGTM.com new alerts:
|
This pull request introduces 2 alerts when merging 77ae69f into 8ffc92e - view on LGTM.com new alerts:
|
nemo/collections/nlp/data/language_modeling/megatron/dataset_utils.py
Outdated
Show resolved
Hide resolved
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
This pull request introduces 3 alerts when merging ed3be60 into 37fe5b4 - view on LGTM.com new alerts:
|
This pull request introduces 3 alerts when merging 8d026b1 into a8f29af - view on LGTM.com new alerts:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
* LM adapted T5 dataset Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style fixes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * File renaming Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * change assert to raising valueerror Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style fixes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Printing changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style fixes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out ICT dataset Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: MaximumEntropy sandeep.subramanian.1@umontreal.ca
What does this PR do ?
Adds the ability to use the prefix-LM objective for T5 which reuses the GPT dataset class that already exists.
Collection: NLP
Changelog
Usage
Add the
dataset_type
flag to eithert5
ort5_prefix_lm
inmegatron_t5_config.yaml
.Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information