-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix missing parallelisms #9725
Fix missing parallelisms #9725
Conversation
b3528ea
to
fc57e53
Compare
fc57e53
to
47927b5
Compare
Signed-off-by: Maanu Grover <maanug@nvidia.com>
47927b5
to
999efcc
Compare
fixed base branch and target branch |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thanks!
Signed-off-by: Maanu Grover <maanug@nvidia.com>
48c5346
to
047421f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Re-approving.
* pass cp and ep cfg to init mp Signed-off-by: Maanu Grover <maanug@nvidia.com> * update test Signed-off-by: Maanu Grover <maanug@nvidia.com> --------- Signed-off-by: Maanu Grover <maanug@nvidia.com>
I'd like to add this to main, but instead of manual copy use a function to copy all config attributes that end with _paralle_size. Made main...akoumparouli/copy_parallel_size for this purpose. |
* pass cp and ep cfg to init mp Signed-off-by: Maanu Grover <maanug@nvidia.com> * update test Signed-off-by: Maanu Grover <maanug@nvidia.com> --------- Signed-off-by: Maanu Grover <maanug@nvidia.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
* pass cp and ep cfg to init mp Signed-off-by: Maanu Grover <maanug@nvidia.com> * update test Signed-off-by: Maanu Grover <maanug@nvidia.com> --------- Signed-off-by: Maanu Grover <maanug@nvidia.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
* pass cp and ep cfg to init mp Signed-off-by: Maanu Grover <maanug@nvidia.com> * update test Signed-off-by: Maanu Grover <maanug@nvidia.com> --------- Signed-off-by: Maanu Grover <maanug@nvidia.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
* pass cp and ep cfg to init mp * update test --------- Signed-off-by: Maanu Grover <maanug@nvidia.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Co-authored-by: Maanu Grover <109391026+maanug-nv@users.noreply.github.com> Signed-off-by: Boxiang Wang <boxiangw@nvidia.com>
* pass cp and ep cfg to init mp * update test --------- Signed-off-by: Maanu Grover <maanug@nvidia.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Co-authored-by: Maanu Grover <109391026+maanug-nv@users.noreply.github.com> Signed-off-by: Vivian Chen <xuanzic@example.com>
What does this PR do ?
CP and EP config values were not being passed to model parallel initialization methods.
Collection: [llm]
Changelog
Usage
# Add a code snippet demonstrating how to use this
GitHub Actions CI
The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.
The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information