Setup rccl for PyTorch #309
|
@pytorchbot retest this please |
|
@iotamudelta unrelated issue, but thought I'd bring it to your attention: the caffe2 test build failed with segfault for |
|
@jithunnair-amd yeah, it's been giving us trouble. I'd like to understand why all of these tests suddenly have issues. @lcskrishna could you have a look into this? thanks! |
|
@pytorchbot retest this please |
This looks like a merge conflict gone wrong. I think they removed mkldnn.py upstream. |
|
@jithunnair-amd yes, I needed to do some merging after the last IFU in this PR - please do check. |
|
@iotamudelta At this point, I think all the changes needed to integrate rccl into the "c10d->rccl" path are in this PR. Once the CI docker image is updated to include the rccl package, please let me know and I can make it a REQUIRED dependency in the CMake configuration. |
Summary: Pull Request resolved: pytorch/translate#309 Pull Request resolved: pytorch#16481 This gives us a boolean flag `quantize` on the `BeamSearch` module that allows us to apply FBGEMM quantization to a pretrained PyTorch model and export this to PyTorch native runtime. Reviewed By: jmp84 Differential Revision: D13514776 fbshipit-source-id: 3f7cbff0782aae54c9623ad1ea7e66d7f49e2b32
|
@jithunnair-amd if you can resolve the conflicts, this should now be mergeable with the CI update :-) |
|
I decided to reapply the changes at the top of tree instead of resolving conflicts, since they changed a few setup files. I'll file a new PR once they respond to my query about a recent change that causes a compilation issue when building the distributed library on ROCm.
(Emailed reply to @iotamudelta's comment of Saturday, February 9, 2019.)
|
|
@jithunnair-amd IFU completed. |
|
Reworked changes in #347, so closing this PR |