New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can't pip install horovod for rocm 5.0+ #3537
Comments
Any update on this? |
Hi @xiaoyu-work, there's been an AMD-compatibility commit to Horovod master recently (via PR #3486), but I'm not sure if it covers your problem. I believe you are also supposed to set the environment variable So you could give this one a shot: |
Hi @maxhgerlach, Thanks for your reply. No, it doesn't work. I tried |
Thanks for checking. Maybe the mechanism at horovod/horovod/torch/CMakeLists.txt Line 30 in 1b3452f
@weihanmines, is the build with PyTorch and ROCM also something you are going to look into perspectively? |
Hi @maxhgerlach , |
Hi @xiaoyu-work, would you mind sharing your commands for installation and envs? |
Hi @weihanmines, my env is a little bit complex, but you can repro this issue by:
|
@xiaoyu-work, there's a proposed fix in PR #3588. If you like, you could test if it fixes your problem. I think
should work to install Horovod from that branch. |
One thing to note: I think it's also necessary to define |
@maxhgerlach @ronakmal Thanks for the help! That PR works on ROCm 5.0+! |
Great, thanks for confirming that! |
Environment:
Checklist:
Bug report:
When I "pip install horovod" for rocm 5.0.1 and rocm 5.1.1, got error:
Stacktrace:
Error as above. I tried ROCM 5.0.1 and ROCM 5.1.1, and both failed.
Can you please take a look?
Thanks
The text was updated successfully, but these errors were encountered: