New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update builder.py #5249
Update builder.py #5249
Conversation
Fix deepspeed2 with ROCm
@microsoft-github-policy-service agree |
@ehartford, thanks for the PR. Can you please share a bit more about the issue that this fixes? |
Ubuntu Server 20.04
DeepSpeed Zero1 was working but DeepSpeed Zero2 wasn't working.
So, I delete DeepSpeed and install manually from source. I set environment variables like this:
Then when I try to do
So I asked Claude Opus to fix it, and it suggested the change in this PR. After I made that change, then I was able to compile and install. And then, DeepSpeed Zero2 was working after that. |
this doesn't solve the issue imo, it just makes it more confusing, calling a cuda specific function ( and to truly solve this issue, what's needed is a |
ok. |
@ehartford, thanks for sharing these details. I am glad that you are unblocked. Could you please create ticket for this issue with the details above? That would be very helpful for our investigation. Thanks! |
Fix deepspeed2 with ROCm