Skip to content

Conversation

@jammm
Copy link
Contributor

@jammm jammm commented Aug 7, 2025

Helps fix imports for transformers lib which seems to import torch.distributed. ROCm/TheRock#1202
It may or may not help in proper WIndows support for torch.distributed (since RCCL isn't on Windows atm), but it does let us build pytorch w/ USE_DISTRIBUTED=ON, which helps fix import errors like the above, so users on their single GPU systems can run comfyui without facing import errors.

Tested on Windows w/ PyTorch main branch via. TheRock.

@meta-cla meta-cla bot added the CLA Signed label Aug 7, 2025
@jammm jammm changed the title [ROCm] Support GLOO on Windows [ROCm] Support building PyTorch with GLOO on Windows Aug 7, 2025
@Nem404
Copy link

Nem404 commented Aug 14, 2025

@jammm sorry if I’m being impatient, but since this PR was approved, could it be merged?

@jammm
Copy link
Contributor Author

jammm commented Aug 14, 2025

@jammm sorry if I’m being impatient, but since this PR was approved, could it be merged?

It's not up to me to merge this unfortunately. The gloo maintainers need to merge it. It's my first PR here actually, so not sure of the exact flow.

@Nem404
Copy link

Nem404 commented Aug 14, 2025

It's not up to me to merge this unfortunately. The gloo maintainers need to merge it. It's my first PR here actually, so not sure of the exact flow.

Oh, I didn’t know that. Np, then we’ll wait for them.
I just thought anything is welcome that could help with: ROCm/TheRock#1202

Copy link
Member

@d4l3k d4l3k left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, sorry for the delay!

If you're on the PyTorch slack feel free to ping me directly there for faster reviews

@d4l3k
Copy link
Member

d4l3k commented Aug 21, 2025

@jammm @Nem404 we don't currently have any ROCm CI builds for Gloo. If you're interested in contributing one that would make it a lot easier to prevent regressions like this in the future

@ScottTodd
Copy link

@jammm @Nem404 we don't currently have any ROCm CI builds for Gloo. If you're interested in contributing one that would make it a lot easier to prevent regressions like this in the future

We're working on contributing Windows ROCm CI builds with pytorch/pytorch#159520 and could enable USE_GLOO there too (I have it disabled in the scripts I'm starting at pytorch/pytorch#160776 right now).

@d4l3k d4l3k merged commit 1b4337a into pytorch:main Aug 21, 2025
8 checks passed
@d4l3k
Copy link
Member

d4l3k commented Aug 21, 2025

@ScottTodd that sounds reasonable to me but we should also add something to this repo. PyTorch pins a commit from Gloo so it's always trailing. We don't run PT CI on Gloo repo, only run PT CI when bumping the Gloo pinned commit

@Nem404
Copy link

Nem404 commented Aug 21, 2025

Eyy good to see things evolving here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants