Skip to content

FSDP does not work on GLOO backend #74041

@rohan-varma

Description

@rohan-varma

🐛 Describe the bug

It uses _allgather_base, but there is no support for this in Gloo backend:

RuntimeError: no support for _allgather_base in Gloo process group

Versions

main

cc @mrshenli @pritamdamania87 @zhaojuanmao @satgera @gqchen @aazzolini @osalpekar @jiayisuse @H-Huang @kwen2501 @awgu @penguinwu @fegin @pietern @rohan-varma @SciPioneer

Metadata

Metadata

Assignees

Labels

better-engineeringRelatively self-contained tasks for better engineering contributorsmodule: fsdponcall: distributedAdd this issue/PR to distributed oncall triage queuetriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions