Skip to content

Conversation

@tushar00jain
Copy link
Contributor

Summary: pass full allreduce options to the pg allreduce to avoid the watchdog abort from getting triggered

Differential Revision: D84101243

Summary:

- update the callback to work with the new ManagedWork
- provide an option to use bucketization using env var

Differential Revision: D84101245
Summary: pass full allreduce options to the pg allreduce to avoid the watchdog abort from getting triggered

Differential Revision: D84101243
@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 7, 2025
@meta-codesync
Copy link

meta-codesync bot commented Oct 7, 2025

@tushar00jain has exported this pull request. If you are a Meta employee, you can view the originating Diff in D84101243.

tushar00jain added a commit to tushar00jain/torchft that referenced this pull request Oct 7, 2025
Summary:

pass full allreduce options to the pg allreduce to avoid the watchdog abort from getting triggered

Differential Revision: D84101243
@meta-codesync meta-codesync bot closed this in 8521e3d Oct 8, 2025
@meta-codesync
Copy link

meta-codesync bot commented Oct 8, 2025

This pull request has been merged in 8521e3d.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot. fb-exported Merged meta-exported

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants