[UCC] Add pre & post processing for CPU collectives #89030

kirteshpatil · 2022-11-15T01:41:07Z

Summary: The CPU block in collective_post was missing pre & post processing. The reduce-scatter implementaion expects use of pre-processing callback to flatten the input tensors, however, the missing invocation meant grabage values were being passed.

Test Plan: Tested the reduce-scatter collective using PARAM

Reviewed By: eastzone

Differential Revision: D41291592

pytorch-bot · 2022-11-15T01:41:09Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/89030

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 69f2156:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

linux-foundation-easycla · 2022-11-15T01:41:10Z

The committers listed above are authorized under a signed CLA.

✅ login: kirteshpatil / name: kp (a41e0c0)

facebook-github-bot · 2022-11-15T01:42:08Z

This pull request was exported from Phabricator. Differential Revision: D41291592

Summary: Pull Request resolved: pytorch#89030 The CPU block in `collective_post` was missing pre & post processing. The reduce-scatter implementaion expects use of pre-processing callback to flatten the input tensors, however, the missing invocation meant grabage values were being passed. Test Plan: Tested the reduce-scatter collective using PARAM Reviewed By: eastzone Differential Revision: D41291592 fbshipit-source-id: 9881745a04d950e7e169eb4d49aa818cb25488b0

facebook-github-bot · 2022-11-15T01:51:14Z

This pull request was exported from Phabricator. Differential Revision: D41291592

facebook-github-bot · 2022-11-15T07:30:06Z

This pull request was exported from Phabricator. Differential Revision: D41291592

Summary: Pull Request resolved: pytorch#89030 The CPU block in `collective_post` was missing pre & post processing. The reduce-scatter implementaion expects use of pre-processing callback to flatten the input tensors, however, the missing invocation meant grabage values were being passed. Test Plan: Tested the reduce-scatter collective using PARAM Reviewed By: eastzone, kingchc Differential Revision: D41291592 fbshipit-source-id: 2f7b318be6cbcab433070e1c88da3a9f69a187f1

facebook-github-bot · 2022-11-15T20:18:14Z

This pull request was exported from Phabricator. Differential Revision: D41291592

Summary: Pull Request resolved: pytorch#89030 The CPU block in `collective_post` was missing pre & post processing. The reduce-scatter implementaion expects use of pre-processing callback to flatten the input tensors, however, the missing invocation meant grabage values were being passed. Test Plan: Tested the reduce-scatter collective using PARAM Reviewed By: eastzone, kingchc Differential Revision: D41291592 fbshipit-source-id: 6f6e29e5585db43b87740ae35e1a7c24781f08e1

kwen2501

LGTM.

kit1980 · 2022-11-15T23:13:55Z

/easycla

kirteshpatil · 2022-11-16T01:10:46Z

@pytorchbot merge

pytorchmergebot · 2022-11-16T01:13:05Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2022-11-16T02:03:33Z

Merge failed

Reason: 2 additional jobs have failed, first few of them are: trunk ,trunk / macos-12-py3-x86-64 / test (default, 2, 2, macos-12)

Details for Dev Infra team

Raised by workflow job

Summary: Pull Request resolved: pytorch#89030 The CPU block in `collective_post` was missing pre & post processing. The reduce-scatter implementaion expects use of pre-processing callback to flatten the input tensors, however, the missing invocation meant grabage values were being passed. Test Plan: Tested the reduce-scatter collective using PARAM Reviewed By: eastzone, kingchc Differential Revision: D41291592 fbshipit-source-id: c8e7a639d78039951d85e33509ac141934a8c837

facebook-github-bot · 2022-11-16T07:21:22Z

This pull request was exported from Phabricator. Differential Revision: D41291592

kirteshpatil · 2022-11-16T16:37:58Z

@pytorchbot merge

pytorchmergebot · 2022-11-16T16:40:19Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Summary: The CPU block in `collective_post` was missing pre & post processing. The reduce-scatter implementaion expects use of pre-processing callback to flatten the input tensors, however, the missing invocation meant grabage values were being passed. Test Plan: Tested the reduce-scatter collective using PARAM Reviewed By: eastzone Differential Revision: D41291592 Pull Request resolved: pytorch#89030 Approved by: https://github.com/kingchc, https://github.com/kwen2501

kirteshpatil requested review from mrshenli, zhaojuanmao, pritamdamania87, rohan-varma, H-Huang, awgu and kwen2501 as code owners November 15, 2022 01:41

pytorch-bot bot added the release notes: distributed (c10d) release notes category label Nov 15, 2022

facebook-github-bot added the fb-exported label Nov 15, 2022

kirteshpatil force-pushed the export-D41291592 branch from db84c37 to cb7f05b Compare November 15, 2022 01:51

kirteshpatil force-pushed the export-D41291592 branch from cb7f05b to c605a42 Compare November 15, 2022 07:30

kirteshpatil force-pushed the export-D41291592 branch from c605a42 to a41e0c0 Compare November 15, 2022 20:18

kingchc approved these changes Nov 15, 2022

View reviewed changes

kwen2501 approved these changes Nov 15, 2022

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 16, 2022

kirteshpatil force-pushed the export-D41291592 branch from a41e0c0 to 69f2156 Compare November 16, 2022 07:21

pytorchmergebot added the Merged label Nov 16, 2022

pytorchmergebot closed this in fe276ea Nov 16, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[UCC] Add pre & post processing for CPU collectives #89030

[UCC] Add pre & post processing for CPU collectives #89030

kirteshpatil commented Nov 15, 2022

pytorch-bot bot commented Nov 15, 2022 •

edited

linux-foundation-easycla bot commented Nov 15, 2022 •

edited

facebook-github-bot commented Nov 15, 2022

facebook-github-bot commented Nov 15, 2022

facebook-github-bot commented Nov 15, 2022

facebook-github-bot commented Nov 15, 2022

kwen2501 left a comment

kit1980 commented Nov 15, 2022

kirteshpatil commented Nov 16, 2022

pytorchmergebot commented Nov 16, 2022

pytorchmergebot commented Nov 16, 2022

facebook-github-bot commented Nov 16, 2022

kirteshpatil commented Nov 16, 2022

pytorchmergebot commented Nov 16, 2022

[UCC] Add pre & post processing for CPU collectives #89030

[UCC] Add pre & post processing for CPU collectives #89030

Conversation

kirteshpatil commented Nov 15, 2022

pytorch-bot bot commented Nov 15, 2022 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/89030

✅ No Failures

linux-foundation-easycla bot commented Nov 15, 2022 • edited

facebook-github-bot commented Nov 15, 2022

facebook-github-bot commented Nov 15, 2022

facebook-github-bot commented Nov 15, 2022

facebook-github-bot commented Nov 15, 2022

kwen2501 left a comment

Choose a reason for hiding this comment

kit1980 commented Nov 15, 2022

kirteshpatil commented Nov 16, 2022

pytorchmergebot commented Nov 16, 2022

Merge started

pytorchmergebot commented Nov 16, 2022

Merge failed

facebook-github-bot commented Nov 16, 2022

kirteshpatil commented Nov 16, 2022

pytorchmergebot commented Nov 16, 2022

Merge started

pytorch-bot bot commented Nov 15, 2022 •

edited

linux-foundation-easycla bot commented Nov 15, 2022 •

edited