[DTensor] implement dist_cat as a sharding prop rule #92677

Closed
wants to merge 5 commits

Conversation

@XilunWu (Contributor) commented Jan 20, 2023
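As an illustrative sketch (not taken from this PR), the rule enables calling torch.cat directly on DTensor inputs, with the output sharding propagated from the inputs. The mesh setup, shapes, and launch details below are assumptions based on the torch.distributed._tensor API of this era:

```python
# Illustrative sketch only: torch.cat on DTensor inputs. Assumes the
# torch.distributed._tensor API of early 2023 and an already-initialized
# process group (e.g. launched via torchrun); shapes and mesh are made up.
import torch
import torch.distributed as dist
from torch.distributed._tensor import DeviceMesh, Shard, distribute_tensor

mesh = DeviceMesh("cpu", list(range(dist.get_world_size())))

a = distribute_tensor(torch.randn(4, 8), mesh, placements=[Shard(0)])
b = distribute_tensor(torch.randn(4, 8), mesh, placements=[Shard(0)])

# With a sharding propagation rule registered for cat, the result stays a
# DTensor and its placements are inferred from the inputs.
out = torch.cat([a, b], dim=1)
print(out.placements, out.shape)  # expect Shard(0) placements, global shape (4, 16)
```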

@pytorch-bot bot commented Jan 20, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/92677

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 3ea6b8c:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

XilunWu added a commit that referenced this pull request Jan 20, 2023
ghstack-source-id: a27c5a3dffcb7b29dc8d22de45013a199f2ffb00
Pull Request resolved: #92677
@XilunWu added the "release notes: distributed (dtensor)" label Jan 20, 2023
XilunWu added a commit that referenced this pull request Jan 20, 2023
ghstack-source-id: d4cbf117b4e630d80fe39229251d73724fae55cc
Pull Request resolved: #92677
@wanchaol (Contributor) left a comment

See the comments inlined; we should make sure the xfail of cat is removed so that it passes all the possible cases.

Inline review comments on:
test/distributed/_tensor/test_dtensor_ops.py
test/distributed/_tensor/test_tensor_ops.py (outdated)
torch/distributed/_tensor/ops/tensor_ops.py (five threads, outdated)
return output_sharding


def _update_schema_suggestion_for_cat(
A reviewer (Contributor) commented on this code:

Can you tell me what exactly this function is doing? It looks like a lot of duplicate logic with the rule itself, and I am not quite sure what this function is used for.

@XilunWu (Contributor, Author) replied:
einop_rule expects the op_schema argument to have its args_schema in the form [DTensorSpec, DTensorSpec, ...], but when it is passed into cat_rule the schema is actually [List[DTensorSpec]]. That is why I convert the args_schema at the beginning of cat_rule (https://github.com/pytorch/pytorch/pull/92677/files#diff-ebc7be1151cf411ce7edf46c4ca1cabb74cd953a2bdf47e04b4cc733c31f6085R492) before feeding it into einop_rule, and why we need to convert it back here if a schema_suggestion is present.
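In pseudocode, the conversion described above looks roughly like this (hypothetical helper names, simplified away from the actual tensor_ops.py code):

```python
# Hypothetical sketch of the args_schema conversion described above.
# cat_rule receives args like ([spec_0, spec_1, ...], dim); einop_rule
# expects the specs spread out positionally: (spec_0, spec_1, ..., dim).
def flatten_cat_args_schema(args_schema):
    specs, *rest = args_schema              # specs: List[DTensorSpec]
    return (*specs, *rest), len(specs)

def unflatten_cat_suggestion(suggested_args, num_specs):
    # Convert a schema suggestion back into the original cat form:
    # one list of specs followed by the remaining (non-tensor) args.
    return (list(suggested_args[:num_specs]), *suggested_args[num_specs:])
```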

torch/distributed/_tensor/ops/tensor_ops.py (outdated thread)
XilunWu added a commit that referenced this pull request Jan 26, 2023
ghstack-source-id: 8ec685e5998e48321a16a752d2b1a7c5a6c84ed4
Pull Request resolved: #92677
XilunWu added a commit that referenced this pull request Jan 26, 2023
ghstack-source-id: f536f348cc4ed8e049eb9bdf1462415c18a89839
Pull Request resolved: #92677
@wanchaol (Contributor) left a comment

LGTM, thanks for working on it! Left a couple of suggestions and some questions.

torch/distributed/_tensor/ops/tensor_ops.py (three inline threads; two outdated)
        # Insert this input's unique char at the cat dim among the shared
        # free-dim chars, e.g. free_dim "jk" with dim 0 gives "ajk".
        dim_word = free_dim[:dim] + alphabet[i] + free_dim[dim:]
        einop_notation_list.append(dim_word)
    else:
        # Input with fewer dims (e.g. an empty 1-D tensor): annotate it with
        # only its unique char so the cat dim still shows up in an input.
        einop_notation_list.append(alphabet[i])
A reviewer (Contributor) commented on this code:

Is this the empty tensor annotation, where it has a single char?

@XilunWu (Contributor, Author) replied Jan 26, 2023:

It is not only for empty tensors, but for any input tensor whose ndim is smaller than that of the other inputs. This covers cases like concatenating Tensor([], shape=torch.Size([0])) with Tensor([[1, 2], [3, 4]], shape=torch.Size([2, 2])).

In this case an empty annotation might still work, but we want to ensure that the dim char for cat_dim in the output tensor's annotation also appears in the inputs. Adding each input tensor's cat_dim char to its annotation guarantees that.
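For reference, plain torch accepts that case (behavior as of the PyTorch of this era); the snippet below is only illustrative and is not part of this PR:

```python
# Plain torch reproduction of the case described above: concatenating an
# empty 1-D tensor with a 2x2 tensor along dim 0.
import torch

empty = torch.tensor([])                       # torch.Size([0]), ndim 1
full = torch.tensor([[1.0, 2.0], [3.0, 4.0]])  # torch.Size([2, 2]), ndim 2

out = torch.cat([empty, full], dim=0)
print(out.shape)  # torch.Size([2, 2]); the empty input contributes nothing
```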

@XilunWu added the ciflow/trunk label Jan 26, 2023
XilunWu added a commit that referenced this pull request Jan 26, 2023
ghstack-source-id: 4c2f6291dbcdad2b1674b3db36dc46a21a9159ce
Pull Request resolved: #92677
@XilunWu (Contributor, Author) commented Jan 27, 2023:

@pytorchmergebot merge -g

@pytorchmergebot (Collaborator) commented:

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

@XilunWu deleted the gh/XilunWu/12/head branch April 11, 2023 21:40
Labels: ciflow/trunk, Merged, release notes: distributed (dtensor)

3 participants