
Conversation

kwen2501 (Contributor) commented Dec 7, 2024

Stack from ghstack (oldest at bottom):

Locally shards a full tensor based on the indicated sharding arrangement, and returns a DTensor containing the local shard.

warning: This is a private API intended to skip the communication otherwise required by `distribute_tensor`. It is only applicable when all ranks hold the same `full_tensor`.

cc @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @c-p-i-o @tianyu-l @XilunWu
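
A hedged usage sketch of the scenario this targets, for context: every rank already holds the same full tensor, so each rank can slice out its own shard and wrap it with DTensor.from_local, skipping the scatter that distribute_tensor performs. The sketch uses only public DTensor APIs (the private helper's name is not assumed here) and assumes the world size evenly divides the sharded dimension.

# demo.py -- run with e.g. `torchrun --nproc-per-node=2 demo.py`
import torch
import torch.distributed as dist
from torch.distributed.device_mesh import init_device_mesh
from torch.distributed.tensor import DTensor, Shard, distribute_tensor

dist.init_process_group("gloo")
world = dist.get_world_size()
mesh = init_device_mesh("cpu", (world,))

# Precondition from the warning above: every rank builds the identical full tensor.
full_tensor = torch.arange(world * 2 * 4, dtype=torch.float32).reshape(world * 2, 4)

# Standard path: distribute_tensor shards the tensor and communicates the shards.
dt_comm = distribute_tensor(full_tensor, mesh, [Shard(0)])

# Local path: each rank slices its own rows and wraps them, with no communication.
rows_per_rank = full_tensor.shape[0] // world
start = dist.get_rank() * rows_per_rank
dt_local = DTensor.from_local(full_tensor[start : start + rows_per_rank], mesh, [Shard(0)])

assert torch.equal(dt_comm.to_local(), dt_local.to_local())
dist.destroy_process_group()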


pytorch-bot bot commented Dec 7, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/142288

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit fb3b2a1 with merge base 61dc5e9:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

pytorch-bot bot added the oncall: distributed label Dec 7, 2024
kwen2501 requested a review from wz337 December 7, 2024 01:38
kwen2501 added the release notes: distributed (dtensor) label Dec 7, 2024
wz337 (Contributor) left a comment

Stamped to unblock.

wz337 added the topic: not user facing and module: dtensor labels and removed the release notes: distributed (dtensor) label Dec 7, 2024
Locally shards a full tensor based on indicated sharding arrangement, and
returns a DTensor containing the local shard.
.. warning:: This is a private API purposed to skip the communication
Contributor

Could we say "This is a private API that is subject to change. It is ..."?

kwen2501 (Contributor Author)

Thanks, added now.

for cur_shape, cur_offset in zip(shape, offset)
]
local_tensor = full_tensor[slices]
return DTensor.from_local(local_tensor, device_mesh, placements)
Contributor

You will need to pass the shape and stride to from_local for an uneven tensor. Otherwise, the shape and stride would be inferred from the local tensor as if it were uniformly distributed. See example: https://github.com/pytorch/pytorch/blob/main/torch/distributed/_state_dict_utils.py#L566-L572
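
For illustration, a minimal sketch of the pattern being suggested, with made-up sizes and a 2-rank mesh as assumptions: when the sharding is uneven, the global shape and stride cannot be recovered from the local shard alone, so they are passed to from_local explicitly.

import torch
import torch.distributed as dist
from torch.distributed.device_mesh import init_device_mesh
from torch.distributed.tensor import DTensor, Shard

dist.init_process_group("gloo")
mesh = init_device_mesh("cpu", (2,))  # assumes exactly 2 ranks

# Uneven sharding: 5 global rows over 2 ranks -> 3 rows on rank 0, 2 on rank 1.
local_rows = 3 if dist.get_rank() == 0 else 2
local_tensor = torch.randn(local_rows, 4)

# Inferring the global shape as local_rows * world_size would be wrong here,
# so the global shape and stride are supplied explicitly (keyword-only args).
dt = DTensor.from_local(
    local_tensor,
    mesh,
    [Shard(0)],
    shape=torch.Size([5, 4]),
    stride=(4, 1),
)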

kwen2501 (Contributor Author)

Thanks for the heads-up. I don't have the shape or stride info here, and this function assumes the full tensor is the same across ranks, so the inferred shape and stride will be fine.
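
Putting the thread together, a hedged sketch of what the whole helper might look like, consistent with the excerpt above; the function name and the import path of compute_local_shape_and_global_offset are assumptions rather than the PR's exact code.

from torch.distributed.tensor import DTensor
# Assumed location of this utility; the module path may differ across versions.
from torch.distributed.tensor._utils import compute_local_shape_and_global_offset


def _shard_full_tensor_locally(full_tensor, device_mesh, placements):
    # Compute this rank's local shard shape and its offset into the full tensor.
    shape, offset = compute_local_shape_and_global_offset(
        full_tensor.shape, device_mesh, placements
    )
    # Slice the local shard out of the (rank-identical) full tensor; no collectives.
    slices = tuple(
        slice(cur_offset, cur_offset + cur_shape)
        for cur_shape, cur_offset in zip(shape, offset)
    )
    local_tensor = full_tensor[slices]
    # Shape and stride are inferred by from_local, which is safe only because
    # every rank holds the same full_tensor (the precondition in the warning).
    return DTensor.from_local(local_tensor, device_mesh, placements)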

kwen2501 added a commit that referenced this pull request Dec 7, 2024
ghstack-source-id: fa35b0e
Pull Request resolved: #142288
kwen2501 (Contributor Author) commented Dec 7, 2024

@pytorchbot merge -f "CI was green; minor edits to comments"

pytorchmergebot (Collaborator)

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as a last resort and instead consider -i/--ignore-current to continue the merge while ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

AmdSampsa pushed a commit to AmdSampsa/pytorch that referenced this pull request Dec 9, 2024

Pull Request resolved: pytorch#142288
Approved by: https://github.com/wz337
github-actions bot deleted the gh/kwen2501/112/head branch January 7, 2025 02:05