Fix bug visualizing 1D Tensor using rich #152871

wangkuiyi · 2025-05-05T21:54:41Z

I didn't fix the bug earlier because the example script didn't exhaustively present all combinations of 1D/2D tensor, 1D/2D mesh, and all possible sharding specs. Therefore, in this PR, I enriched the example script to cover all possible combinations.

cc @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k

pytorch-bot · 2025-05-05T21:54:45Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152871

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 1630ad1 with merge base a769114 ():

NEW FAILURE - The following job has failed:

Lint / lintrunner-noclang / linux-job (gh)
>>> Lint for torch/_inductor/codegen/wrapper.py:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

wangkuiyi · 2025-05-05T21:56:51Z

@pytorchbot label "release notes: distributed (dtensor)"

wangkuiyi · 2025-05-06T05:40:44Z

torch/distributed/tensor/debug/_visualize_sharding.py

-    dtensor_height = shape[0] if len(shape) > 0 else 1
-    dtensor_width = shape[1] if len(shape) > 0 else shape[0]
+    dtensor_height = shape[0]
+    dtensor_width = shape[1] if len(shape) == 2 else 1


Here I am fixing a bug that I created in the previous PR. When the tensor is 1D, consider it a column vector.

wangkuiyi · 2025-05-06T05:43:10Z

torch/distributed/tensor/debug/_visualize_sharding.py

+        )
+        for device_index, (shape, offset) in device_shard_shape_and_offsets.items()
+    }
+


Here I am fixing another bug. When the tensor is 1D, the shape and offset of each shard is a 1-tuple. As we want to draw them in the 2D screen space, we need to extend each 1-tuple into a 2-tuple. In particular, expand the width of shard to 1, and extend the offset on the x-axis of the screen to be 0.

wangkuiyi · 2025-05-06T05:44:23Z

torch/distributed/tensor/examples/visualize_sharding_example.py

 """
 To run the example, use the following command:
-torchrun --standalone --nnodes=1 --nproc-per-node=4 visualize_sharding_example.py
+TERM=xterm-256color torchrun --nproc-per-node=4 visualize_sharding_example.py


The environment variable XTERM controls the terminal's coloring capability. Change the default value to xterm-256color to release the power of your terminal app.

wanchaol

thanks for the fix!

wanchaol · 2025-05-07T06:01:41Z

@pytorchbot merge -f "inductor lint error not related to the PR"

pytorchmergebot · 2025-05-07T06:04:11Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

wangkuiyi · 2025-05-07T16:20:27Z

follow up : #152027

Fix bug visualizing 1D Tensor using rich

b19b8c5

pytorch-bot bot added the oncall: distributed Add this issue/PR to distributed oncall triage queue label May 5, 2025

wangkuiyi marked this pull request as draft May 5, 2025 21:54

pytorch-bot bot added the release notes: distributed (dtensor) release notes category label May 5, 2025

pytorchbot added the open source label May 5, 2025

Exhaustive examples

dea2f0a

wangkuiyi marked this pull request as ready for review May 6, 2025 05:39

wangkuiyi changed the title ~~[WIP] Fix bug visualizing 1D Tensor using rich~~ Fix bug visualizing 1D Tensor using rich May 6, 2025

wangkuiyi commented May 6, 2025

View reviewed changes

Fix lint errors

1630ad1

wanchaol approved these changes May 6, 2025

View reviewed changes

pytorchmergebot added the merging label May 7, 2025

pytorchmergebot added the Merged label May 7, 2025

pytorchmergebot closed this in 93a0a7a May 7, 2025

pytorchmergebot removed the merging label May 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix bug visualizing 1D Tensor using rich #152871

Fix bug visualizing 1D Tensor using rich #152871

Uh oh!

wangkuiyi commented May 5, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented May 5, 2025 •

edited

Loading

Uh oh!

wangkuiyi commented May 5, 2025

Uh oh!

wangkuiyi May 6, 2025

Uh oh!

wangkuiyi May 6, 2025

Uh oh!

wangkuiyi May 6, 2025

Uh oh!

wanchaol left a comment

Uh oh!

wanchaol commented May 7, 2025

Uh oh!

pytorchmergebot commented May 7, 2025

Uh oh!

wangkuiyi commented May 7, 2025

Uh oh!

Uh oh!

Fix bug visualizing 1D Tensor using rich #152871

Fix bug visualizing 1D Tensor using rich #152871

Uh oh!

Conversation

wangkuiyi commented May 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented May 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152871

❌ 1 New Failure

Uh oh!

wangkuiyi commented May 5, 2025

Uh oh!

wangkuiyi May 6, 2025

Choose a reason for hiding this comment

Uh oh!

wangkuiyi May 6, 2025

Choose a reason for hiding this comment

Uh oh!

wangkuiyi May 6, 2025

Choose a reason for hiding this comment

Uh oh!

wanchaol left a comment

Choose a reason for hiding this comment

Uh oh!

wanchaol commented May 7, 2025

Uh oh!

pytorchmergebot commented May 7, 2025

Merge started

Uh oh!

wangkuiyi commented May 7, 2025

Uh oh!

Uh oh!

wangkuiyi commented May 5, 2025 •

edited

Loading

pytorch-bot bot commented May 5, 2025 •

edited

Loading