ROCm ❤ TensorExpr #45506
Conversation
💊 CI failures summary and remediations (as of commit 71bf92a; more details on the Dr. CI page):
XLA failure: job pytorch_xla_linux_bionic_py3_6_clang9_build is failing. Please create an issue with a title prefixed by 🚧.
1 fixed upstream failure: these were probably caused by upstream breakages that were already fixed. Please rebase on the …
Force-pushed from 22ef336 to e80289b.
Force-pushed from e80289b to aff3290.
@Krovatkin has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@t-vi Yeah, we don't limit the blockDim in the Cuda backend; we just know that (currently) we limit the thread loop to 512 elements. A definite TODO, but for now it might make sense to disable it for ROCm and come back to it.
Doesn't Cuda have bounds, too? I thought 1024 was the block size limit there.
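For context (not code from this PR): CUDA documents a hard ceiling of 1024 threads per block on current hardware, and recent AMD GPUs report the same value through hipDeviceProp_t::maxThreadsPerBlock. A minimal, hypothetical sketch of clamping a thread-loop extent against a named limit rather than a bare magic number (all names here are illustrative, not the TensorExpr API):

```cpp
// Hypothetical sketch: clamp the thread-loop extent to a named device
// limit instead of a hard-coded magic number.
#include <algorithm>
#include <cstdint>

// 1024 is the documented maximum threads per block on current NVIDIA
// GPUs (CUDA C++ Programming Guide, compute-capability tables); recent
// AMD GPUs report the same via hipDeviceProp_t::maxThreadsPerBlock.
static const int64_t kMaxThreadsPerBlock = 1024;

int64_t clampThreadLoopExtent(int64_t requested_extent) {
  // Never ask for a larger block than the device can launch.
  return std::min(requested_extent, kMaxThreadsPerBlock);
}
```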
LGTM in general. Some minor comments.
Please give the constants 128 and 1024 names here, so people know their meaning.
Also, if possible, please include a reference/link to where they come from, so future developers know how to update them. Thanks!
Would describing them more explicitly in a comment work, or do you prefer a #define or some such?
I would prefer a "static const int kBlockSizeLimit..." or something like that. I think having both comments and variable names will be even more helpful for future developers. But it is up to you. Thanks!
I think with the new commentary, it should work. It's a much better comment now; thank you for suggesting that it needed improvement.
What do you think?
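A minimal sketch of what the reviewer's suggestion might look like (names and values are illustrative, not the constants actually landed in the PR):

```cpp
// Illustrative only: the review's magic numbers given names and a
// comment pointing at where the values come from.

// Hard upper bound on threads per block. When updating, consult the
// compute-capability tables in the CUDA C++ Programming Guide (and the
// corresponding ROCm/HIP documentation for AMD targets).
static const int kMaxBlockSize = 1024;

// Default block size used when no better heuristic applies (assumed
// meaning of the constant 128; check the surrounding code).
static const int kDefaultBlockSize = 128;
```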
Force-pushed from a63b39c to f92f089.
Force-pushed from f92f089 to a3dac27.
@Krovatkin has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@Krovatkin merged this pull request in 22a34bc.
This might be an alternative to reverting #45396.
The obvious rough edge is that I'm not really seeing the work group limits that TensorExpr produces.
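One way to surface those limits at runtime is to query the device directly (a standalone sketch using the public HIP API, not code from this PR):

```cpp
// Query the device's actual launch limits instead of assuming a fixed
// constant. hipGetDeviceProperties is part of the public HIP API.
#include <hip/hip_runtime.h>
#include <cstdio>

int main() {
  hipDeviceProp_t prop;
  if (hipGetDeviceProperties(&prop, /*deviceId=*/0) != hipSuccess) {
    std::fprintf(stderr, "failed to query device 0\n");
    return 1;
  }
  std::printf("maxThreadsPerBlock = %d\n", prop.maxThreadsPerBlock);
  std::printf("maxThreadsDim      = (%d, %d, %d)\n",
              prop.maxThreadsDim[0], prop.maxThreadsDim[1],
              prop.maxThreadsDim[2]);
  return 0;
}
```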