
Inductor cpp wrapper: fix dtype of ShapeAsConstantBuffer #122297

Closed
wants to merge 3 commits

Conversation

chunyuan-w
Collaborator

@chunyuan-w chunyuan-w commented Mar 20, 2024

Stack from ghstack (oldest at bottom):

For `at::scalar_tensor`, the default dtype is `float` ([link to scalar_tensor](https://github.com/pytorch/pytorch/blob/0d8e960f74acd359358e0b729c4803d2b71849e5/aten/src/ATen/native/TensorFactories.cpp#L856), [link to default dtype](https://github.com/pytorch/pytorch/blob/0d8e960f74acd359358e0b729c4803d2b71849e5/c10/core/TensorOptions.h#L551)) if we don't set the `dtype` value. However, the input scalar value is not necessarily a `float` value. With `torch::tensor(x)`, the dtype of the tensor is decided according to the dtype of the scalar.
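The difference can be illustrated with a small self-contained Python sketch. This is a hypothetical model of the two C++ factory functions, not the actual ATen implementation: `at::scalar_tensor` falls back to the global default dtype when none is given, while `torch::tensor` infers the dtype from the scalar itself.

```python
# Illustrative model (hypothetical, NOT the ATen implementation) of how the
# two factory functions pick a dtype for a scalar input.

def scalar_tensor_dtype(value, dtype=None):
    # at::scalar_tensor: with no explicit dtype, falls back to the
    # global default dtype, which is float.
    return dtype if dtype is not None else "float32"

def tensor_dtype(value):
    # torch::tensor: the dtype follows the type of the scalar itself.
    if isinstance(value, bool):
        return "bool"
    if isinstance(value, int):
        return "int64"
    return "float32"

print(scalar_tensor_dtype(2))  # float32 -- the int scalar silently becomes float
print(tensor_dtype(2))         # int64   -- the dtype matches the scalar
```

An int scalar thus comes out as a float tensor from the first path but keeps an integer dtype through the second, which is the mismatch this PR fixes.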

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler @amjames @desertfire @chauhang


pytorch-bot bot commented Mar 20, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/122297

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit ab42345 with merge base 6502c88 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

chunyuan-w added a commit that referenced this pull request Mar 21, 2024
ghstack-source-id: e19c495456e5649156745577281628076878e4d9
Pull Request resolved: #122297
@chunyuan-w chunyuan-w marked this pull request as ready for review March 22, 2024 06:29
@chunyuan-w chunyuan-w requested a review from jgong5 March 22, 2024 06:29
test/inductor/test_torchinductor.py (review comment, outdated, resolved)
chunyuan-w added a commit that referenced this pull request Mar 25, 2024
ghstack-source-id: 42a8a0537bff727c38f82af6e0aebe0707e5e09b
Pull Request resolved: #122297
@chunyuan-w
Collaborator Author

@desertfire I just noticed that #118024 changed `torch::tensor` to `at::scalar_tensor`. May I know if there's a specific reason for this? I found that with `at::scalar_tensor`, the scalar type of the tensor may be inconsistent with that of the scalar value, which causes regressions in several models in #122292.

@desertfire
Contributor

> @desertfire I just noticed that #118024 changed torch::tensor to at::scalar_tensor. May I know if there's a specific reason for this? I found that with at::scalar_tensor, the scalar type of the tensor might be inconsistent with the scalar value and will cause regression issue of several models in #122292.

It is related to https://github.com/pytorch/pytorch/pull/118024/files/bba213c151f2c8e7a29635d273a73a6c98d24393#r1463364631. at::scalar_tensor is more accurate in this use case. Is it a dtype propagation problem here?

@chunyuan-w
Collaborator Author

> @desertfire I just noticed that #118024 changed torch::tensor to at::scalar_tensor. May I know if there's a specific reason for this? I found that with at::scalar_tensor, the scalar type of the tensor might be inconsistent with the scalar value and will cause regression issue of several models in #122292.

> It is related to https://github.com/pytorch/pytorch/pull/118024/files/bba213c151f2c8e7a29635d273a73a6c98d24393#r1463364631. at::scalar_tensor is more accurate in this use case. Is it a dtype propagation problem here?

Consider an int scalar with value 2: `at::scalar_tensor(2)` produces a `Float` tensor. The issue in #122292 is that this scalar output is later used as a size, which requires an `int` rather than a `float`.

I tried to get the dtype of the scalar value at codegen time and pass it to `at::scalar_tensor` as the `dtype` argument, but the scalar value is sometimes a `Symbol`, and I failed to find a good way to infer the dtype of such a `Symbol`. With `torch::tensor`, the dtype of the output tensor matches the dtype of the scalar input, so I changed `at::scalar_tensor` back to `torch::tensor` in non-ABI-compatible mode. May I know if you have other suggestions for fixing this issue?
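The codegen-time difficulty described above can be sketched as follows. This is a hypothetical helper with a stand-in `Symbol` class, not Inductor's actual code: dtype inference works for concrete Python scalars but has no answer for a symbolic value, which is what forces the fallback to `torch::tensor`.

```python
class Symbol:
    """Stand-in for a symbolic shape value appearing during codegen
    (hypothetical; in Inductor this would be a sympy Symbol)."""
    def __init__(self, name):
        self.name = name

def infer_scalar_dtype(value):
    # Hypothetical codegen-time inference: concrete scalars carry a
    # usable dtype, but a free Symbol does not.
    if isinstance(value, bool):       # check bool before int (bool is an int subclass)
        return "bool"
    if isinstance(value, int):
        return "int64"
    if isinstance(value, float):
        return "float32"
    if isinstance(value, Symbol):
        return None  # dtype unknown at codegen time
    raise TypeError(f"unsupported scalar: {value!r}")

print(infer_scalar_dtype(2))             # int64
print(infer_scalar_dtype(Symbol("s0")))  # None -- no dtype to pass to at::scalar_tensor
```

Since a `Symbol` yields no dtype to forward to `at::scalar_tensor`, letting `torch::tensor` infer the dtype from the scalar at runtime sidesteps the problem entirely.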

@chunyuan-w
Collaborator Author

Hi @desertfire, this issue causes a regression against the PyTorch 2.2 release, and we're trying to see if it's possible to fix it in 2.3. May I know if you have other suggestions regarding the current fix approach?

@chunyuan-w chunyuan-w added the `topic: not user facing` label Mar 29, 2024
Contributor

@desertfire desertfire left a comment


OK for the non-ABI-compatible mode, although we will have to come back and fix this once the ABI-compatible mode is turned on as the default.

@chunyuan-w chunyuan-w added the `ciflow/trunk` label (trigger trunk jobs on your pull request) Mar 31, 2024
@chunyuan-w
Collaborator Author

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team


@chunyuan-w
Collaborator Author

> ok for the non abi-compatible mode, although we will have to come back to fix once I turned on the abi-compatible mode as default.

Oh okay. Btw, when do we plan to turn on the abi-compatible mode as default?

chunyuan-w added a commit to chunyuan-w/pytorch that referenced this pull request Apr 1, 2024

Pull Request resolved: pytorch#122297
Approved by: https://github.com/jgong5, https://github.com/desertfire
@desertfire
Contributor

> ok for the non abi-compatible mode, although we will have to come back to fix once I turned on the abi-compatible mode as default.

> Oh okay. Btw, when do we plan to turn on the abi-compatible mode as default?

We do plan to, once its coverage has reached a reasonable level.

atalman pushed a commit that referenced this pull request Apr 2, 2024
…#122297) (#123064)


Pull Request resolved: #122297
Approved by: https://github.com/jgong5, https://github.com/desertfire
sanketpurandare pushed a commit to sanketpurandare/pytorch that referenced this pull request Apr 22, 2024

Pull Request resolved: pytorch#122297
Approved by: https://github.com/jgong5, https://github.com/desertfire
@github-actions github-actions bot deleted the gh/chunyuan-w/3/head branch May 2, 2024 01:52
5 participants