
fix aliasing for primtorch view meta kernels #86285

Closed
wants to merge 13 commits

Conversation

@pytorch-bot

pytorch-bot bot commented Oct 5, 2022

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/86285

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 4b7afaa:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@mruberry
Collaborator

mruberry commented Oct 5, 2022

The approach looks reasonable to me. What about extending test_python_ref_meta to detect this:

def test_python_ref_meta(self, device, dtype, op):

@IvanYashchuk
Collaborator

Is the only difference here that with alias=True the resulting tensor would return True for tensor._is_view()?

import torch
a = torch.zeros(3, 2, device="meta")
b = torch._prims.TensorMeta(a, shape=a.shape, strides=a.stride())
print(b._is_view()) # False
c = a.as_strided(a.shape, a.stride())
print(c._is_view()) # True
assert a.storage().data_ptr() == b.storage().data_ptr() == c.storage().data_ptr() # It's all 0 for meta

@bdhirsh
Contributor Author

bdhirsh commented Oct 6, 2022

@IvanYashchuk since the output isn't a view of the input, its metadata can also be incorrect. For example, if the input is a strided tensor with a non-zero storage offset, that offset won't be propagated to the output:

>>> a = torch.ones(2, 2, device='meta')[1]
>>> a.storage_offset()
2
>>> out_aten = a.as_strided(a.shape, a.stride())
>>> out_aten.storage_offset() # prints 2, the same as the input
2
>>> out_prim = torch._prims.TensorMeta(a, shape=a.shape, strides=a.stride())
>>> out_prim.storage_offset() # should print 2, prints 0!
0
>>>

@bdhirsh
Contributor Author

bdhirsh commented Oct 6, 2022

@mruberry Thanks for the pointer to test_python_ref_meta. It looks like that test currently checks that output shapes match, but not strides or storage offsets. The two ways I could imagine updating it are:

(1) Update the test to check for the _is_view() relationship (this seems a bit odd since it's a private API that depends on autograd, but maybe it would be fine?)
(2) Update the test to always check that storage offsets match. This is the "broken" thing that I originally noticed. The test would then always check that sizes and storage_offset match, but not strides. Does that sound reasonable to you?

FWIW, I'm also adding a cross-ref test that checks the correctness of all of the Python decomps and meta functions in torch/_decomp/decompositions.py and torch/_meta_registrations.py. Some of those decomps call into prims, which is where the problem first showed up. Those two files probably don't exercise the prims comprehensively, though, so a separate prim test sounds good to me.
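To make option (2) concrete, here is a minimal sketch of the kind of storage-offset check described above (the helper below is hypothetical and not the actual OpInfo test code): run the same viewing operation on a real tensor and on an equivalent meta tensor, then assert the offsets agree.

```python
import torch

# Hypothetical helper, not the real test_python_ref_meta code:
# storage_offset is pure metadata, so it should match even on the meta device.
def check_storage_offset(real_out, meta_out):
    assert real_out.storage_offset() == meta_out.storage_offset()

real_in = torch.ones(2, 2)[1]                 # real view, storage offset 2
meta_in = torch.ones(2, 2, device="meta")[1]  # meta view, storage offset 2

check_storage_offset(
    real_in.as_strided(real_in.shape, real_in.stride()),
    meta_in.as_strided(meta_in.shape, meta_in.stride()),
)
```

This is exactly the scenario from the example above: aten's as_strided preserves the input's storage offset, so a meta implementation that resets the offset to 0 would fail the assertion.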

@mruberry
Collaborator

mruberry commented Oct 6, 2022

> @mruberry Thanks for the pointer to test_python_ref_meta. It looks like that test currently checks that output shapes match, but not strides or storage offsets. The two ways I could imagine updating it are:

> (1) Update the test to check for the _is_view() relationship (this seems a bit odd since it's a private API that depends on autograd, but maybe it would be fine?)

Maybe TestViewOps in test_view_ops.py can provide some inspiration. See

def is_view_of(self, base, other):
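A minimal sketch of what such a view-relationship check might look like, loosely modeled on the TestViewOps helper (simplified; the real helper does additional bookkeeping):

```python
import torch

# Simplified sketch of a view-relationship check: a view reports
# _is_view() == True and records its base tensor as ._base.
def is_view_of(base, other):
    return (
        other._is_view()
        and other._base is base
        and other is not base
    )

a = torch.zeros(3, 2)
assert is_view_of(a, a.view(6))       # reshaping view of a
assert not is_view_of(a, a.clone())   # clone owns fresh storage
```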

> (2) Update the test to always check that storage offsets match. This is the "broken" thing that I originally noticed. The test would then always check that sizes and storage_offset match, but not strides. Does that sound reasonable to you?

Stride testing has been disabled for the moment. I spent a good amount of time trying to emulate PyTorch's striding logic, but PyTorch is pretty inconsistent in how it handles strides itself. See #78050. @ezyang concluded that we'd like to emulate strides, but I'm not sure how we would do this.

> FWIW, I'm also adding a cross-ref test that checks the correctness of all of the Python decomps and meta functions in torch/_decomp/decompositions.py and torch/_meta_registrations.py. Some of those decomps call into prims, which is where the problem first showed up. Those two files probably don't exercise the prims comprehensively, though, so a separate prim test sounds good to me.

There are several existing tests in test_ops.py for Python reference consistency, see

def test_python_ref(self, device, dtype, op):

These might be interesting to look at when developing a consistency test for decompositions. Ideally I think we'd like to see all decompositions ported to become Python references.

@ezyang
Contributor

ezyang commented Oct 6, 2022

@bdhirsh, why don't you just convert these into direct as_strided calls on the input tensor, rather than going through the TensorMeta constructor?

@bdhirsh
Contributor Author

bdhirsh commented Oct 6, 2022

> @bdhirsh, why don't you just convert these into direct as_strided calls on the input tensor, rather than going through the TensorMeta constructor?

Sounds good - you're right, that seems cleaner.

@bdhirsh
Contributor Author

bdhirsh commented Oct 10, 2022

@mruberry Updated the PR - I had some issues trying to test the _is_view() property. Since that's not really the property I cared about in the first place (the storage offset was incorrect), I updated the OpInfo tests to check the storage offset instead, which I confirmed exercises the problem from #86284. Let me know if you're happy with it.

Collaborator

@albanD left a comment


The storage_offset argument is missing. Sounds good otherwise.

@@ -1173,7 +1173,7 @@ def _greater_than_reduce(acc, x):
         else:
             new_strides.append(0)

-    return TensorMeta(a, shape=shape, strides=new_strides)
+    return a.as_strided(shape, new_strides)

All the functions here need a storage_offset=a.storage_offset() argument!
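The suggested fix can be sketched as follows: rebuild the view with as_strided and pass the input's storage offset through explicitly, so a non-zero offset on the input is preserved on the output.

```python
import torch

# Hedged sketch of the review suggestion: propagate the input's storage
# offset explicitly when rebuilding the view with as_strided.
a = torch.ones(2, 2, device="meta")[1]  # a view with storage offset 2
out = a.as_strided(a.shape, a.stride(), a.storage_offset())
assert out.storage_offset() == a.storage_offset() == 2
```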

Collaborator

@albanD left a comment


SGTM

@pytorch-bot added the ciflow/trunk label (Trigger trunk jobs on your pull request) on Oct 11, 2022
Collaborator

@mruberry left a comment


LGTM!
