preserve non-dense or overlapping tensor's layout in *_like functions #46046
Conversation
…*_like functions" *_like functions are used in pytorch to create a new tensor with the same shape of the input tensor. But we don’t always preserve the layout permutation of the tensor. Current behavior is that, for a dense and non-overlapping tensor, its layout permutation is preserved. For eg. passing a channel last contiguous tensor t with ‘shape/stride’ (2, 4, 3, 2)/(24, 1, 8, 4) to empty_like(t) function will create a new tensor with exactly the same ‘shape/stride’ as the input tensor t. However, if the input tensor is non-dense or has overlap, we simply create a contiguous tensor based on input tensor’s shape, so the tensor layout permutation is lost. This PR preserves the layout permutation for non-dense or overlapping tensor. The strides propagation rule that used in this PR is exactly the same as what is being used in TensorIterator. The behavior changes are listed below: | code | old | new | |------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------|------------------------------------------------------| | #strided tensors<br>a=torch.randn(2,3,8)[:,:,::2].permute(2,0,1)<br>print(a.stride())<br>print(a.exp().stride())<br>print((a+a).stride())<br>out = torch.empty(0)<br>torch.add(a,a,out=out)<br>print(out.stride()) | (2, 24, 8) <br>(6, 3, 1) <br>(1, 12, 4) <br>(6, 3, 1) | (2, 24, 8)<br>(1, 12, 4)<br>(1, 12, 4)<br>(1, 12, 4) | | #memory dense tensors<br>a=torch.randn(3,1,1).as_strided((3,1,1), (1,3,3))<br>print(a.stride(), (a+torch.randn(1)).stride())<br>a=torch.randn(2,3,4).permute(2,0,1)<br>print(a.stride())<br>print(a.exp().stride())<br>print((a+a).stride())<br>out = torch.empty(0)<br>torch.add(a,a,out=out)<br>print(out.stride()) | (1, 3, 3) (1, 1, 1)<br>(1, 12, 4)<br>(6, 3, 1)<br>(1, 12, 4)<br>(6, 3, 1) | (1, 3, 3) (1, 3, 3)<br>(1, 12, 4)<br>(1, 12, 4)<br>(1, 12, 4)<br>(1, 12, 4) | This is to solve the non-dense tensor layout problem in #45505 TODO: - [x] Fix all the BC broken test cases in pytorch - [ ] Investigate if any fb internal tests are broken This change will cover all kinds of non-dense tensors. [ghstack-poisoned]
…*_like functions" *_like functions are used in pytorch to create a new tensor with the same shape of the input tensor. But we don’t always preserve the layout permutation of the tensor. Current behavior is that, for a dense and non-overlapping tensor, its layout permutation is preserved. For eg. passing a channel last contiguous tensor t with ‘shape/stride’ (2, 4, 3, 2)/(24, 1, 8, 4) to empty_like(t) function will create a new tensor with exactly the same ‘shape/stride’ as the input tensor t. However, if the input tensor is non-dense or has overlap, we simply create a contiguous tensor based on input tensor’s shape, so the tensor layout permutation is lost. This PR preserves the layout permutation for non-dense or overlapping tensor. The strides propagation rule that used in this PR is exactly the same as what is being used in TensorIterator. The behavior changes are listed below: | code | old | new | |------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------|------------------------------------------------------| | #strided tensors<br>a=torch.randn(2,3,8)[:,:,::2].permute(2,0,1)<br>print(a.stride())<br>print(a.exp().stride())<br>print((a+a).stride())<br>out = torch.empty(0)<br>torch.add(a,a,out=out)<br>print(out.stride()) | (2, 24, 8) <br>(6, 3, 1) <br>(1, 12, 4) <br>(6, 3, 1) | (2, 24, 8)<br>(1, 12, 4)<br>(1, 12, 4)<br>(1, 12, 4) | | #memory dense tensors<br>a=torch.randn(3,1,1).as_strided((3,1,1), (1,3,3))<br>print(a.stride(), (a+torch.randn(1)).stride())<br>a=torch.randn(2,3,4).permute(2,0,1)<br>print(a.stride())<br>print(a.exp().stride())<br>print((a+a).stride())<br>out = torch.empty(0)<br>torch.add(a,a,out=out)<br>print(out.stride()) | (1, 3, 3) (1, 1, 1)<br>(1, 12, 4)<br>(6, 3, 1)<br>(1, 12, 4)<br>(6, 3, 1) | (1, 3, 3) (1, 3, 3)<br>(1, 12, 4)<br>(1, 12, 4)<br>(1, 12, 4)<br>(1, 12, 4) | This is to solve the non-dense tensor layout problem in #45505 TODO: - [x] Fix all the BC broken test cases in pytorch - [ ] Investigate if any fb internal tests are broken This change will cover all kinds of non-dense tensors. Differential Revision: [D24288970](https://our.internmc.facebook.com/intern/diff/D24288970) [ghstack-poisoned]
…*_like functions" *_like functions are used in pytorch to create a new tensor with the same shape of the input tensor. But we don’t always preserve the layout permutation of the tensor. Current behavior is that, for a dense and non-overlapping tensor, its layout permutation is preserved. For eg. passing a channel last contiguous tensor t with ‘shape/stride’ (2, 4, 3, 2)/(24, 1, 8, 4) to empty_like(t) function will create a new tensor with exactly the same ‘shape/stride’ as the input tensor t. However, if the input tensor is non-dense or has overlap, we simply create a contiguous tensor based on input tensor’s shape, so the tensor layout permutation is lost. This PR preserves the layout permutation for non-dense or overlapping tensor. The strides propagation rule that used in this PR is exactly the same as what is being used in TensorIterator. The behavior changes are listed below: | code | old | new | |------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------|------------------------------------------------------| | #strided tensors<br>a=torch.randn(2,3,8)[:,:,::2].permute(2,0,1)<br>print(a.stride())<br>print(a.exp().stride())<br>print((a+a).stride())<br>out = torch.empty(0)<br>torch.add(a,a,out=out)<br>print(out.stride()) | (2, 24, 8) <br>(6, 3, 1) <br>(1, 12, 4) <br>(6, 3, 1) | (2, 24, 8)<br>(1, 12, 4)<br>(1, 12, 4)<br>(1, 12, 4) | | #memory dense tensors<br>a=torch.randn(3,1,1).as_strided((3,1,1), (1,3,3))<br>print(a.stride(), (a+torch.randn(1)).stride())<br>a=torch.randn(2,3,4).permute(2,0,1)<br>print(a.stride())<br>print(a.exp().stride())<br>print((a+a).stride())<br>out = torch.empty(0)<br>torch.add(a,a,out=out)<br>print(out.stride()) | (1, 3, 3) (1, 1, 1)<br>(1, 12, 4)<br>(6, 3, 1)<br>(1, 12, 4)<br>(6, 3, 1) | (1, 3, 3) (1, 3, 3)<br>(1, 12, 4)<br>(1, 12, 4)<br>(1, 12, 4)<br>(1, 12, 4) | This is to solve the non-dense tensor layout problem in #45505 TODO: - [x] Fix all the BC broken test cases in pytorch - [ ] Investigate if any fb internal tests are broken This change will cover all kinds of non-dense tensors. Differential Revision: [D24288970](https://our.internmc.facebook.com/intern/diff/D24288970) [ghstack-poisoned]
…*_like functions" *_like functions are used in pytorch to create a new tensor with the same shape of the input tensor. But we don’t always preserve the layout permutation of the tensor. Current behavior is that, for a dense and non-overlapping tensor, its layout permutation is preserved. For eg. passing a channel last contiguous tensor t with ‘shape/stride’ (2, 4, 3, 2)/(24, 1, 8, 4) to empty_like(t) function will create a new tensor with exactly the same ‘shape/stride’ as the input tensor t. However, if the input tensor is non-dense or has overlap, we simply create a contiguous tensor based on input tensor’s shape, so the tensor layout permutation is lost. This PR preserves the layout permutation for non-dense or overlapping tensor. The strides propagation rule that used in this PR is exactly the same as what is being used in TensorIterator. The test is based on empty_like. We compare the output tensor's sizes/strides among the empty_like old code, empty_like new code, and TensorIterator based operator exp(). For each sizes/strides we create tensor from both empty_like() and exp() using the following code and record the output tensor's sizes/stride. ``` a = torch.empty_strided([sizes], [strides]) b = torch.empty_like(a) c = torch.exp(a) ``` The output strides changes are listed below: |sizes/strides |old |new |unary op |description | |--------------------|---------------------|--------------------|--------------------|------------------------------------| |(4,3,2)/(10,1,2) |(4,3,2)/(6,2,1) |(4,3,2)/(6,1,3) |(4,3,2)/(6,1,3) |non-dense but overlap | |(4,3,2)/(10,0,3) |(4,3,2)/(6,2,1) |(4,3,2)/(6,2,1) |(4,3,2)/(6,2,1) |non-dense but overlap (0 strided) | |(4,3,2)/(12,2,6) |(4,3,2)/(6,2,1) |(4,3,2)/(6,1,3) |(4,3,2)/(6,1,3) |non-dense non-overlap (sliced) | |(4,3,2)/(10,1,3) |(4,3,2)/(6,2,1) |(4,3,2)/(6,1,3) |(4,3,2)/(6,1,3) |non-dense non-overlap (with gap) | |(4,1,1,2)/(10,0,0,2)|(4,1,1,2)/(2,2,2,1) |(4,1,1,2)/(2,2,2,1) |(4,1,1,2)/(2,2,2,1) |non-dense non-overlap (0 strided) | |(4,1,2)/(10,3,3) |(4,1,2)/(2,2,1) |(4,1,2)/(2,1,1) |(4,1,2)/(2,1,1) |overlap (with equal strides) | |(4,2,2,2)/(2,0,0,6) |(4,2,2,2)/(8,4,2,1) |(4,2,2,2)/(1,8,4,16)|(4,2,2,2)/(1,8,4,16)|overlap (0 strided with equal size) | This is to solve the non-dense tensor layout problem in #45505 TODO: - [x] Fix all the BC broken test cases in pytorch - [ ] Investigate if any fb internal tests are broken This change will cover all kinds of non-dense tensors. Differential Revision: [D24288970](https://our.internmc.facebook.com/intern/diff/D24288970) [ghstack-poisoned]
…e functions" *_like functions are used in pytorch to create a new tensor with the same shape of the input tensor. But we don’t always preserve the layout permutation of the tensor. Current behavior is that, for a dense and non-overlapping tensor, its layout permutation is preserved. For eg. passing a channel last contiguous tensor t with ‘shape/stride’ (2, 4, 3, 2)/(24, 1, 8, 4) to empty_like(t) function will create a new tensor with exactly the same ‘shape/stride’ as the input tensor t. However, if the input tensor is non-dense or has overlap, we simply create a contiguous tensor based on input tensor’s shape, so the tensor layout permutation is lost. This PR preserves the layout permutation for non-dense or overlapping tensor. The strides propagation rule that used in this PR is exactly the same as what is being used in TensorIterator. The test is based on empty_like. We compare the output tensor's sizes/strides among the empty_like old code, empty_like new code, and TensorIterator based operator exp(). For each sizes/strides we create tensor from both empty_like() and exp() using the following code and record the output tensor's sizes/stride. ``` a = torch.empty_strided([sizes], [strides]) b = torch.empty_like(a) c = torch.exp(a) // record the result of b (in both old and new code) and c to create the list below ``` The output strides changes are listed below: |sizes/strides |old |new |unary op |description | |--------------------|---------------------|--------------------|--------------------|------------------------------------| |(4,3,2)/(10,1,2) |(4,3,2)/(6,2,1) |(4,3,2)/(6,1,3) |(4,3,2)/(6,1,3) |non-dense but overlap | |(4,3,2)/(10,0,3) |(4,3,2)/(6,2,1) |(4,3,2)/(6,2,1) |(4,3,2)/(6,2,1) |non-dense but overlap (0 strided) | |(4,3,2)/(12,2,6) |(4,3,2)/(6,2,1) |(4,3,2)/(6,1,3) |(4,3,2)/(6,1,3) |non-dense non-overlap (sliced) | |(4,3,2)/(10,1,3) |(4,3,2)/(6,2,1) |(4,3,2)/(6,1,3) |(4,3,2)/(6,1,3) |non-dense non-overlap (with gap) | |(4,1,1,2)/(10,0,0,2)|(4,1,1,2)/(2,2,2,1) |(4,1,1,2)/(2,2,2,1) |(4,1,1,2)/(2,2,2,1) |non-dense non-overlap (0 strided) | |(4,1,2)/(10,3,3) |(4,1,2)/(2,2,1) |(4,1,2)/(2,1,1) |(4,1,2)/(2,1,1) |overlap (with equal strides) | |(4,2,2,2)/(2,0,0,6) |(4,2,2,2)/(8,4,2,1) |(4,2,2,2)/(1,8,4,16)|(4,2,2,2)/(1,8,4,16)|overlap (0 strided with equal size) | This is to solve the non-dense tensor layout problem in #45505 TODO: - [x] Fix all the BC broken test cases in pytorch - [x] Investigate if any fb internal tests are broken (Update: None so far) This change will cover all kinds of non-dense tensors. Differential Revision: [D24288970](https://our.internmc.facebook.com/intern/diff/D24288970) [ghstack-poisoned]
Looks good, thank you! Nice that all *_like functions are handled by changing just empty_like.
…e functions" *_like functions are used in pytorch to create a new tensor with the same shape of the input tensor. But we don’t always preserve the layout permutation of the tensor. Current behavior is that, for a dense and non-overlapping tensor, its layout permutation is preserved. For eg. passing a channel last contiguous tensor t with ‘shape/stride’ (2, 4, 3, 2)/(24, 1, 8, 4) to empty_like(t) function will create a new tensor with exactly the same ‘shape/stride’ as the input tensor t. However, if the input tensor is non-dense or has overlap, we simply create a contiguous tensor based on input tensor’s shape, so the tensor layout permutation is lost. This PR preserves the layout permutation for non-dense or overlapping tensor. The strides propagation rule that used in this PR is exactly the same as what is being used in TensorIterator. The test is based on empty_like. We compare the output tensor's sizes/strides among the empty_like old code, empty_like new code, and TensorIterator based operator exp(). For each sizes/strides we create tensor from both empty_like() and exp() using the following code and record the output tensor's sizes/stride. ``` a = torch.empty_strided([sizes], [strides]) b = torch.empty_like(a) c = torch.exp(a) // record the result of b (in both old and new code) and c to create the list below ``` The output strides changes are listed below: |sizes/strides |old |new |unary op |description | |--------------------|---------------------|--------------------|--------------------|------------------------------------| |(4,3,2)/(10,1,2) |(4,3,2)/(6,2,1) |(4,3,2)/(6,1,3) |(4,3,2)/(6,1,3) |non-dense but overlap | |(4,3,2)/(10,0,3) |(4,3,2)/(6,2,1) |(4,3,2)/(6,2,1) |(4,3,2)/(6,2,1) |non-dense but overlap (0 strided) | |(4,3,2)/(12,2,6) |(4,3,2)/(6,2,1) |(4,3,2)/(6,1,3) |(4,3,2)/(6,1,3) |non-dense non-overlap (sliced) | |(4,3,2)/(10,1,3) |(4,3,2)/(6,2,1) |(4,3,2)/(6,1,3) |(4,3,2)/(6,1,3) |non-dense non-overlap (with gap) | |(4,1,1,2)/(10,0,0,2)|(4,1,1,2)/(2,2,2,1) |(4,1,1,2)/(2,2,2,1) |(4,1,1,2)/(2,2,2,1) |non-dense non-overlap (0 strided) | |(4,1,2)/(10,3,3) |(4,1,2)/(2,2,1) |(4,1,2)/(2,1,1) |(4,1,2)/(2,1,1) |overlap (with equal strides) | |(4,2,2,2)/(2,0,0,6) |(4,2,2,2)/(8,4,2,1) |(4,2,2,2)/(1,8,4,16)|(4,2,2,2)/(1,8,4,16)|overlap (0 strided with equal size) | This is to solve the non-dense tensor layout problem in #45505 TODO: - [x] Fix all the BC broken test cases in pytorch - [x] Investigate if any fb internal tests are broken (Update: None so far) This change will cover all kinds of non-dense tensors. Differential Revision: [D24288970](https://our.internmc.facebook.com/intern/diff/D24288970) [ghstack-poisoned]
|
@glaringlee merged this pull request in a651b87.
Stack from ghstack:
*_like functions are used in PyTorch to create a new tensor with the same shape as an input tensor, but they do not always preserve the input's layout permutation. The current behavior is that a dense, non-overlapping tensor has its layout permutation preserved: for example, passing a channels-last contiguous tensor t with shape/stride (2, 4, 3, 2)/(24, 1, 8, 4) to empty_like(t) creates a new tensor with exactly the same shape/stride as t. However, if the input tensor is non-dense or overlapping, we simply create a contiguous tensor based on the input tensor's shape, so its layout permutation is lost.
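A minimal sketch of the behavior described above; the printed strides are taken from the before/after comparison in this PR:

```python
import torch

# Dense, non-overlapping input: the layout permutation is preserved
# both before and after this PR.
t = torch.empty(2, 4, 3, 2, memory_format=torch.channels_last)
print(t.stride())                    # (24, 1, 8, 4)
print(torch.empty_like(t).stride())  # (24, 1, 8, 4)

# Non-dense input (sliced, then permuted): before this PR the layout
# permutation was dropped in favor of a plain contiguous layout.
a = torch.randn(2, 3, 8)[:, :, ::2].permute(2, 0, 1)
print(a.stride())        # (2, 24, 8)
print(a.exp().stride())  # before this PR: (6, 3, 1) -- permutation lost
                         # with this PR:   (1, 12, 4) -- permutation preserved
```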
This PR preserves the layout permutation for non-dense or overlapping tensors. The stride propagation rule used in this PR is exactly the same as the one used in TensorIterator.
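As a rough illustration of that rule, a hypothetical helper (not the actual ATen implementation, and ignoring the special handling of zero strides and size-1 dimensions visible in some rows of the table below) might look like:

```python
# Hypothetical helper, not the real ATen code: rebuild dense (packed) strides
# that keep the input's dimension ordering, TensorIterator-style. Zero strides
# get extra special-casing in the real implementation, which this simplified
# sketch does not model.
def propagate_strides(sizes, strides):
    # Visit dimensions from fastest-moving (smallest stride) to slowest,
    # breaking ties by size, and assign packed strides in that order.
    order = sorted(range(len(sizes)), key=lambda d: (strides[d], sizes[d]))
    out = [0] * len(sizes)
    next_stride = 1
    for d in order:
        out[d] = next_stride
        next_stride *= sizes[d]
    return tuple(out)

# A sliced-and-permuted (4, 2, 3) tensor with strides (2, 24, 8) maps to a
# dense tensor with the same dimension ordering:
print(propagate_strides((4, 2, 3), (2, 24, 8)))  # (1, 12, 4)
```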
The test is based on empty_like. We compare the output tensor's sizes/strides among the old empty_like code, the new empty_like code, and the TensorIterator-based operator exp(). For each sizes/strides pair we create tensors via both empty_like() and exp() using the following code and record the output tensors' sizes/strides.
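```
a = torch.empty_strided([sizes], [strides])
b = torch.empty_like(a)
c = torch.exp(a)
# record the sizes/strides of b (under both the old and new code) and of c
# to build the table below
```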
The output stride changes are listed below:

|sizes/strides|old|new|unary op|description|
|---|---|---|---|---|
|(4,3,2)/(10,1,2)|(4,3,2)/(6,2,1)|(4,3,2)/(6,1,3)|(4,3,2)/(6,1,3)|non-dense but overlapping|
|(4,3,2)/(10,0,3)|(4,3,2)/(6,2,1)|(4,3,2)/(6,2,1)|(4,3,2)/(6,2,1)|non-dense but overlapping (0-strided)|
|(4,3,2)/(12,2,6)|(4,3,2)/(6,2,1)|(4,3,2)/(6,1,3)|(4,3,2)/(6,1,3)|non-dense, non-overlapping (sliced)|
|(4,3,2)/(10,1,3)|(4,3,2)/(6,2,1)|(4,3,2)/(6,1,3)|(4,3,2)/(6,1,3)|non-dense, non-overlapping (with gap)|
|(4,1,1,2)/(10,0,0,2)|(4,1,1,2)/(2,2,2,1)|(4,1,1,2)/(2,2,2,1)|(4,1,1,2)/(2,2,2,1)|non-dense, non-overlapping (0-strided)|
|(4,1,2)/(10,3,3)|(4,1,2)/(2,2,1)|(4,1,2)/(2,1,1)|(4,1,2)/(2,1,1)|overlapping (with equal strides)|
|(4,2,2,2)/(2,0,0,6)|(4,2,2,2)/(8,4,2,1)|(4,2,2,2)/(1,8,4,16)|(4,2,2,2)/(1,8,4,16)|overlapping (0-strided with equal sizes)|
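For instance, the first row can be reproduced directly; the printed strides come from the table above:

```python
# Non-dense, overlapping input from the first table row.
a = torch.empty_strided((4, 3, 2), (10, 1, 2))
print(torch.empty_like(a).stride())  # old code: (6, 2, 1); new code: (6, 1, 3)
print(torch.exp(a).stride())         # (6, 1, 3), matching TensorIterator
```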
This solves the non-dense tensor layout problem in #45505.
TODO:
- [x] Fix all the BC broken test cases in pytorch
- [x] Investigate if any fb internal tests are broken (Update: none so far)
This change will cover all kinds of non-dense tensors.
Differential Revision: [D24288970](https://our.internmc.facebook.com/intern/diff/D24288970)