
Conversation

@glaringlee (Contributor) commented Feb 25, 2020

Stack from ghstack:

Differential Revision: D20107158

glaringlee pushed a commit that referenced this pull request Feb 25, 2020
dr-ci bot commented Feb 26, 2020

💊 CircleCI build failures summary and remediations

As of commit 3229b44 (more details on the Dr. CI page):


None of the build failures appear to be your fault 💚



❄️ 2 tentatively flaky failures

2 failures tentatively classified as flaky; reruns have not been launched to confirm:

See CircleCI build pytorch_linux_xenial_cuda10_1_cudnn7_py3_multigpu_test (1/2)

Step: "Test" (full log | pattern match details) ❄️

Mar 03 20:35:22 RuntimeError: Error downloading resource!
Mar 03 20:35:22  
Mar 03 20:35:22 During handling of the above exception, another exception occurred: 
Mar 03 20:35:22  
Mar 03 20:35:22 Traceback (most recent call last): 
Mar 03 20:35:22   File "tools/download_mnist.py", line 87, in <module> 
Mar 03 20:35:22     main() 
Mar 03 20:35:22   File "tools/download_mnist.py", line 80, in main 
Mar 03 20:35:22     download(path, url, options.quiet) 
Mar 03 20:35:22   File "tools/download_mnist.py", line 41, in download 
Mar 03 20:35:22     raise RuntimeError('Error downloading resource!') 
Mar 03 20:35:22 RuntimeError: Error downloading resource! 
Mar 03 20:35:22 + cleanup 
Mar 03 20:35:22 + retcode=1 
Mar 03 20:35:22 + set +x 
Mar 03 20:35:22 =================== sccache compilation log =================== 
Mar 03 20:35:22 =========== If your build fails, please take a look at the log above for possible reasons =========== 
Mar 03 20:35:22 Compile requests                 0 
Mar 03 20:35:22 Compile requests executed        0 
Mar 03 20:35:22 Cache hits                       0 
Mar 03 20:35:22 Cache misses                     0 
Mar 03 20:35:22 Cache timeouts                   0 

See CircleCI build pytorch_linux_xenial_cuda10_1_cudnn7_py3_NO_AVX2_test (2/2)

Step: "Test" (full log | pattern match details) ❄️

Mar 03 21:46:13 RuntimeError: Error downloading resource!
Mar 03 21:46:13  
Mar 03 21:46:13 During handling of the above exception, another exception occurred: 
Mar 03 21:46:13  
Mar 03 21:46:13 Traceback (most recent call last): 
Mar 03 21:46:13   File "tools/download_mnist.py", line 87, in <module> 
Mar 03 21:46:13     main() 
Mar 03 21:46:13   File "tools/download_mnist.py", line 80, in main 
Mar 03 21:46:13     download(path, url, options.quiet) 
Mar 03 21:46:13   File "tools/download_mnist.py", line 41, in download 
Mar 03 21:46:13     raise RuntimeError('Error downloading resource!') 
Mar 03 21:46:13 RuntimeError: Error downloading resource! 
Mar 03 21:46:13 + cleanup 
Mar 03 21:46:13 + retcode=1 
Mar 03 21:46:13 + set +x 
Mar 03 21:46:13 =================== sccache compilation log =================== 
Mar 03 21:46:13 =========== If your build fails, please take a look at the log above for possible reasons =========== 
Mar 03 21:46:13 Compile requests                15 
Mar 03 21:46:13 Compile requests executed        0 
Mar 03 21:46:13 Cache hits                       0 
Mar 03 21:46:13 Cache misses                     0 
Mar 03 21:46:13 Cache timeouts                   0 


if isinstance(resize, str):
    return "{}.resize_({}.sizes());".format(arg['name'], resize)
else:
    resize_scalar = arg.get('resize_scalar', False)
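
For reference, a rough sketch of what this codegen branch emits; the argument name and resize target below are hypothetical stand-ins, not values taken from the real declarations:

# Hypothetical inputs standing in for a parsed declaration entry.
arg = {'name': 'result'}
resize = 'self'

emitted = "{}.resize_({}.sizes());".format(arg['name'], resize)
print(emitted)  # result.resize_(self.sizes());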
@zou3519 (Contributor) commented Feb 26, 2020

The existence of resize_scalar makes it seem like ger, at some point in the past, supported accepting a 0D tensor. It doesn't accept 0D tensors on master and no one has complained about it, so this change seems fine to me.

If we want to be really safe, we can try to figure out whether torch.ger ever accepted a 0D tensor in the past and, if so, when it stopped accepting one.
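
As a quick sanity check of the behavior described above, a minimal sketch (exact error messages vary by version):

import torch

v = torch.randn(3)
s = torch.tensor(2.0)    # 0-D (scalar) tensor

torch.ger(v, v)          # OK: 3x3 outer product

try:
    torch.ger(s, v)      # rejected: ger expects 1-D arguments
except RuntimeError as e:
    print("0-D input rejected:", e)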

@glaringlee (Contributor, Author) replied:

Checked a little bit. I think the people who made the _th_ger change wanted to make this resize safe.
In the legacy code, the size check is inside the addr function, but this resize happens before addr is called. _th_ger calls addr underneath, and addr doesn't allow a 0D vec, so ger doesn't support a 0D vec anyway.
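
A minimal check of the addr-side behavior mentioned above (a sketch; the error text is version-dependent):

import torch

M = torch.zeros(3, 4)
v1 = torch.randn(3)
v2 = torch.randn(4)
s = torch.tensor(1.0)     # 0-D tensor

torch.addr(M, v1, v2)     # OK: M + outer(v1, v2)

try:
    torch.addr(M, s, v2)  # addr requires 1-D vec arguments
except RuntimeError as e:
    print("0-D vec rejected:", e)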

@zou3519 (Contributor) left a review comment:

The new resize semantics introduced in this PR are BC-breaking; we should either:

  1. follow the old behavior, or
  2. follow the old behavior and issue a deprecation warning.

glaringlee pushed a commit that referenced this pull request Feb 26, 2020
@glaringlee requested a review from zou3519 February 27, 2020 00:05
@glaringlee (Contributor, Author) commented:

Discussed with @zou3519; we will keep the old TH resize behavior in this PR. We will open a new PR if we need to deprecate the legacy resize behavior.
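
For context, a small sketch of what the preserved legacy resize behavior looks like from the Python side (an illustration under assumptions: torch.ger with out=; newer releases may additionally warn when resizing a non-empty out tensor):

import torch

a = torch.randn(3)
b = torch.randn(4)

out = torch.empty(5)          # deliberately mismatched shape
torch.ger(a, b, out=out)      # legacy TH behavior: out is resized to (3, 4)
print(out.shape)              # torch.Size([3, 4])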

Tensor& ger_out(Tensor &result, const Tensor& self, const Tensor& vec2) {
  check_1d(self, "self", "ger");
  check_1d(vec2, "vec2", "ger");
  if (result.dim() != 2 || result.size(0) != self.size(0) || result.size(1) != vec2.size(0)) {
A reviewer (Contributor) commented:

nit: I think it's clearer if you do something like

if (result.sizes() != {self.size(0), vec2.size(0)}) {
   result.resize_({...});
}

@glaringlee (Contributor, Author) replied:

Will do if there are more dimensions to check. For two dimensions, I think this is still fine and saves a memory allocation, so it's slightly faster.

glaringlee pushed a commit that referenced this pull request Feb 29, 2020
@glaringlee requested a review from ailzhang March 2, 2020 16:08
@glaringlee (Contributor, Author) commented:

This breaks the XLA CI test; adding Ailing to give an update here once the XLA side is ready.

@ailzhang (Contributor) commented Mar 2, 2020

@pytorchbot rebase this please

@ailzhang closed this Mar 2, 2020
@ailzhang reopened this Mar 2, 2020
@ailzhang (Contributor) commented Mar 2, 2020

ehhh what happened to our pytorchbot? :P
@glaringlee I think this PR should be fine if you rebase on top of master. Would you mind rebasing and seeing whether the XLA tests pass?
The current failure is caused by my update to the API between PT and XLA.

@glaringlee requested a review from ailzhang March 2, 2020 23:13
glaringlee pushed a commit that referenced this pull request Mar 3, 2020
@ailzhang (Contributor) left a review comment:

Thanks!

glaringlee pushed a commit that referenced this pull request Mar 3, 2020
glaringlee pushed a commit that referenced this pull request Mar 3, 2020
@facebook-github-bot commented:

@glaringlee merged this pull request in 57c1b80.

ttumiel pushed a commit to ttumiel/pytorch that referenced this pull request Mar 4, 2020
…ytorch#33792)

Summary: Pull Request resolved: pytorch#33792

Test Plan: Imported from OSS

Differential Revision: D20107158

Pulled By: glaringlee

fbshipit-source-id: bceddb2d39d3abf36f277daba537677312449c9c
@facebook-github-bot deleted the gh/glaringlee/7/head branch March 7, 2020 15:18