
[numpy] torch.{all, any} : Extend Dtype Support #44790

Closed

Conversation

kshitij12345
Collaborator

Reference #44779

@kshitij12345 kshitij12345 marked this pull request as draft September 16, 2020 15:56
@dr-ci

dr-ci bot commented Sep 16, 2020

💊 CI failures summary and remediations

As of commit 1b242ef (more details on the Dr. CI page):


  • 3/3 failures possibly* introduced in this PR
    • 1/3 non-CircleCI failure(s)

🕵️ 2 new failures recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

See CircleCI build binary_linux_libtorch_3_7m_cpu_devtoolset7_shared-with-deps_build (1/2)

Step: "Checkout pytorch/builder repo" (full log | diagnosis details | 🔁 rerun)

fatal: reference is not a tree: cd5a9b73c3028d2496666201588111a8c8d84878
+ git submodule update --init --recursive 
Warning: Permanently added the RSA host key for IP address '140.82.112.3' to the list of known hosts.  
fatal: reference is not a tree: cd5a9b73c3028d2496666201588111a8c8d84878 
Unable to checkout 'cd5a9b73c3028d2496666201588111a8c8d84878' in submodule path 'third_party/nccl/nccl' 
+ sleep 4 
+ git submodule update --init --recursive 
fatal: reference is not a tree: cd5a9b73c3028d2496666201588111a8c8d84878 
Unable to checkout 'cd5a9b73c3028d2496666201588111a8c8d84878' in submodule path 'third_party/nccl/nccl' 
+ sleep 8 
+ git submodule update --init --recursive 
fatal: reference is not a tree: cd5a9b73c3028d2496666201588111a8c8d84878 
Unable to checkout 'cd5a9b73c3028d2496666201588111a8c8d84878' in submodule path 'third_party/nccl/nccl' 

See CircleCI build pytorch_xla_linux_bionic_py3_6_clang9_test (2/2)

Step: "Run tests" (full log | diagnosis details | 🔁 rerun)

Nov 10 11:14:45 FAIL [0.175s]: test_all_any_vs_numpy_xla_uint8 (__main__.TestTorchDeviceTypeXLA)
Nov 10 11:14:45     return DeviceTypeTestBase.assertEqual(self, x, y, *args, **kwargs) 
Nov 10 11:14:45   File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_utils.py", line 1037, in assertEqual 
Nov 10 11:14:45     exact_dtype=exact_dtype, exact_device=exact_device) 
Nov 10 11:14:45   File "/var/lib/jenkins/workspace/xla/test/pytorch_test_base.py", line 552, in assertEqual 
Nov 10 11:14:45     return DeviceTypeTestBase.assertEqual(self, x, y, *args, **kwargs) 
Nov 10 11:14:45   File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_utils.py", line 1151, in assertEqual 
Nov 10 11:14:45     super().assertEqual(x, y, msg=msg) 
Nov 10 11:14:45 AssertionError: True != 46 
Nov 10 11:14:45  
Nov 10 11:14:45 ====================================================================== 
Nov 10 11:14:45 FAIL [0.175s]: test_all_any_vs_numpy_xla_uint8 (__main__.TestTorchDeviceTypeXLA) 
Nov 10 11:14:45 ---------------------------------------------------------------------- 
Nov 10 11:14:45 Traceback (most recent call last): 
Nov 10 11:14:45   File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_device_type.py", line 272, in instantiated_test 
Nov 10 11:14:45     result = test_fn(self, *args) 
Nov 10 11:14:45   File "/var/lib/jenkins/workspace/xla/test/../../test/test_torch.py", line 19633, in test_all_any_vs_numpy 
Nov 10 11:14:45     _test_all_any(x) 
Nov 10 11:14:45   File "/var/lib/jenkins/workspace/xla/test/../../test/test_torch.py", line 19618, in _test_all_any 
Nov 10 11:14:45     self.compare_with_numpy(torch.all, np.all, x) 
Nov 10 11:14:45   File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_utils.py", line 913, in compare_with_numpy 
Nov 10 11:14:45     self.assertEqual(np_result, torch_result, **kwargs) 

ci.pytorch.org: 1 failed



@kshitij12345 kshitij12345 marked this pull request as ready for review October 11, 2020 10:04
@kshitij12345 kshitij12345 changed the title from [WIP][numpy] torch.{all, any} : Extend Dtype Support to [numpy] torch.{all, any} : Extend Dtype Support Oct 11, 2020
@heitorschueroff heitorschueroff added the module: reductions, module: numpy, and triaged labels Oct 14, 2020
Contributor

@heitorschueroff heitorschueroff left a comment

LGTM. Thanks for this PR, it's a very welcome change.

Note: Now that torch.all and torch.any support all dtypes, we should document it in the public APIs as mentioned in #44779, but this can be a separate PR.
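
For reference (a hedged sketch, not part of this PR's diff), the newly supported usage on a floating-point tensor now just works:

```python
import torch

x = torch.tensor([1.0, 2.5, -3.0])

# With this PR, all/any accept dtypes beyond bool and uint8.
print(torch.all(x))        # all elements are nonzero
print(torch.any(x == 0))   # no element equals zero
```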

Contributor

@facebook-github-bot facebook-github-bot left a comment

@heitorschueroff has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Contributor

@heitorschueroff heitorschueroff left a comment

It looks like the XLA failures are related to the changes. Could you look into what's causing them, please?

@kshitij12345
Collaborator Author

@heitorschueroff Thanks for looking at it.

As for XLA, I'm not really sure what is happening.
Maybe @JackCaoG can have a look.

Thanks!

@JackCaoG
Collaborator

JackCaoG commented Nov 5, 2020

@kshitij12345 The XLA change is ready; I will merge it when this PR is merged.

@heitorschueroff
Contributor

@JackCaoG Thanks for updating XLA. @kshitij12345 Could you rebase, please? I'll merge it then.

@kshitij12345
Collaborator Author

@heitorschueroff I have fixed the conflict. The ROCm failure looks unrelated.

Contributor

@facebook-github-bot facebook-github-bot left a comment

@heitorschueroff has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@heitorschueroff
Contributor

@heitorschueroff I have fixed the conflict. The ROCm failure looks unrelated.

Thank you for this contribution, I'm importing your changes now.

Contributor

@heitorschueroff heitorschueroff left a comment

It looks like I missed some details in my review. Our internal tests on Phabricator are complaining. I left some comments from Phabricator; they should be fairly quick to fix, and then I can land it without problems.

return c;
},
/*ident=*/true);
if (c10::isIntegralType(iter.dtype(), /*include_bool=*/true)) {
Contributor

include_bool -> includeBool

return c;
},
/*ident=*/false);
if (c10::isIntegralType(iter.dtype(), /*include_bool=*/true)) {
Contributor

include_bool -> includeBool

if (c10::isIntegralType(iter.dtype(), /*include_bool=*/true)) {
binary_kernel_reduce_vec(
iter,
[=](uint8_t a, uint8_t b) -> uint8_t { return a && b; },
Contributor

Avoid the implicit cast with:
[=](uint8_t a, uint8_t b) -> uint8_t { return ((a && b) ? 1 : 0); },

if (c10::isIntegralType(iter.dtype(), /*include_bool=*/true)) {
binary_kernel_reduce_vec(
iter,
[=](uint8_t a, uint8_t b) -> uint8_t { return a || b; },
Contributor

Avoid the implicit cast with:
[=](uint8_t a, uint8_t b) -> uint8_t { return ((a || b) ? 1 : 0); },

// true/false.
Vec256<uint8_t> c = Vec256<uint8_t>();
for (int i = 0; i != Vec256<uint8_t>::size(); i++) {
c[i] = a[i] && b[i];
Contributor

Avoid implicit cast with:
c[i] = ((a[i] && b[i]) ? 1 : 0);

[=](Vec256<uint8_t> a, Vec256<uint8_t> b) {
Vec256<uint8_t> c = Vec256<uint8_t>();
for (int i = 0; i != Vec256<uint8_t>::size(); i++) {
c[i] = a[i] || b[i];
Contributor

Avoid implicit cast with:
c[i] = ((a[i] || b[i]) ? 1 : 0);

@kshitij12345
Collaborator Author

@heitorschueroff Done.

Contributor

@facebook-github-bot facebook-github-bot left a comment

@heitorschueroff has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@heitorschueroff merged this pull request in 6575e67.


@kshitij12345 kshitij12345 deleted the support/all-any/all-dtypes branch November 11, 2020 03:19
@kshitij12345
Collaborator Author

@JackCaoG Please merge the XLA fix. Thanks!

@ngimel
Collaborator

ngimel commented Nov 11, 2020

@kshitij12345 there are a few problems with this PR

  1. As noted already, it breaks XLA CI. @heitorschueroff please avoid landing things before the XLA fix is merged.
  2. It produces results of the same dtype as the input, which is different from numpy:

In [9]: a = torch.randn(3, 3)
In [10]: anp = a.numpy()
In [11]: np.all(anp, 0)
Out[11]: array([ True,  True,  True])
In [12]: a.all(0)
Out[12]: tensor([1., 1., 1.])

  3. It introduces a performance cliff for non-bool input types, because it does not use binary_kernel_reduce_vec and instead uses binary_kernel_reduce, which is much (5-10x) slower.

Can you please work on fixing 2) and 3)?
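
For reference, a minimal sketch (not from the PR, assuming a build that includes it) that makes the dtype mismatch in point 2 visible: numpy always reports bool, while the torch result dtype tracks the input:

```python
import numpy as np
import torch

for dtype in (torch.bool, torch.uint8, torch.float32):
    x = torch.ones(3, 3, dtype=dtype)
    # np.all always returns a bool array; torch.all (as of this PR)
    # returns a tensor with the same dtype as the input.
    print(dtype, x.all(0).dtype, np.all(x.numpy(), 0).dtype)
```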

@JackCaoG
Collaborator

> @kshitij12345 there are a few problems with this PR … Can you please work on fixing 2) and 3)?

I merged the XLA change. If this PR is reverted, I will revert the XLA PR as well. Otherwise I will work on a companion PR to fix the result type.

@gchanan
Contributor

gchanan commented Nov 11, 2020

I'd look at the logical_or and logical_and kernels for how to get the dtypes correct; note that it's a little different because of the performance issue @ngimel mentioned above.
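
For context (a hedged illustration, not code from this PR), the dtype behavior those kernels already implement is visible from Python: torch.logical_and and torch.logical_or return a bool tensor regardless of the input dtype, which is the target behavior for all/any:

```python
import torch

a = torch.tensor([1, 0, 3], dtype=torch.uint8)
b = torch.tensor([1, 1, 0], dtype=torch.uint8)

# Both return bool tensors even though the inputs are uint8.
print(torch.logical_and(a, b))  # tensor([ True, False, False])
print(torch.logical_or(a, b))   # tensor([ True,  True,  True])
```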

@ngimel
Collaborator

ngimel commented Nov 11, 2020

Also please don't forget to update documentation.

@kshitij12345
Collaborator Author

@ngimel
For 2, it would be BC-breaking to change the result type to bool.
I think it would be better to add a warning first; otherwise it may break existing user code. Let me know what you think (a hypothetical sketch of such a warning is shown after the example below).

Behaviour in previous version (1.5.1)

>>> import torch
>>> torch.__version__
'1.5.1+cu101'
>>> x = torch.zeros(3,3)
>>> x.to(torch.uint8).all()
tensor(0, dtype=torch.uint8)
>>> x.to(torch.bool).all()
tensor(False)
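
Purely as an illustration (hypothetical, not from this PR), a Python-level sketch of what such a transitional warning could look like; the wrapper name `_all_with_warning` is made up for the example:

```python
import warnings
import torch

def _all_with_warning(x, *args, **kwargs):
    # Hypothetical shim: warn before the result dtype changes, since
    # uint8 inputs currently produce a uint8 result.
    if x.dtype == torch.uint8:
        warnings.warn(
            "torch.all on uint8 tensors will return a bool tensor in a "
            "future release instead of a uint8 tensor.",
            UserWarning,
        )
    return torch.all(x, *args, **kwargs)

print(_all_with_warning(torch.zeros(3, 3, dtype=torch.uint8)))
```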

@ngimel
Collaborator

ngimel commented Nov 12, 2020

cc @mruberry for deprecating the return type for uint8. In any case, for all other types there are no BC-breaking concerns, so we should implement the correct behavior.

@mruberry
Collaborator

I would try to update the uint8 behavior to be consistent and document the change as BC-breaking. If a scripted network relies on the current behavior (extremely unlikely) we can write an upgrader.

facebook-github-bot pushed a commit that referenced this pull request Dec 15, 2020
Summary:
BC-breaking note:

This PR changes the behavior of the any and all functions to always return a bool tensor. Previously these functions were only defined on bool and uint8 tensors, and when called on uint8 tensors they would also return a uint8 tensor. (When called on a bool tensor they would return a bool tensor.)

PR summary:

#44790 (comment)

Fixes 2 and 3

Also Fixes #48352

Changes
* Output dtype is always `bool` (consistent with numpy). **BC-breaking** (previously the output matched the input dtype)
* Uses vectorized version for all dtypes on CPU
* Enables test for complex
* Update doc for `torch.all` and `torch.any`

TODO
* [x] Update docs
* [x] Benchmark
* [x] Raise issue on XLA

Pull Request resolved: #47878

Reviewed By: H-Huang

Differential Revision: D25421263

Pulled By: mruberry

fbshipit-source-id: c6c681ef94004d2bcc787be61a72aa059b333e69
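
As a quick illustration (a hedged sketch, assuming a PyTorch build that includes #47878), the behavior described in the BC-breaking note looks like this from Python:

```python
import torch

x = torch.tensor([1, 2, 0], dtype=torch.uint8)

# Previously these returned uint8 tensors; with the change they are bool.
print(x.all())  # tensor(False)
print(x.any())  # tensor(True)

# Other dtypes are supported as well and also reduce to bool.
print(torch.randn(3, 3).all(0).dtype)  # torch.bool
```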
hwangdeyu pushed a commit to hwangdeyu/pytorch that referenced this pull request Jan 6, 2021
Summary:
BC-breaking note:

This PR changes the behavior of the any and all functions to always return a bool tensor. Previously these functions were only defined on bool and uint8 tensors, and when called on uint8 tensors they would also return a uint8 tensor. (When called on a bool tensor they would return a bool tensor.)

PR summary:

pytorch#44790 (comment)

Fixes 2 and 3

Also Fixes pytorch#48352

Changes
* Output dtype is always `bool` (consistent with numpy). **BC-breaking** (previously the output matched the input dtype)
* Uses vectorized version for all dtypes on CPU
* Enables test for complex
* Update doc for `torch.all` and `torch.any`

TODO
* [x] Update docs
* [x] Benchmark
* [x] Raise issue on XLA

Pull Request resolved: pytorch#47878

Reviewed By: H-Huang

Differential Revision: D25421263

Pulled By: mruberry

fbshipit-source-id: c6c681ef94004d2bcc787be61a72aa059b333e69
facebook-github-bot pushed a commit that referenced this pull request Jan 8, 2021
Summary:
BC-breaking note:

This PR changes the behavior of the any and all functions to always return a bool tensor. Previously these functions were only defined on bool and uint8 tensors, and when called on uint8 tensors they would also return a uint8 tensor. (When called on a bool tensor they would return a bool tensor.)

PR summary:

#44790 (comment)

Fixes 2 and 3

Also Fixes #48352

Changes
* Output dtype is always `bool` (consistent with numpy). **BC-breaking** (previously the output matched the input dtype)
* Uses vectorized version for all dtypes on CPU
* Enables test for complex
* Update doc for `torch.all` and `torch.any`

TODO
* [x] Update docs
* [x] Benchmark
* [x] Raise issue on XLA

Pull Request resolved: #47878

Reviewed By: albanD

Differential Revision: D25714324

Pulled By: mruberry

fbshipit-source-id: a87345f725297524242d69402dfe53060521ea5d
hwangdeyu pushed a commit to hwangdeyu/pytorch that referenced this pull request Jan 14, 2021
Summary:
BC-breaking note:

This PR changes the behavior of the any and all functions to always return a bool tensor. Previously these functions were only defined on bool and uint8 tensors, and when called on uint8 tensors they would also return a uint8 tensor. (When called on a bool tensor they would return a bool tensor.)

PR summary:

pytorch#44790 (comment)

Fixes 2 and 3

Also Fixes pytorch#48352

Changes
* Output dtype is always `bool` (consistent with numpy). **BC-breaking** (previously the output matched the input dtype)
* Uses vectorized version for all dtypes on CPU
* Enables test for complex
* Update doc for `torch.all` and `torch.any`

TODO
* [x] Update docs
* [x] Benchmark
* [x] Raise issue on XLA

Pull Request resolved: pytorch#47878

Reviewed By: albanD

Differential Revision: D25714324

Pulled By: mruberry

fbshipit-source-id: a87345f725297524242d69402dfe53060521ea5d