
Fix several test_ops cuda dtypes tests #60922

Closed
wants to merge 6 commits

Conversation

xwang233 (Collaborator)

Close #60443

facebook-github-bot (Contributor) commented Jun 29, 2021

💊 CI failures summary and remediations

As of commit 05fca2e (more details on the Dr. CI page and at hud.pytorch.org/pr/60922):


None of the CI failures appear to be your fault 💚



🚧 1 fixed upstream failure:

These were probably caused by upstream breakages that were already fixed.

Please rebase on the viable/strict branch:

If your commit is older than viable/strict, run these commands:

git fetch https://github.com/pytorch/pytorch viable/strict
git rebase FETCH_HEAD


@jbschlosser added the triaged label (this issue has been looked at by a team member, and triaged and prioritized into an appropriate module) on Jun 29, 2021
@@ -6038,7 +6038,8 @@ def gradcheck_wrapper_triangular_input(op, input, *args, upper=False, **kwargs):
 dtypesIfCPU=all_types_and_complex(),
 dtypesIfCUDA=floating_and_complex_types_and(torch.float16, *[torch.bfloat16] if CUDA11OrLater else []),
 dtypesIfROCM=floating_types_and(torch.half, torch.bfloat16),
-backward_dtypesIfCUDA=floating_and_complex_types_and(torch.float16),
+backward_dtypesIfCUDA=floating_and_complex_types_and(torch.float16,
+                                                     *[torch.bfloat16] if SM60OrLater else []),
Collaborator:

I don't think the sm_60 check would be sufficient, as bfloat16 should have been introduced in CUDA 11 (https://docs.nvidia.com/cuda/archive/11.0/cuda-toolkit-release-notes/index.html#cuda-general-new-features), so this could fail on CUDA 10 and earlier releases. Wouldn't CUDA11OrLater, as done for the forward pass, also work here?
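For reference, a minimal sketch of how toolkit and compute-capability gates like CUDA11OrLater and SM60OrLater can be computed; this is an illustration only, not the exact definitions used in PyTorch's test internals:

import torch

# Toolkit gate: PyTorch was built against CUDA 11.x or newer.
# torch.version.cuda is a string such as "11.1", or None for CPU-only builds.
CUDA11OrLater = (torch.version.cuda is not None
                 and int(torch.version.cuda.split('.')[0]) >= 11)

# Device gate: the current GPU reports compute capability >= 6.0 (sm_60, Pascal).
SM60OrLater = (torch.cuda.is_available()
               and torch.cuda.get_device_capability() >= (6, 0))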


xwang233 (Collaborator, Author):

We may use something like (CUDA11OrLater and SM60OrLater).

Collaborator:

Did you want to try that change, @xwang233?

Collaborator:

The skip on line 6079 should be removed once the dtypes are corrected
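For context, a hypothetical sketch of the kind of per-test skip entry that becomes unnecessary once the backward dtypes are listed correctly; the actual entry at that line in common_methods_invocations.py may differ, and the SkipInfo / TestCommon / test_dtypes names are assumptions based on that era's test naming:

# Hypothetical OpInfo skip entry; the real entry being removed may look different.
skips=(
    # test_dtypes fails while backward_dtypesIfCUDA is inaccurate for bfloat16
    SkipInfo('TestCommon', 'test_dtypes'),
),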

mruberry (Collaborator):

Added the ci/master label so this tests against more configs. I also suspect this is a combination of CUDA11OrLater + a particular SM.

@@ -6545,7 +6546,8 @@ def gradcheck_wrapper_triangular_input(op, input, *args, upper=False, **kwargs):
 dtypesIfCPU=all_types_and_complex(),
 dtypesIfCUDA=floating_types_and(torch.float16, *[torch.bfloat16] if CUDA11OrLater else [],
                                 torch.complex64, torch.complex128),
-backward_dtypesIfCUDA=floating_types_and(torch.float16, torch.complex64, torch.complex128),
+backward_dtypesIfCUDA=floating_types_and(torch.float16, *[torch.bfloat16] if SM60OrLater else [],
+                                         torch.complex64, torch.complex128),
Collaborator:
The skip on line 6557 needs to be removed

@@ -6904,7 +6906,7 @@ def gradcheck_wrapper_triangular_input(op, input, *args, upper=False, **kwargs):
 op=lambda tensors, equation: torch.einsum(equation, tensors),
 dtypes=all_types_and_complex_and(torch.half, torch.bfloat16),
 dtypesIfCUDA=floating_and_complex_types_and(torch.half, *[torch.bfloat16] if CUDA11OrLater else []),
-backward_dtypesIfCUDA=floating_and_complex_types_and(torch.half),
+backward_dtypesIfCUDA=floating_and_complex_types_and(torch.half, *[torch.bfloat16] if SM60OrLater else []),
Collaborator:
The skip on line 6915 needs to be removed
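As a hedged illustration of what the new backward entry for einsum above is meant to cover (assuming a CUDA 11 build and an sm_60+ GPU, matching the gates discussed earlier), bfloat16 einsum with a backward pass looks like this:

import torch

if torch.cuda.is_available():
    a = torch.randn(3, 4, device='cuda', dtype=torch.bfloat16, requires_grad=True)
    b = torch.randn(4, 5, device='cuda', dtype=torch.bfloat16, requires_grad=True)
    out = torch.einsum('ij,jk->ik', a, b)  # forward in bfloat16
    out.sum().backward()                   # backward also runs in bfloat16
    print(a.grad.dtype)                    # torch.bfloat16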

@@ -6069,13 +6069,11 @@ def gradcheck_wrapper_triangular_input(op, input, *args, upper=False, **kwargs):
 dtypesIfCPU=all_types_and_complex(),
 dtypesIfCUDA=floating_and_complex_types_and(torch.float16, *[torch.bfloat16] if CUDA11OrLater else []),
 dtypesIfROCM=floating_types_and(torch.half, torch.bfloat16),
-backward_dtypesIfCUDA=floating_and_complex_types_and(torch.float16),
+backward_dtypesIfCUDA=floating_and_complex_types_and(torch.float16,
+                                                     *[torch.bfloat16] if (SM60OrLater and CUDA11OrLater) else []),
xwang233 (Collaborator, Author):

Ping @mruberry. Seems like the CI is happy with the new flags.

mruberry (Collaborator) left a comment:

Thanks @xwang233!

facebook-github-bot (Contributor):

@mruberry has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot (Contributor):

@mruberry merged this pull request in c966ce6.

Labels: cla signed, Merged, open source, triaged

Successfully merging this pull request may close these issues:

TestCommonCUDA.test_dtypes_matmul_cuda fails

6 participants