[BE] unsupported backward failing on single sample #59455
Conversation
💊 CI failures summary and remediations: as of commit 53dcda9, ci.pytorch.org reported 1 failure (more details on the Dr. CI page). This comment was automatically generated by Dr. CI.
Force-pushed 88d5212 to 51f4595 (compare)
@mruberry this is another idea to enable the unsupported backward test. Please kindly take a look. I think this is a better approach and should be easier to get merged.
test/test_ops.py
Outdated
This will probably require a slight tweak to handle functions like to_sparse(), which return a sparse tensor, but it should be a simple update after #59445 is in.
#59445 is merged and I've rebased over it. Since it doesn't introduce additional failures, I am planning to merge this and then fix anything needed to avoid further merge conflicts.
This doesn't test to_sparse: x.to_sparse().sum().backward() raises a RuntimeError on .sum() and thus passes this test, but that's not what's supposed to be tested.
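The masking problem described above is generic: a test that merely asserts "an exception is raised somewhere in the forward/backward chain" can pass because a downstream op fails before the op under test is ever exercised. A minimal, PyTorch-free sketch of the pitfall (all function names here are hypothetical stand-ins, not PyTorch APIs):

```python
def chain_raises(*fns):
    """Return True if running fns in sequence raises a RuntimeError anywhere."""
    try:
        x = None
        for fn in fns:
            x = fn(x)
        return False
    except RuntimeError:
        return True

# Op under test: its backward support is what we actually want to verify.
def to_sparse_like(x):
    return "sparse"

# Downstream op that itself fails on sparse input -- this masks the real check.
def sum_like(x):
    raise RuntimeError("sum not implemented for sparse input")

# The assertion passes, but the failure came from sum_like, not from the
# backward of to_sparse_like -- the op under test was never reached.
assert chain_raises(to_sparse_like, sum_like)
```

This is why the reviewer notes the test "passes" without testing what it should: the expected-failure assertion cannot distinguish which op in the chain raised.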
Good point. Will create a follow-up issue along with the rest of the tweaks needed.
This is a surprising change
yup. surprising to me too
This is a great test and the idea seems correct. For the backward, I think this should call [...]. There are also a few test issues that still need to be addressed, like test_unsupported_backward_einsum_cuda_bfloat16 and some ROCm tests. Testing against the "master" builds, too, is definitely the right idea. I erroneously added "all" -- my mistake.
fixing tests and adding skips
fix test issues
Force-pushed c20f66d to beaee2b (compare)
Force-pushed beaee2b to 53dcda9 (compare)
Codecov Report
@@ Coverage Diff @@
## master #59455 +/- ##
=======================================
Coverage 76.43% 76.43%
=======================================
Files 2038 2038
Lines 203064 203064
=======================================
+ Hits 155217 155218 +1
+ Misses 47847 47846 -1
Rebased, and it looks like it is working well (the ROCm failure seems irrelevant).
Will try to merge it and monitor HUD. If any failure occurs, I guess the best option is to forward-fix by skipping tests.
@walterddr has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Cool! Let's try to sneak this in
@walterddr merged this pull request in 26beda8.
Summary: Echoing pytorch#58260 (comment): similar to `test_unsupported_dtype`, which only checks that an exception is raised on the first sample, we should do the same for unsupported backward. The goal of both tests is to remind developers to 1. add a new dtype to the supported list if all samples run without failure, and 2. replace the skip mechanism, which indefinitely ignores tests without warning.

Pull Request resolved: pytorch#59455

Test Plan: CI.

Reviewed By: mruberry

Differential Revision: D28927169

Pulled By: walterddr

fbshipit-source-id: 2993649fc17a925fa331e27c8ccdd9b24dd22c20
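The single-sample pattern the summary describes can be sketched without PyTorch. The idea: assert the expected failure on the first sample only, so that if backward starts succeeding, the test fails loudly and prompts a support-list update instead of being skipped indefinitely. All names below are hypothetical illustrations, not the actual test_ops.py implementation:

```python
def check_unsupported_backward(samples, run_backward):
    """Assert that backward fails on the FIRST sample only.

    If backward unexpectedly succeeds, the dtype may now be supported
    and should be added to the supported list rather than left here.
    """
    first = next(iter(samples))
    try:
        run_backward(first)
    except RuntimeError:
        return  # expected: backward is unsupported for this dtype
    raise AssertionError(
        "backward unexpectedly succeeded; add this dtype to the supported list?"
    )

# Stand-in for an op whose backward is genuinely unsupported:
def fake_backward(sample):
    raise RuntimeError("backward not implemented for this dtype")

check_unsupported_backward([1, 2, 3], fake_backward)  # passes: failure expected
```

Checking only the first sample keeps the test cheap while still catching the interesting transition: the moment an op's backward stops raising, the developer is forced to update the dtype support list.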