Add check for 0 to 1 inclusive for elements of target tensor in BCE loss #97814
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/97814
Note: links to docs will display an error until the docs builds have been completed.
❗ 1 active SEV. If your PR is affected, please view it below.
✅ No failures as of commit aebb9ac.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Thanks for the fix!
Actually, I see several nn tests failing - it looks like some of our tests pass target values > 1 through binary_cross_entropy. cc @mikaylagawarecki? (Let me know if I should cc someone else.)
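To make that failure mode concrete, here is a minimal sketch; the exact error message is an assumption about the wording of the check this PR adds:

```python
import torch
import torch.nn.functional as F

pred = torch.rand(3)                    # predictions already in [0, 1]
target = torch.tensor([0.0, 0.5, 2.0])  # 2.0 falls outside [0, 1]

# With the range check this PR adds, the call below should raise a
# RuntimeError instead of silently returning a meaningless loss value.
try:
    F.binary_cross_entropy(pred, target)
except RuntimeError as e:
    print(e)
```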
@bdhirsh thanks for the review! I was also wondering about those tests but haven't had time to circle back (and wasn't sure who to ping 🙂). Looking forward to hearing thoughts on how/if we want to handle those!
Hey @kiersten-stokes, thanks for sending this fix!
The test failures arise from tests where target was erroneously not constrained to [0, 1]. For most of the target outputs, we ensure this using torch.randn(...).gt(0).double(), but it was missed in these two tests (here and here). Fixing those should resolve the errors; see the sketch of the pattern below.
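As an illustration (the shape is arbitrary), the valid-target pattern from the tests looks like this:

```python
import torch

# Threshold Gaussian noise at zero to get a tensor of exactly 0s and 1s,
# which is always a valid binary_cross_entropy target.
target = torch.randn(15, 10).gt(0).double()
assert target.min() >= 0 and target.max() <= 1
```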
We should also add the check on CUDA; a hedged sketch of what it enforces follows below.
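For context, a hypothetical Python-level analogue of the validation (the real check lives in the kernels and must run for CUDA tensors too; the function name and message here are illustrative):

```python
import torch

def check_bce_target(target: torch.Tensor) -> None:
    # Hypothetical mirror of the range check: every element of target must
    # lie in [0, 1] inclusive, regardless of device (CPU or CUDA).
    if torch.any((target < 0) | (target > 1)):
        raise RuntimeError("all elements of target should be between 0 and 1")
```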
@mikaylagawarecki I really appreciate you pointing me to the relevant tests! It seems embarrassingly simple now that I see it - I guess I'm still learning how to comb through the logs efficiently 🙈
Done! Will stand by for CI failures (apologies, I have no way to run CUDA-based tests at the moment) and circle back should there be any failures similar to the above!
@kiersten-stokes Of course, happy to help! Looking at the logs, it looks like I might have missed another one where target is not correctly constrained: https://github.com/pytorch/pytorch/blob/master/test/test_nn.py#L8923. It's hard to tell if there are more because CI jobs terminate when a test fails :( But you could try running the tests on CPU. Let me know if you have any questions about local testing!
Thanks!
@pytorchbot merge
Merge started: your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
@mikaylagawarecki thanks again for your help!
Merge failed. Reason: 1 job failed: trunk / linux-focal-rocm5.4.2-py3.8 / test (default, 1, 3, linux.rocm.gpu). Details for Dev Infra team: raised by workflow job.
50f589c to a5e64ba (Compare)
@pytorchbot merge
Merge started: your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Add check for 0 to 1 inclusive for elements of target tensor in BCE loss (#97814)
TODO for @mikaylagawarecki: add BC-breaking description
Fixes #87373
Pull Request resolved: #97814
Approved by: https://github.com/mikaylagawarecki
Fixes #87373
cc @albanD @mruberry @jbschlosser @walterddr @mikaylagawarecki