
Conversation

@swolchok (Contributor):
Partial fix for #7748.

[ghstack-poisoned]
@swolchok (Contributor, Author) commented on Jan 21, 2025:

Stack from ghstack (oldest at bottom):

@pytorch-bot (bot) commented on Jan 21, 2025:

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7807

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 11f4d2d with merge base 466d98f:

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label (this label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Jan 21, 2025
swolchok added a commit that referenced this pull request Jan 21, 2025
Partial fix for #7748.

ghstack-source-id: f51e6f2
ghstack-comment-id: 2605752234
Pull Request resolved: #7807
@swolchok added the release notes: ops & kernels label (changes to the opset and any new / changed kernel implementations) on Jan 21, 2025
auto expected_grad_weight = tf.make({4, 3, 4, 2}, expected_grad_weight_data);
auto expected_grad_bias = tf.make({4}, expected_grad_bias_data);
if (DTYPE == ScalarType::Half || DTYPE == ScalarType::BFloat16) {
EXPECT_TENSOR_CLOSE_WITH_TOL(grad_input, expected_grad_input, 1e-2, 1e-8);
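
For context, the hunk shown here is only part of the test; presumably the non-reduced-precision dtypes fall back to the default comparison. A plausible sketch of that shape (the else branch below is an assumption, not the verbatim diff):

```cpp
// Sketch only (assumed shape of the surrounding test, not the verbatim diff):
// reduced-precision dtypes get explicitly loosened tolerances; other dtypes
// fall back to the default comparison.
if (DTYPE == ScalarType::Half || DTYPE == ScalarType::BFloat16) {
  EXPECT_TENSOR_CLOSE_WITH_TOL(grad_input, expected_grad_input, 1e-2, 1e-8);
} else {
  EXPECT_TENSOR_CLOSE(grad_input, expected_grad_input);
}
```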
A reviewer (Contributor) commented:
Why not use defaults here? EXPECT_TENSOR_CLOSE_WITH_TOL should apply the right tolerance given the type

@swolchok (Contributor, Author) replied:
because the default rtol is 1e-5; rtol and atol are different
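
To make the distinction concrete: an absolute tolerance is a fixed slack, while a relative tolerance scales with the magnitude of the expected value, and the usual elementwise check combines both. A rough illustration of that allclose-style condition (not the actual EXPECT_TENSOR_CLOSE_WITH_TOL implementation):

```cpp
#include <cmath>
#include <cstddef>

// Illustrative only: an allclose-style elementwise check combining a relative
// tolerance (rtol) and an absolute tolerance (atol). The real macro in the
// executorch test utilities may differ in details (NaN handling, reporting).
bool roughly_close(const float* actual, const float* expected, size_t n,
                   double rtol, double atol) {
  for (size_t i = 0; i < n; ++i) {
    const double diff = std::fabs(double(actual[i]) - double(expected[i]));
    const double bound = atol + rtol * std::fabs(double(expected[i]));
    if (!(diff <= bound)) {
      return false;  // element i is outside the combined tolerance
    }
  }
  return true;
}
```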

The reviewer (Contributor) replied:
Right, but in the same way that we have kDefaultHalfAtol and kDefaultBFloat16Atol, I think we should have kDefaultHalfRtol and kDefaultBFloat16Rtol and set them to proper values.
You seem to be using 1e-2 for most of these tests. Why not introduce kDefaultHalfRtol and kDefaultBFloat16Rtol with value 1e-2?
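
For illustration, the proposal would amount to something like the following sketch. The constant names and the 1e-2 value come from this thread; the helper function and its use of the ScalarType enum referenced in the tests above are assumptions, and none of this exists in the tree:

```cpp
// Hypothetical defaults mirroring kDefaultHalfAtol / kDefaultBFloat16Atol;
// 1e-2 is the value suggested in this review thread.
constexpr double kDefaultHalfRtol = 1e-2;
constexpr double kDefaultBFloat16Rtol = 1e-2;

// A call site could then pick the rtol from the dtype instead of hard-coding
// literals in each test. ScalarType here is the dtype enum used in the tests
// above; the 1e-5 fallback is the default rtol mentioned earlier in the thread.
double default_rtol_for(ScalarType dtype) {
  switch (dtype) {
    case ScalarType::Half:
      return kDefaultHalfRtol;
    case ScalarType::BFloat16:
      return kDefaultBFloat16Rtol;
    default:
      return 1e-5;
  }
}
```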

@swolchok (Contributor, Author) replied:
> Why not introduce kDefaultHalfRtol and kDefaultBFloat16Rtol with value 1e-2?

Because not all operators require the higher rtol.

@swolchok (Contributor, Author) added:
It is not particularly uncommon to need to set rtol in pytorch core: https://github.com/search?q=repo%3Apytorch%2Fpytorch+%2Frtol%3D%5B1-9%5D%2F&type=code

@swolchok merged commit dabd72f into main on Jan 23, 2025
44 of 47 checks passed
@swolchok deleted the gh/swolchok/158/head branch on January 23, 2025 at 17:40
YIWENX14 pushed a commit that referenced this pull request Jan 28, 2025
zonglinpeng pushed a commit to zonglinpeng/executorch that referenced this pull request Jan 30, 2025