Enable BFloat16 for `logaddexp` & `logaddexp2` on CUDA #57908
Conversation
💊 **CI failures summary and remediations**

As of commit 700d612 (more details on the Dr. CI page):

💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI.
**Compilation fails on Windows**

Hey @zasdfgbnm, while enabling BFloat16 for `logaddexp` & `logaddexp2`, compilation fails on Windows. Can you please help fix this? Thank you!
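For reference, a minimal sketch of the pattern that appears to break the Windows build, assuming the kernel calls `::isinf` directly on the `BFloat16` operand (the helper name here is hypothetical):

```cpp
#include <c10/util/BFloat16.h>

// Hypothetical reduction of the failure: at::BFloat16 has no direct ::isinf
// overload, so MSVC rejects (or ambiguously resolves) this call when
// scalar_t = at::BFloat16, even though gcc/clang builds accept it.
template <typename scalar_t>
__device__ bool check_inf(scalar_t a) {
  return ::isinf(a); // breaks on Windows for scalar_t = at::BFloat16
}
```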
Let me try something.
I think `::isinf` doesn't resolve for `BFloat16` on MSVC, and you can work around this by casting to the accumulate type first:

```cpp
#include <ATen/AccumulateType.h>

using accscalar_t = at::acc_type<scalar_t, /*is_cuda=*/true>;
::isinf(static_cast<accscalar_t>(a));
```

This workaround is already being used in https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/native/cuda/IGammaKernel.cu#L403
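For context, here is a sketch of how that workaround could slot into the `logaddexp` CUDA kernel; the surrounding structure is an assumption modeled on other ATen binary kernels, not a quote of this PR's diff:

```cpp
#include <ATen/AccumulateType.h>
#include <ATen/Dispatch.h>
#include <ATen/native/cuda/Loops.cuh>

void logaddexp_kernel_cuda(at::TensorIteratorBase& iter) {
  AT_DISPATCH_FLOATING_TYPES_AND2(
      at::ScalarType::BFloat16, at::ScalarType::Half,
      iter.dtype(), "logaddexp_cuda", [&]() {
        // Cast to the accumulate type up front so ::isinf (and the math
        // below) sees float/double, sidestepping the missing BFloat16
        // overload on Windows.
        using accscalar_t = at::acc_type<scalar_t, /*is_cuda=*/true>;
        at::native::gpu_kernel(
            iter, [] GPU_LAMBDA(scalar_t a_, scalar_t b_) -> scalar_t {
              const auto a = static_cast<accscalar_t>(a_);
              const auto b = static_cast<accscalar_t>(b_);
              if (::isinf(a) && a == b) {
                return a; // avoid inf - inf = nan in the branch below
              }
              const auto m = ::max(a, b);
              return m + ::log1p(::exp(-::abs(a - b)));
            });
      });
}
```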
**Codecov Report**

```
@@            Coverage Diff            @@
##           master   #57908     +/-   ##
=========================================
  Coverage   76.83%   76.83%
=========================================
  Files        1984     1984
  Lines      197144   197144
=========================================
+ Hits       151471   151480         +9
+ Misses      45673    45664         -9
```
@ngimel has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Thanks!
Summary: Enabled BFloat16 for `logaddexp` & `logaddexp2` on CUDA, with a [workaround](pytorch#57908 (comment)) suggested by @zasdfgbnm.

Pull Request resolved: pytorch#57908
Reviewed By: mruberry
Differential Revision: D28344976
Pulled By: ngimel
fbshipit-source-id: edef654b5819b236fbd9996f962115beb6e147e1
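With this merged, a quick smoke test in C++ might look like the following; it assumes a CUDA-enabled libtorch build (before this change, the BFloat16 dispatch on CUDA would raise a "not implemented" error):

```cpp
#include <torch/torch.h>
#include <iostream>

int main() {
  auto opts = torch::TensorOptions()
                  .device(torch::kCUDA)
                  .dtype(torch::kBFloat16);
  auto a = torch::randn({4}, opts);
  auto b = torch::randn({4}, opts);
  // Both ops now dispatch for BFloat16 on CUDA.
  std::cout << torch::logaddexp(a, b) << "\n";
  std::cout << torch::logaddexp2(a, b) << "\n";
}
```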