Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement cuda kernels for binary ops #334

Closed
8 tasks done
Tracked by #278
coreylowman opened this issue Jan 7, 2023 · 2 comments
Closed
8 tasks done
Tracked by #278

Implement cuda kernels for binary ops #334

coreylowman opened this issue Jan 7, 2023 · 2 comments
Labels
gpu Related to GPU support

Comments

@coreylowman
Copy link
Owner

coreylowman commented Jan 7, 2023

These can copy the setup for div cuda kernel, which is already implemented.

  • add
  • bce
  • div
  • huber_error
  • maximum
  • minimum
  • mul
  • sub
@coreylowman coreylowman added the gpu Related to GPU support label Jan 7, 2023
@coreylowman coreylowman changed the title Implement add/sub/mul cuda kernels Implement add/sub/mul/minimum/maximum cuda kernels Jan 7, 2023
@coreylowman coreylowman changed the title Implement add/sub/mul/minimum/maximum cuda kernels Implement cuda kernels for binary ops Jan 7, 2023
@coreylowman coreylowman mentioned this issue Jan 7, 2023
47 tasks
@M1ngXU
Copy link
Contributor

M1ngXU commented Jan 9, 2023

add, min, max, mul exist in cudnn

coreylowman pushed a commit that referenced this issue Jan 10, 2023
… and #334 (#346)

* Add cuda implementations for unary and binary tensor operations

* Add cuda kernel for powi; Use fewer 64-bit functions

* use copysign in abs kernal, as suggested in #341 (comment)
@coreylowman
Copy link
Owner Author

Thanks @nkoppel

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
gpu Related to GPU support
Projects
None yet
Development

No branches or pull requests

2 participants