Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement Gemm<bf16> for CudaBlas. #167

Merged
merged 1 commit into from
Jul 7, 2023

Conversation

LaurentMazare
Copy link
Contributor

(first thanks for this neat crate, it's very helpful)
This PR adds the Gemm trait for bf16 so that cublas can easily be used on this type, the code is very similar to the f16 variant but with the cuda types adapted to bf16.
The gemm f16 test has also been adapted to test this variant too, all this being behind the f16 feature flag.

@coreylowman
Copy link
Owner

Awesome, thanks!

@coreylowman coreylowman merged commit 5434e2b into coreylowman:main Jul 7, 2023
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants