-
-
Notifications
You must be signed in to change notification settings - Fork 65
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add cuBLAS or cuDNN support #1
Comments
from what i've seen is that cudnn is great for RNN and CONVs, but i would have to dive deeper into the docs to understand their api |
but we could start with cublas, as it is most likely easier Edit: I'll have a look into this |
I checked out cuDNN; there are so many functions! can you maybe give me a list of relevant functions? since cuDNN requires very "special" settings, cuDNN would most likely have to be a seperate device/have a different kind of tensor? This is my cudnn branch: https://github.com/M1ngXU/cudarc/tree/cudnn |
I really like the idea of cuDNN being a different device, that definitely aligns with the device refactor I'm planning for dfdx. Thanks for sharing your code that'll def be useful |
cublas support was added. will add another issue for cudnn |
* #1 Adding par_launch_async * Changing to using CudaStream * Adding additional details to docstring * Adding unit tests * Adding wait_event on new stream creation
cuBLAS only has matmuls and vec mat muls, so no convs unless a im2col is also added. cuDNN definitely has convs and a couple other nice things, but unsure how matmuls work upon first look
The text was updated successfully, but these errors were encountered: