Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add cuBLAS or cuDNN support #1

Closed
coreylowman opened this issue Sep 17, 2022 · 5 comments
Closed

Add cuBLAS or cuDNN support #1

coreylowman opened this issue Sep 17, 2022 · 5 comments

Comments

@coreylowman
Copy link
Owner

coreylowman commented Sep 17, 2022

cuBLAS only has matmuls and vec mat muls, so no convs unless a im2col is also added. cuDNN definitely has convs and a couple other nice things, but unsure how matmuls work upon first look

@coreylowman coreylowman changed the title Add cublas Add cuBLAS or cuDNN support Sep 28, 2022
@M1ngXU
Copy link
Contributor

M1ngXU commented Nov 2, 2022

from what i've seen is that cudnn is great for RNN and CONVs, but i would have to dive deeper into the docs to understand their api

@M1ngXU
Copy link
Contributor

M1ngXU commented Nov 3, 2022

but we could start with cublas, as it is most likely easier

Edit: I'll have a look into this

@M1ngXU
Copy link
Contributor

M1ngXU commented Nov 5, 2022

I checked out cuDNN; there are so many functions! can you maybe give me a list of relevant functions? since cuDNN requires very "special" settings, cuDNN would most likely have to be a seperate device/have a different kind of tensor?

This is my cudnn branch: https://github.com/M1ngXU/cudarc/tree/cudnn

@coreylowman
Copy link
Owner Author

I really like the idea of cuDNN being a different device, that definitely aligns with the device refactor I'm planning for dfdx. Thanks for sharing your code that'll def be useful

@coreylowman
Copy link
Owner Author

cublas support was added. will add another issue for cudnn

coreylowman added a commit that referenced this issue Feb 26, 2023
coreylowman added a commit that referenced this issue Feb 27, 2023
* #1 Adding par_launch_async

* Changing to using CudaStream

* Adding additional details to docstring

* Adding unit tests

* Adding wait_event on new stream creation
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants