Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize Platform + Restructure #3

Draft
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

AndreSlavescu
Copy link

@AndreSlavescu AndreSlavescu commented Nov 12, 2023

  • Restructure Makefile (automate detection of compute capability)
  • Optimize existing kernels

@jvhs0706 jvhs0706 self-requested a review November 13, 2023 16:57
@jvhs0706 jvhs0706 self-assigned this Nov 13, 2023
@jvhs0706
Copy link
Collaborator

Hi Andre, your optimizations seem useful! Do you want us to merge now, or do you want to do that after you further optimize existing kernels?

@AndreSlavescu
Copy link
Author

Hi Andre, your optimizations seem useful! Do you want us to merge now, or do you want to do that after you further optimize existing kernels?

Hi Haochen,

I will be working on optimizing the kernels as well. For now the changes are for automatically detecting compute capability which eliminates the need for filling that in manually.

I am also trying to discover what CUDA versions work with this setup, because I was having problems with 11.7+

For now, I will just leave this as a draft PR and promptly make the updates to speedup the kernels.

@AndreSlavescu AndreSlavescu marked this pull request as draft November 13, 2023 17:07
@hongyanz
Copy link
Contributor

hongyanz commented Jan 6, 2024

@AndreSlavescu Hi Andre, can you contact me by my email?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants