Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate storing matrix in registers #53

Closed
fancyIX opened this issue Sep 13, 2018 · 1 comment
Closed

Investigate storing matrix in registers #53

fancyIX opened this issue Sep 13, 2018 · 1 comment
Assignees

Comments

@fancyIX
Copy link
Owner

fancyIX commented Sep 13, 2018

No description provided.

@fancyIX fancyIX self-assigned this Sep 13, 2018
@fancyIX
Copy link
Owner Author

fancyIX commented Sep 13, 2018

There is a trade-off between concurrency and the amount of vpgrs per thread. See https://gpuopen.com/optimizing-gpu-occupancy-resource-usage-large-thread-groups/

fancyIX added a commit that referenced this issue Sep 18, 2018
Currently the state is messy. Not working yet. May not working in the future. But just in case some code is useful.
fancyIX added a commit that referenced this issue Sep 18, 2018
Add new file
fancyIX added a commit that referenced this issue Sep 20, 2018
Seems faster, but logic is not correct yet
fancyIX added a commit that referenced this issue Sep 21, 2018
Speed went up after fixing an error, still not working yet
fancyIX added a commit that referenced this issue Sep 22, 2018
phi2 now working
fancyIX added a commit that referenced this issue Sep 22, 2018
Fix lyra2z
fancyIX added a commit that referenced this issue Sep 22, 2018
minor fix
fancyIX added a commit that referenced this issue Sep 22, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant