Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CUDA 4.1 FFT 50% Performance Drop #1

Closed
kspaff opened this issue Feb 29, 2012 · 1 comment
Closed

CUDA 4.1 FFT 50% Performance Drop #1

kspaff opened this issue Feb 29, 2012 · 1 comment
Labels

Comments

@kspaff
Copy link
Contributor

kspaff commented Feb 29, 2012

With the new LLVM compiler backend, CUDA FFT performance dropped by 50% on Keeneland. OpenCL performance stayed the same.

I suspect this might be due to loops being unrolled differently (the unroll option that used to go to the old compiler is now ignored).

@kspaff
Copy link
Contributor Author

kspaff commented Apr 13, 2014

Fixed some time ago, FFT can now be built to use cufft

@kspaff kspaff closed this as completed Apr 13, 2014
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant