Conversation
Please update the changelog in order to start CI tests. View the gpuCI docs here. |
Hi @mnicely, thanks for the benchmark data. I am a bit confused --- any chance the outcomes for v8 + cache and v8 no cache are swapped? The cached performance seems to be worse if I read it right. |
Not swapped, just bad wording! That is the cuSignal cache I created. When I turn it off and use CuPy’s, the FFT is faster! |
Ahh I see, thanks for clarifying, Matt. The number looks very good then! I wonder if all can be attributed to CuPy's cache, or there are additional nice changes made to v8? |
I believe the speedups between cuSignal’s cache + CuPy v7.8 and cuSignal (with no cache) + CuPy v8.0 is solely the cache. And the differences cuSignal (with no cache) between v7.8 and v8.0 is attributed to many improvements. |
Thanks, @mnicely! I wonder if you could do one additional test for me when you have time: Use CuPy v8, but turn off all caches (either cuSignal's or CuPy's). The latter can be turned off this way: import cupy as cp
cache = cp.fft.config.get_plan_cache()
cache.set_size(0) Note the |
Sure, I'll try to have you something by the end of this week! |
Closes #253
This PR adds a check for CuPy v7 or v8, and uses version 8's internal FFT cache.