Use SIMD #40

jimrybarski · 2018-12-12T17:34:51Z

Would it be feasible and beneficial to add SIMD instructions?

ejmahler · 2018-12-12T18:50:18Z

I haven’t tested it, but I bet it would. If you want to do a little of this yourself, a good place to start would be some simple loops. In mixed_radix.rs, search for “// STEP 3: Apply twiddle factors”. On one hand, this loop is simple enough that it would be easy to convert to a SIMD iterator. On the other hand, it’s simple enough that the compiler might already be auto-vectorizing it. So a helpful place for you to start would be: what is the compiler currently emitting for this loop in optimized builds? If it turns out that it’s not already emitting SIMD instructions, I can think of several simple loops like that which could be easily converted to a SIMD iterator.
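For anyone who wants to check the compiler output, here is a minimal sketch of that kind of loop. It is not the actual code from mixed_radix.rs: the `Complex32` type and the `apply_twiddles` function are hypothetical stand-ins, written without dependencies so the snippet can be compiled on its own.

```rust
use std::ops::Mul;

// Hypothetical stand-in for the crate's complex number type.
#[derive(Clone, Copy, Debug, PartialEq)]
pub struct Complex32 {
    pub re: f32,
    pub im: f32,
}

impl Mul for Complex32 {
    type Output = Complex32;

    // Standard complex multiplication: (a + bi)(c + di).
    #[inline]
    fn mul(self, rhs: Complex32) -> Complex32 {
        Complex32 {
            re: self.re * rhs.re - self.im * rhs.im,
            im: self.re * rhs.im + self.im * rhs.re,
        }
    }
}

// Multiply each element of `buffer` by the matching twiddle factor, in place.
// `inline(never)` keeps this as its own symbol so it is easy to find in the
// generated assembly.
#[inline(never)]
pub fn apply_twiddles(buffer: &mut [Complex32], twiddles: &[Complex32]) {
    for (element, twiddle) in buffer.iter_mut().zip(twiddles.iter()) {
        *element = *element * *twiddle;
    }
}
```

Building with `cargo rustc --release -- --emit asm` (or pasting the snippet into Compiler Explorer with `-C opt-level=3`) and searching for `apply_twiddles` should show whether the loop body uses packed instructions such as `mulps` or only scalar ones such as `mulss`.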


ejmahler · 2018-12-12T18:57:10Z

For a more complicated function that might benefit, check out “butterfly_4” in radix4.rs. This would require more thinking to convert, but I bet the compiler has a harder time auto-vectorizing it.
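To give a sense of why this one is harder, here is a rough scalar sketch of a forward 4-point butterfly. It is an illustration only, not the code in radix4.rs: the `Complex32` type and this `butterfly_4` signature are assumptions made for the example.

```rust
use std::ops::{Add, Sub};

// Hypothetical stand-in for the crate's complex number type.
#[derive(Clone, Copy, Debug, PartialEq)]
pub struct Complex32 {
    pub re: f32,
    pub im: f32,
}

impl Add for Complex32 {
    type Output = Complex32;
    fn add(self, rhs: Complex32) -> Complex32 {
        Complex32 { re: self.re + rhs.re, im: self.im + rhs.im }
    }
}

impl Sub for Complex32 {
    type Output = Complex32;
    fn sub(self, rhs: Complex32) -> Complex32 {
        Complex32 { re: self.re - rhs.re, im: self.im - rhs.im }
    }
}

// Forward 4-point DFT computed in place (assumed signature).
pub fn butterfly_4(x: &mut [Complex32; 4]) {
    // First stage: two 2-point butterflies on the even and odd inputs.
    let a = x[0] + x[2];
    let b = x[0] - x[2];
    let c = x[1] + x[3];
    let d = x[1] - x[3];

    // Multiplying by -i is just a swap and a negation: (re, im) -> (im, -re).
    let d_rot = Complex32 { re: d.im, im: -d.re };

    // Second stage: combine the partial results.
    x[0] = a + c;
    x[1] = b + d_rot;
    x[2] = a - c;
    x[3] = b - d_rot;
}
```

The swap-and-negate step mixes real and imaginary lanes, which is the sort of shuffle an auto-vectorizer tends to handle less well than a plain element-wise loop.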

superblaubeere27 · 2020-09-14T17:02:23Z

I looked into the assembly of DFT and there is definitely no auto-vectorization going on. I will try to fix it and create a PR afterward.

ejmahler · 2020-09-25T07:54:03Z

Hi @superblaubeere27 and @jimrybarski, you might be interested to know that in the "simd" branch, I currently have an AVX implementation that outperforms FFTW.

It's not ready for release yet (requires nightly, will fail to compile if you're not targeting x86_64, no way to opt out of all the AVX code, way too much unsafe), but if you want some faster FFTs, I would love to have someone test it out!
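For context on the portability issues listed above, one common way to keep AVX code from breaking other targets is to gate it behind `cfg(target_arch = "x86_64")` and choose a path at runtime. The sketch below shows that general pattern on a toy operation; the function names are hypothetical and this is not how the simd branch is organized.

```rust
// Portable fallback: scale every sample by a constant.
fn scale_scalar(data: &mut [f32], factor: f32) {
    for x in data.iter_mut() {
        *x *= factor;
    }
}

// AVX path, only compiled when targeting x86_64.
#[cfg(target_arch = "x86_64")]
#[target_feature(enable = "avx")]
unsafe fn scale_avx(data: &mut [f32], factor: f32) {
    use std::arch::x86_64::*;

    let f = _mm256_set1_ps(factor);
    let mut chunks = data.chunks_exact_mut(8);
    // Process eight f32 values per iteration with unaligned loads and stores.
    for chunk in &mut chunks {
        let v = _mm256_loadu_ps(chunk.as_ptr());
        _mm256_storeu_ps(chunk.as_mut_ptr(), _mm256_mul_ps(v, f));
    }
    // Handle the leftover elements that do not fill a full vector.
    for x in chunks.into_remainder() {
        *x *= factor;
    }
}

// Safe entry point: uses AVX when the CPU reports it, otherwise the fallback.
pub fn scale(data: &mut [f32], factor: f32) {
    #[cfg(target_arch = "x86_64")]
    {
        if is_x86_feature_detected!("avx") {
            // Safe to call because the feature was just detected at runtime.
            unsafe { scale_avx(data, factor) };
            return;
        }
    }
    scale_scalar(data, factor);
}
```

With this layout the crate still compiles on non-x86_64 targets, and the `unsafe` is confined to one call site guarded by the feature check.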

ejmahler · 2020-09-25T08:15:29Z

The simd branch in my own fork https://github.com/ejmahler/RustFFT/tree/simd

ejmahler · 2021-01-05T03:02:09Z

RustFFT 5.0 has an AVX implementation. I will implement SSE some time soon, and NEON/AVX-512 when I can get a machine to test them on and Rust stabilizes them.

FYI, all rust development will be happening in https://github.com/ejmahler/RustFFT from now on.

ejmahler closed this as completed Jan 5, 2021
