Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Convolution produces garbage result with SSE2? #7

Open
ienapliss opened this issue Sep 21, 2023 · 0 comments
Open

Convolution produces garbage result with SSE2? #7

ienapliss opened this issue Sep 21, 2023 · 0 comments

Comments

@ienapliss
Copy link

ienapliss commented Sep 21, 2023

In my project, I simply copy the files and define MUFFT_HAVE_X86 and MUFFT_HAVE_SSE inside fft_internals.h.
Then I compile my program with gcc (mingw on windows) with -msse2 for enabling sse2.

If I disable SIMD in the flags, the output results is OK.
mufft_create_plan_conv(N, MUFFT_FLAG_CPU_NO_SSE | MUFFT_FLAG_CPU_NO_AVX | MUFFT_FLAG_CPU_NO_SSE3, MUFFT_CONV_METHOD_FLAG_MONO_MONO);

It's not a problem of alignment (all my data is aligned even to 64 bytes). I tested the pffft library with the same data and simd works just fine.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant