-
Notifications
You must be signed in to change notification settings - Fork 651
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
support AArch64 Neon SIMD #25
Comments
Can you provide a patch? We don't have an aarch64 development environment, so I am not quite sure what to do. Is neon part of the standard aarch64 ISA, or is it an extension that requires special compiler flags? |
It is beyond me. I have googled this problem. I think Ubuntu community is working on it. |
matteo-frigo: yes NEON is part of the standard ISA (with double the registers, you'll like it!!!). I just found this issue, I'm currently working it around by setting NEON_CFLAGS=-D__ARM_NEON__ |
I don't know much about ARMv8, but there does seem to be some level of
emulation)
floating-
extension Can someone please explain why this is not sufficient? On Mon, Jan 12, 2015 at 9:53 AM, Jérôme Carretero notifications@github.com
|
Actually I was blocked further when trying to build (using an Android aarch64-linux-android sysroot); can someone build for 64-bit ARM at all? |
Figured out that there is no |
I have closed it by mistake. @zougloub thanks for rooting the cause. |
I've added support for double-precision Neon (available in aarch64) in my fork, https://github.com/rdolbeau/fftw3. It should asupport both SP and DP version on aarch64. Only tested in Qemu so far. Any report welcome. |
@rdolbeau, wow, looks nice. @matteo-frigo, can you take a look? (Dealing with MIT's Technology Licensing Office may be a hassle for incorporating nontrivial patches, however, unless you are willing to make a copyright assignment to MIT. The problem is that MIT still sells non-GPL licenses for FFTW.) |
I've also added a cycle counter in my fork for better planning (using the virtual counter register, see the commit message for details; thanks to ARM for the informations). Any report/comment/criticism welcome. |
Aarch64 is supported in the current trunk. |
Does this mean that FFTW releases have support for double precision FFT operations using optimized NEON instructions? How can I compile it to do so? @rdolbeau When compiling with --enable-neon you get the error message |
Armv7 (32 bits such such as A7,A9,A15, ...) do not have hardware support for double-precision NEON, and gives the message above. Aarch64 (64 bits, such as A53, A72, ...) is required to get double-precision NEON support with --enable-neon. |
GCC does not support -mfpu flag for AArch64.
The text was updated successfully, but these errors were encountered: