[AArch64] -mgeneral-regs-only inconsistent with gcc #30140

pirama-arumuga-nainar · 2016-10-26T00:28:41Z


Bugzilla Link	30792
Version	trunk
OS	Linux
Blocks	#4440
CC	@Arnaud-de-Grandmaison-ARM,@efriedma-quic,@gburgessiv,@kbeyls,@slacka,@m-gupta,@nickdesaulniers,@petrhosek,@sbaranga-arm,@stephenhines,@TNorthover

Extended Description

The AArch64 backend runs into a fatal error with the -mgeneral-regs-only when inline assembly refers to a SIMD register.

A reduced test-case:

$ cat > clobber.c << EOF
void dummy() {
asm volatile ("" ::: "v0");
}
EOF

$ clang -c clobber.c -target aarch64-linux-gnu -mgeneral-regs-only
fatal error: error in backend: Do not know how to split the result of this
operator!

The code is compiled by gcc, though. Seems like gcc is lenient with respect to this flag. From https://gcc.gnu.org/onlinedocs/gcc/AArch64-Options.html:
-mgeneral-regs-only
Generate code which uses only the general-purpose registers. This will prevent the compiler from using floating-point and Advanced SIMD registers but will not impose any restrictions on the assembler.

From http://clang.llvm.org/docs/UsersManual.html:
-mgeneral-regs-only
Generate code which only uses the general purpose registers.
This option restricts the generated code to use general registers only. This only applies to the AArch64 architecture.

There are two possible actions here:

Match gcc and allow inline assembly to have SIMD registers. This may be hard to do, considering that '-mgeneral-regs-only' just passes '-targe
t-feature -fp-armv8 -target-feature -crypto -target-feature -neon' to the driver.
Make the driver not crash, and issue an error instead.

This usage seems prevalent in the kernel, which uses -mgeneral-regs-only to avoid saving and restoring the userspace FPSIMD context on every syscall, but hard-codes FPSIMD or crypto instructions in the handful of places they're useful.

pirama-arumuga-nainar · 2016-10-26T00:28:41Z

assigned to @gburgessiv

sbaranga-arm · 2016-11-09T13:04:56Z

Hi Pirama,

I'm trying to figure out what the best action would be for this. Would option 2) work for the main use case (the kernel)?

As you've noted, matching the gcc behaviour would require some effort (although AFAICT it should be straight-forward to do).

Thanks,
Silviu

pirama-arumuga-nainar · 2016-11-09T21:18:56Z

Hi Silviu, based on my understanding, yes, it'd be really useful from a performance perspective to match gcc, especially if it is straightforward.

https://android.googlesource.com/kernel/tegra/+/android-tegra-3.10/arch/arm64/crypto/aes-ce-cipher.c is one example of such a file. Disabling -mgeneral-regs-only for just these files would still necessitate saving of FPSIMD context. Extracting the inline assembly out to separate files would mean relying on lto (which I am not sure is functional for kernel builds yet.

sbaranga-arm · 2016-11-15T18:40:14Z

Hi Pirama,

I now have a preliminary fix this.

Turning the crypto and neon features on only in the assembler turned out to be an easy thing to do. However, correctly handling the inline assembler constraints is more difficult and error-prone (because the simd registers now hold illegal types).

Do you have other examples where this feature is used? I would like to do some more testing before putting this out for review.

Thanks,
Silviu

pirama-arumuga-nainar · 2016-11-17T02:32:07Z

Hi Silviu, did you try the inline assembly in https://android.googlesource.com/kernel/tegra/+/android-tegra-3.10/arch/arm64/crypto/aes-ce-cipher.c? There are two assembly blocks there that clobber v0 and v1.

Unfortunately, I couldn't find any other example use case.

sbaranga-arm · 2016-11-17T02:43:52Z

Thanks, Pirama!

Yes, I went through all of the use cases there. One of them has an umov that clang doesn't like, but otherwise they all work.

In that case I'll put the patches up for review.

-Silviu

TNorthover · 2016-11-18T21:51:00Z

The only difference between this and -mno-implicit-float (which works without adding extra backend features) seems to be how it handles explicit floating-point code.

-mno-implicit-float allows it; -mgeneral-regs-only errors on GCC (it's even polite enough to apologize!), but tries to form some new horribly unsupported soft-float ABI on Clang.

I'd suggest switching it to set NoImplicitFloat instead to fix this particular issue, and then perhaps working out how it should behave in the front-end (error, == -mfloat-abi=soft, or floats allowed).

pirama-arumuga-nainar · 2016-11-29T09:19:58Z

Tim, thanks for the pointer to use -mno-implicit-float. It works for the Kernel because this particular file does not have any explicit floating point operations.

I also agree with the discussion in https://reviews.llvm.org/D26856 that -mgeneral-regs-only should reject explicit floating point code.

m-gupta · 2017-04-24T17:44:25Z

Hi Silviu,

Are you planning on fixing this? I see that https://reviews.llvm.org/D26856 has not been updated in a while.

sbaranga-arm · 2017-04-25T14:46:57Z

Hi Manoj,

The solution at D26856 was the wrong one, we need to reject FP uses in front-end (so probably D26856 could be abandoned).

I don't have any plans to implement this at the moment.

However, we should have a work-around with -mno-implicit-float (as long as the code doesn't use any floating point types).

Thanks,
Silviu

nickdesaulniers · 2017-09-15T18:10:08Z

For anyone interested in picking this up, this is the last bug preventing us from compiling the upstream Linux kernel with LLVM for aarch64 without any out of tree patches.

For more information, please see:

m-gupta · 2017-09-15T18:11:57Z

George, Are you working on it?

gburgessiv · 2017-09-15T18:19:07Z

Yeah, I have a half-made patch that tries to address this. Was planning on sending it out by the end of next week.

gburgessiv · 2017-10-02T21:49:14Z

Proposed fix: https://reviews.llvm.org/D38479

edwintorok · 2021-11-27T01:36:50Z

mentioned in issue #4440

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 10, 2021

efriedma-quic mentioned this issue Apr 6, 2023

[AArch64] Inconsistent -mgeneral-regs-only for assembly #61986

Open

jrtc27 mentioned this issue Apr 6, 2023

module: small fixes for FreeBSD/aarch64 openzfs/zfs#14715

Closed

13 tasks

fdischner mentioned this issue Jul 11, 2023

Adding the -mgeneral-regs-only flag breaks variadic functions littlekernel/lk#383

Open

mysterymath mentioned this issue Jan 16, 2024

[AArch64] Disable FP loads/stores when fp-armv8 not enabled #77817

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AArch64] -mgeneral-regs-only inconsistent with gcc #30140

[AArch64] -mgeneral-regs-only inconsistent with gcc #30140

pirama-arumuga-nainar commented Oct 26, 2016

pirama-arumuga-nainar commented Oct 26, 2016

sbaranga-arm mannequin commented Nov 9, 2016

pirama-arumuga-nainar commented Nov 9, 2016

sbaranga-arm mannequin commented Nov 15, 2016

pirama-arumuga-nainar commented Nov 17, 2016

sbaranga-arm mannequin commented Nov 17, 2016

TNorthover commented Nov 18, 2016

pirama-arumuga-nainar commented Nov 29, 2016

m-gupta commented Apr 24, 2017

sbaranga-arm mannequin commented Apr 25, 2017

nickdesaulniers commented Sep 15, 2017

m-gupta commented Sep 15, 2017

gburgessiv commented Sep 15, 2017

gburgessiv commented Oct 2, 2017

edwintorok commented Nov 27, 2021

[AArch64] -mgeneral-regs-only inconsistent with gcc #30140

[AArch64] -mgeneral-regs-only inconsistent with gcc #30140

Comments

pirama-arumuga-nainar commented Oct 26, 2016

Extended Description

pirama-arumuga-nainar commented Oct 26, 2016

sbaranga-arm mannequin commented Nov 9, 2016

pirama-arumuga-nainar commented Nov 9, 2016

sbaranga-arm mannequin commented Nov 15, 2016

pirama-arumuga-nainar commented Nov 17, 2016

sbaranga-arm mannequin commented Nov 17, 2016

TNorthover commented Nov 18, 2016

pirama-arumuga-nainar commented Nov 29, 2016

m-gupta commented Apr 24, 2017

sbaranga-arm mannequin commented Apr 25, 2017

nickdesaulniers commented Sep 15, 2017

m-gupta commented Sep 15, 2017

gburgessiv commented Sep 15, 2017

gburgessiv commented Oct 2, 2017

edwintorok commented Nov 27, 2021