Fix LLVM target detection when cross-compiling. #57182

hawkinsp · 2022-08-16T15:39:54Z

Conditions like @bazel_tools//src/conditions:linux_aarch64 do not
appear to be triggered correctly when cross-compiling on Linux. I'm
guessing this is because TensorFlow does not yet use Bazel platforms and
instead uses the older --cpu and --crosstool_top features.

This means that we fall through to the default condition and end up
building an x86-targeting LLVM even though we intended to target, say,
aarch64. The symptom this causes is errors like:
'neoverse-n1' is not a recognized processor for this target (ignoring
processor)
'+neon' is not a recognized feature for this target (ignoring feature)
'+fp-armv8' is not a recognized feature for this target (ignoring
feature)
'+crypto' is not a recognized feature for this target (ignoring feature)
'+lse' is not a recognized feature for this target (ignoring feature)
'+crc' is not a recognized feature for this target (ignoring feature)
from XLA:CPU compilation.

Take the same approach that has previously been used for Darwin ARM64
builds and add a new config_setting() for LLVM architecture detection
that mirrors the definitions in //tensorflow/BUILD.

Change tested both under x86->aarch64 cross compilation and aarch64
self-hosted compilation.

Conditions like @bazel_tools//src/conditions:linux_aarch64 do not appear to be triggered correctly when cross-compiling on Linux. I'm guessing this is because TensorFlow does not yet use Bazel platforms and instead uses the older --cpu and --crosstool_top features. This means that we fall through to the default condition and end up building an x86-targeting LLVM even though we intended to target, say, aarch64. The symptom this causes is errors like: 'neoverse-n1' is not a recognized processor for this target (ignoring processor) '+neon' is not a recognized feature for this target (ignoring feature) '+fp-armv8' is not a recognized feature for this target (ignoring feature) '+crypto' is not a recognized feature for this target (ignoring feature) '+lse' is not a recognized feature for this target (ignoring feature) '+crc' is not a recognized feature for this target (ignoring feature) from XLA:CPU compilation. Take the same approach that has previously been used for Darwin ARM64 builds and add a new config_setting() for LLVM architecture detection that mirrors the definitions in //tensorflow/BUILD. Change tested both under x86->aarch64 cross compilation and aarch64 self-hosted compilation.

hawkinsp requested a review from chsigg August 16, 2022 15:39

google-ml-butler bot added the size:M CL Change Size: Medium label Aug 16, 2022

google-ml-butler bot assigned gbaned Aug 16, 2022

google-ml-butler bot added the awaiting review Pull request awaiting review label Aug 16, 2022

This was referenced Aug 16, 2022

[Feature request]: Add support for Linux ARM64 conda-forge/jaxlib-feedstock#125

Open

Provide AArch64 (ARM) Linux jaxlib wheels google/jax#7097

Closed

gbaned added this to Assigned Reviewer in PR Queue via automation Aug 17, 2022

chsigg approved these changes Aug 19, 2022

View reviewed changes

google-ml-butler bot added kokoro:force-run Tests on submitted change ready to pull PR ready for merge process labels Aug 19, 2022

PR Queue automation moved this from Assigned Reviewer to Approved by Reviewer Aug 19, 2022

kokoro-team removed the kokoro:force-run Tests on submitted change label Aug 19, 2022

copybara-service bot merged commit 5f8ae63 into tensorflow:master Aug 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix LLVM target detection when cross-compiling. #57182

Fix LLVM target detection when cross-compiling. #57182

hawkinsp commented Aug 16, 2022

Fix LLVM target detection when cross-compiling. #57182

Fix LLVM target detection when cross-compiling. #57182

Conversation

hawkinsp commented Aug 16, 2022