[AArch64] Add FEAT_FPAC to Neoverse V2 #133054

sjoerdmeijer · 2025-03-26T09:04:36Z

This feature is supported in Grace, but wasn't specified in the CPU definition. This is not important for codegen, but is good for completeness, and good for other tools that could query the CPU definition (e.g. llvm-exegesis).

llvmbot · 2025-03-26T09:05:11Z

@llvm/pr-subscribers-clang

Author: Sjoerd Meijer (sjoerdmeijer)

Changes

This feature is supported in Grace, but wasn't specified in the CPU definition. This is not important for codegen, but is good for completeness, and good for other tools that could query the CPU definition (e.g. llvm-exegesis).

Full diff: https://github.com/llvm/llvm-project/pull/133054.diff

2 Files Affected:

(modified) clang/test/Driver/print-enabled-extensions/aarch64-grace.c (+2-1)
(modified) llvm/lib/Target/AArch64/AArch64Processors.td (+2-1)

diff --git a/clang/test/Driver/print-enabled-extensions/aarch64-grace.c b/clang/test/Driver/print-enabled-extensions/aarch64-grace.c
index fde6aee468cdc..739d86f1fae0f 100644
--- a/clang/test/Driver/print-enabled-extensions/aarch64-grace.c
+++ b/clang/test/Driver/print-enabled-extensions/aarch64-grace.c
@@ -21,6 +21,7 @@
 // CHECK-NEXT:     FEAT_FHM                                               Enable FP16 FML instructions
 // CHECK-NEXT:     FEAT_FP                                                Enable Armv8.0-A Floating Point Extensions
 // CHECK-NEXT:     FEAT_FP16                                              Enable half-precision floating-point data processing
+// CHECK-NEXT:     FEAT_FPAC                                              Enable Armv8.3-A Pointer Authentication Faulting enhancement
 // CHECK-NEXT:     FEAT_FRINTTS                                           Enable FRInt[32|64][Z|X] instructions that round a floating-point number to an integer (in FP format) forcing it to fit into a 32- or 64-bit int
 // CHECK-NEXT:     FEAT_FlagM                                             Enable Armv8.4-A Flag Manipulation instructions
 // CHECK-NEXT:     FEAT_FlagM2                                            Enable alternative NZCV format for floating point comparisons
@@ -59,4 +60,4 @@
 // CHECK-NEXT:     FEAT_TRBE                                              Enable Trace Buffer Extension
 // CHECK-NEXT:     FEAT_TRF                                               Enable Armv8.4-A Trace extension
 // CHECK-NEXT:     FEAT_UAO                                               Enable Armv8.2-A UAO PState
-// CHECK-NEXT:     FEAT_VHE                                               Enable Armv8.1-A Virtual Host extension
\ No newline at end of file
+// CHECK-NEXT:     FEAT_VHE                                               Enable Armv8.1-A Virtual Host extension
diff --git a/llvm/lib/Target/AArch64/AArch64Processors.td b/llvm/lib/Target/AArch64/AArch64Processors.td
index 30d9372e4afd1..43b678ef78713 100644
--- a/llvm/lib/Target/AArch64/AArch64Processors.td
+++ b/llvm/lib/Target/AArch64/AArch64Processors.td
@@ -1067,7 +1067,8 @@ def ProcessorFeatures {
                                      FeatureDotProd, FeatureFPARMv8, FeatureMatMulInt8,
                                      FeatureSSBS, FeatureCCIDX,
                                      FeatureJS, FeatureLSE, FeatureRAS, FeatureRCPC, FeatureRDM];
-  list<SubtargetFeature> Grace = !listconcat(NeoverseV2, [FeatureSVE2SM4, FeatureSVEAES, FeatureSVE2SHA3]);
+  list<SubtargetFeature> Grace = !listconcat(NeoverseV2, [FeatureSVE2SM4, FeatureSVEAES, FeatureSVE2SHA3,
+                                                          FeatureFPAC]);
 
   // ETE and TRBE are future architecture extensions. We temporarily enable them
   // by default for users targeting generic AArch64. The extensions do not

llvmbot · 2025-03-26T09:05:12Z

@llvm/pr-subscribers-backend-aarch64

Author: Sjoerd Meijer (sjoerdmeijer)

Changes

This feature is supported in Grace, but wasn't specified in the CPU definition. This is not important for codegen, but is good for completeness, and good for other tools that could query the CPU definition (e.g. llvm-exegesis).

Full diff: https://github.com/llvm/llvm-project/pull/133054.diff

2 Files Affected:

(modified) clang/test/Driver/print-enabled-extensions/aarch64-grace.c (+2-1)
(modified) llvm/lib/Target/AArch64/AArch64Processors.td (+2-1)

diff --git a/clang/test/Driver/print-enabled-extensions/aarch64-grace.c b/clang/test/Driver/print-enabled-extensions/aarch64-grace.c
index fde6aee468cdc..739d86f1fae0f 100644
--- a/clang/test/Driver/print-enabled-extensions/aarch64-grace.c
+++ b/clang/test/Driver/print-enabled-extensions/aarch64-grace.c
@@ -21,6 +21,7 @@
 // CHECK-NEXT:     FEAT_FHM                                               Enable FP16 FML instructions
 // CHECK-NEXT:     FEAT_FP                                                Enable Armv8.0-A Floating Point Extensions
 // CHECK-NEXT:     FEAT_FP16                                              Enable half-precision floating-point data processing
+// CHECK-NEXT:     FEAT_FPAC                                              Enable Armv8.3-A Pointer Authentication Faulting enhancement
 // CHECK-NEXT:     FEAT_FRINTTS                                           Enable FRInt[32|64][Z|X] instructions that round a floating-point number to an integer (in FP format) forcing it to fit into a 32- or 64-bit int
 // CHECK-NEXT:     FEAT_FlagM                                             Enable Armv8.4-A Flag Manipulation instructions
 // CHECK-NEXT:     FEAT_FlagM2                                            Enable alternative NZCV format for floating point comparisons
@@ -59,4 +60,4 @@
 // CHECK-NEXT:     FEAT_TRBE                                              Enable Trace Buffer Extension
 // CHECK-NEXT:     FEAT_TRF                                               Enable Armv8.4-A Trace extension
 // CHECK-NEXT:     FEAT_UAO                                               Enable Armv8.2-A UAO PState
-// CHECK-NEXT:     FEAT_VHE                                               Enable Armv8.1-A Virtual Host extension
\ No newline at end of file
+// CHECK-NEXT:     FEAT_VHE                                               Enable Armv8.1-A Virtual Host extension
diff --git a/llvm/lib/Target/AArch64/AArch64Processors.td b/llvm/lib/Target/AArch64/AArch64Processors.td
index 30d9372e4afd1..43b678ef78713 100644
--- a/llvm/lib/Target/AArch64/AArch64Processors.td
+++ b/llvm/lib/Target/AArch64/AArch64Processors.td
@@ -1067,7 +1067,8 @@ def ProcessorFeatures {
                                      FeatureDotProd, FeatureFPARMv8, FeatureMatMulInt8,
                                      FeatureSSBS, FeatureCCIDX,
                                      FeatureJS, FeatureLSE, FeatureRAS, FeatureRCPC, FeatureRDM];
-  list<SubtargetFeature> Grace = !listconcat(NeoverseV2, [FeatureSVE2SM4, FeatureSVEAES, FeatureSVE2SHA3]);
+  list<SubtargetFeature> Grace = !listconcat(NeoverseV2, [FeatureSVE2SM4, FeatureSVEAES, FeatureSVE2SHA3,
+                                                          FeatureFPAC]);
 
   // ETE and TRBE are future architecture extensions. We temporarily enable them
   // by default for users targeting generic AArch64. The extensions do not

llvmbot · 2025-03-26T09:05:12Z

@llvm/pr-subscribers-clang-driver

Author: Sjoerd Meijer (sjoerdmeijer)

Changes

This feature is supported in Grace, but wasn't specified in the CPU definition. This is not important for codegen, but is good for completeness, and good for other tools that could query the CPU definition (e.g. llvm-exegesis).

Full diff: https://github.com/llvm/llvm-project/pull/133054.diff

2 Files Affected:

(modified) clang/test/Driver/print-enabled-extensions/aarch64-grace.c (+2-1)
(modified) llvm/lib/Target/AArch64/AArch64Processors.td (+2-1)

diff --git a/clang/test/Driver/print-enabled-extensions/aarch64-grace.c b/clang/test/Driver/print-enabled-extensions/aarch64-grace.c
index fde6aee468cdc..739d86f1fae0f 100644
--- a/clang/test/Driver/print-enabled-extensions/aarch64-grace.c
+++ b/clang/test/Driver/print-enabled-extensions/aarch64-grace.c
@@ -21,6 +21,7 @@
 // CHECK-NEXT:     FEAT_FHM                                               Enable FP16 FML instructions
 // CHECK-NEXT:     FEAT_FP                                                Enable Armv8.0-A Floating Point Extensions
 // CHECK-NEXT:     FEAT_FP16                                              Enable half-precision floating-point data processing
+// CHECK-NEXT:     FEAT_FPAC                                              Enable Armv8.3-A Pointer Authentication Faulting enhancement
 // CHECK-NEXT:     FEAT_FRINTTS                                           Enable FRInt[32|64][Z|X] instructions that round a floating-point number to an integer (in FP format) forcing it to fit into a 32- or 64-bit int
 // CHECK-NEXT:     FEAT_FlagM                                             Enable Armv8.4-A Flag Manipulation instructions
 // CHECK-NEXT:     FEAT_FlagM2                                            Enable alternative NZCV format for floating point comparisons
@@ -59,4 +60,4 @@
 // CHECK-NEXT:     FEAT_TRBE                                              Enable Trace Buffer Extension
 // CHECK-NEXT:     FEAT_TRF                                               Enable Armv8.4-A Trace extension
 // CHECK-NEXT:     FEAT_UAO                                               Enable Armv8.2-A UAO PState
-// CHECK-NEXT:     FEAT_VHE                                               Enable Armv8.1-A Virtual Host extension
\ No newline at end of file
+// CHECK-NEXT:     FEAT_VHE                                               Enable Armv8.1-A Virtual Host extension
diff --git a/llvm/lib/Target/AArch64/AArch64Processors.td b/llvm/lib/Target/AArch64/AArch64Processors.td
index 30d9372e4afd1..43b678ef78713 100644
--- a/llvm/lib/Target/AArch64/AArch64Processors.td
+++ b/llvm/lib/Target/AArch64/AArch64Processors.td
@@ -1067,7 +1067,8 @@ def ProcessorFeatures {
                                      FeatureDotProd, FeatureFPARMv8, FeatureMatMulInt8,
                                      FeatureSSBS, FeatureCCIDX,
                                      FeatureJS, FeatureLSE, FeatureRAS, FeatureRCPC, FeatureRDM];
-  list<SubtargetFeature> Grace = !listconcat(NeoverseV2, [FeatureSVE2SM4, FeatureSVEAES, FeatureSVE2SHA3]);
+  list<SubtargetFeature> Grace = !listconcat(NeoverseV2, [FeatureSVE2SM4, FeatureSVEAES, FeatureSVE2SHA3,
+                                                          FeatureFPAC]);
 
   // ETE and TRBE are future architecture extensions. We temporarily enable them
   // by default for users targeting generic AArch64. The extensions do not

Jojad9

Approve to this change

rj-jesus · 2025-03-26T10:02:30Z

llvm/lib/Target/AArch64/AArch64Processors.td

@@ -1067,7 +1067,8 @@ def ProcessorFeatures {
                                     FeatureDotProd, FeatureFPARMv8, FeatureMatMulInt8,
                                     FeatureSSBS, FeatureCCIDX,
                                     FeatureJS, FeatureLSE, FeatureRAS, FeatureRCPC, FeatureRDM];
-  list<SubtargetFeature> Grace = !listconcat(NeoverseV2, [FeatureSVE2SM4, FeatureSVEAES, FeatureSVE2SHA3]);
+  list<SubtargetFeature> Grace = !listconcat(NeoverseV2, [FeatureSVE2SM4, FeatureSVEAES, FeatureSVE2SHA3,
+                                                          FeatureFPAC]);


Hi, does it make sense to move this to the Neoverse V2 definition?
Based on the Neoverse V2 TRM it seems FEAT_FPAC should be supported by the core.

Thanks for spotting this, I think you're right. I got confused with "supported", thinking about this architecturally, but this is in the V2 TRM, so all V2 cores implement this. I will move this to the V2.

Yeah that sounds good - there are a number of other cores that have it but are not marked, but I don't have a list. The features was added after the cores were added and it didn't add them to all. Adding for V2 is a good start.

It can change the way PACBTI is emitted, I believe to make it more efficient and shouldn't be an issue so long as the instructions do throw exceptions.

rj-jesus · 2025-03-26T11:24:37Z

llvm/lib/Target/AArch64/AArch64Processors.td

@@ -555,7 +555,8 @@ def TuneNeoverseV2 : SubtargetFeature<"neoversev2", "ARMProcFamily", "NeoverseV2
                                      FeatureEnableSelectOptimize,
                                      FeatureUseFixedOverScalableIfEqualCost,
                                      FeatureAvoidLDAPUR,
-                                      FeaturePredictableSelectIsExpensive]>;
+                                      FeaturePredictableSelectIsExpensive,
+                                      FeatureFPAC]>;


Is there any reason you placed this in tuning instead of the main processor features of the Neoverse V2?

Nope. :-(
But now fixed.

rj-jesus

Except for a seemingly out-of-order test, LGTM!

rj-jesus · 2025-03-26T12:05:45Z

clang/test/Driver/print-enabled-extensions/aarch64-grace.c

@@ -19,6 +19,7 @@
 // CHECK-NEXT:     FEAT_ETE                                               Enable Embedded Trace Extension
 // CHECK-NEXT:     FEAT_FCMA                                              Enable Armv8.3-A Floating-point complex number support
 // CHECK-NEXT:     FEAT_FHM                                               Enable FP16 FML instructions
+// CHECK-NEXT:     FEAT_FPAC                                              Enable Armv8.3-A Pointer Authentication Faulting enhancement


I think this should be below FEAT_FP16 (otherwise the test will probably fail to match).

Thanks, yeah, I have been messing around with that file a lot and tried different things just to get rid of this Git warning:

-// CHECK-NEXT: FEAT_VHE Enable Armv8.1-A Virtual Host extension
\ No newline at end of file
+// CHECK-NEXT: FEAT_VHE Enable Armv8.1-A Virtual Host extension

I don't get it, I don't see what is wrong.
But it was already there, I am ignoring it for now.

I think the file was missing a new line character at the end (which presumably your editor added automatically when you added the FEAT_FPAC line). I think this is a harmless change, but if you want to undo it, I believe truncate -s -1 should work.

This feature is supported in Grace, but wasn't specified in the CPU definition.

sjoerdmeijer requested review from rj-jesus and davemgreen March 26, 2025 09:04

llvmbot added clang backend:AArch64 clang:driver labels Mar 26, 2025

Jojad9 approved these changes Mar 26, 2025

View reviewed changes

rj-jesus reviewed Mar 26, 2025

View reviewed changes

sjoerdmeijer force-pushed the grace-fpac branch from 9cb58d6 to a65f5c9 Compare March 26, 2025 11:04

sjoerdmeijer changed the title ~~[AArch64] Add FEAT_FPAC to Grace~~ [AArch64] Add FEAT_FPAC to Neoverse V2 Mar 26, 2025

sjoerdmeijer force-pushed the grace-fpac branch from a65f5c9 to e6af1a3 Compare March 26, 2025 11:11

rj-jesus reviewed Mar 26, 2025

View reviewed changes

sjoerdmeijer force-pushed the grace-fpac branch from e6af1a3 to 39ac7e6 Compare March 26, 2025 11:49

rj-jesus approved these changes Mar 26, 2025

View reviewed changes

[AArch64] Add FEAT_FPAC to Neoverse V2

b1619f7

This feature is supported in Grace, but wasn't specified in the CPU definition.

sjoerdmeijer force-pushed the grace-fpac branch from 39ac7e6 to b1619f7 Compare March 26, 2025 13:33

sjoerdmeijer merged commit c0cce43 into llvm:main Mar 26, 2025
11 checks passed

sjoerdmeijer deleted the grace-fpac branch March 26, 2025 15:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AArch64] Add FEAT_FPAC to Neoverse V2 #133054

[AArch64] Add FEAT_FPAC to Neoverse V2 #133054

sjoerdmeijer commented Mar 26, 2025

llvmbot commented Mar 26, 2025

llvmbot commented Mar 26, 2025

llvmbot commented Mar 26, 2025

Jojad9 left a comment

rj-jesus Mar 26, 2025 •

edited

Loading

sjoerdmeijer Mar 26, 2025

davemgreen Mar 26, 2025

rj-jesus Mar 26, 2025

sjoerdmeijer Mar 26, 2025

rj-jesus left a comment

rj-jesus Mar 26, 2025

sjoerdmeijer Mar 26, 2025

rj-jesus Mar 26, 2025 •

edited

Loading

[AArch64] Add FEAT_FPAC to Neoverse V2 #133054

[AArch64] Add FEAT_FPAC to Neoverse V2 #133054

Conversation

sjoerdmeijer commented Mar 26, 2025

llvmbot commented Mar 26, 2025

llvmbot commented Mar 26, 2025

llvmbot commented Mar 26, 2025

Jojad9 left a comment

Choose a reason for hiding this comment

rj-jesus Mar 26, 2025 • edited Loading

Choose a reason for hiding this comment

sjoerdmeijer Mar 26, 2025

Choose a reason for hiding this comment

davemgreen Mar 26, 2025

Choose a reason for hiding this comment

rj-jesus Mar 26, 2025

Choose a reason for hiding this comment

sjoerdmeijer Mar 26, 2025

Choose a reason for hiding this comment

rj-jesus left a comment

Choose a reason for hiding this comment

rj-jesus Mar 26, 2025

Choose a reason for hiding this comment

sjoerdmeijer Mar 26, 2025

Choose a reason for hiding this comment

rj-jesus Mar 26, 2025 • edited Loading

Choose a reason for hiding this comment

rj-jesus Mar 26, 2025 •

edited

Loading

rj-jesus Mar 26, 2025 •

edited

Loading