Skip to content

Commit

Permalink
[X86] AMD Zen 3 has fast variable per-lane shuffles
Browse files Browse the repository at this point in the history
... but lane-crossing shuffles are slow.
  • Loading branch information
LebedevRI committed Jun 1, 2021
1 parent cf9b1f7 commit a3b8695
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion llvm/lib/Target/X86/X86.td
Expand Up @@ -1111,7 +1111,9 @@ def ProcessorFeatures {
FeaturePKU,
FeatureVAES,
FeatureVPCLMULQDQ];
list<SubtargetFeature> ZN3AdditionalTuning = [FeatureMacroFusion];
list<SubtargetFeature> ZN3AdditionalTuning =
[FeatureMacroFusion,
FeatureFastVariablePerLaneShuffle];
list<SubtargetFeature> ZN3Tuning =
!listconcat(ZNTuning, ZN3AdditionalTuning);
list<SubtargetFeature> ZN3Features =
Expand Down

0 comments on commit a3b8695

Please sign in to comment.