s390x: use `simd_shuffle!` macro #1965

folkertdev · 2025-11-27T21:38:08Z

No description provided.

rustbot · 2025-11-27T21:38:13Z

rustbot has assigned @sayantn.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

folkertdev · 2025-11-27T21:39:15Z

crates/core_arch/src/s390x/vector.rs

+// NOTE: `vflls` and `vldeb` are equivalent; our disassmbler prefers vflls.
+#[cfg_attr(
+    all(test, target_feature = "vector-enhancements-1"),
+    assert_instr(vflls)
+)]


@uweigand

just checking, is that correct? this implementation produces vldeb on godbolt, so I really think it's the disassembler that is picking vflls here.

Yes, the two are equivalent; in fact the binary machine code is identical, it's just different assembler mnemonics for the same instruction. Both vldeb v1, v2 and vflls v1, v2 are simply extended mnemonics for the "base" instruction vfll v1, v2, 0, 0 (see the PoP chapter 24 under VECTOR FP LOAD LENGTHENED). It's a bit unfortunate to have two mnemonics for the same thing, but it's due to historical reasons (vflls is the more recent one, and probably should be preferred).

folkertdev · 2025-11-27T21:40:51Z

crates/core_arch/src/s390x/vector.rs

-    simd_shuffle(
-        truncated,
-        truncated,
-        const { u32x4::from_array([0, 0, 1, 1]) },
-    )
+    simd_shuffle!(truncated, truncated, [0, 0, 1, 1])


and here, vec_floate does not specify what happens to the odd elements. clang + llvm end up using poison values, but rust does not have those. So, is there anything that can be done?

this is not all that important of course, but it's a bit of a wart.

Yes, it's deliberately left unspecified what happens to the odd elements. This matches the behavior of the underlying VECTOR FP LOAD ROUNDED instruction, which also states: "The data in the odd elements of the first operand is unpredictable." You could mask it to some defined value, but at extra runtime cost of course.

Right, well there is nothing we can do then. There is always inline assembly if someone really needs to emit the exact instruction.

s390x: use simd_shuffle! macro

1105394

rustbot assigned sayantn Nov 27, 2025

folkertdev commented Nov 27, 2025

View reviewed changes

sayantn added this pull request to the merge queue Nov 30, 2025

Merged via the queue into rust-lang:main with commit 7fb504f Nov 30, 2025
73 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

s390x: use `simd_shuffle!` macro #1965

s390x: use `simd_shuffle!` macro #1965

Uh oh!

folkertdev commented Nov 27, 2025

Uh oh!

rustbot commented Nov 27, 2025

Uh oh!

folkertdev Nov 27, 2025

Uh oh!

uweigand Nov 28, 2025

Uh oh!

folkertdev Nov 27, 2025

Uh oh!

uweigand Nov 28, 2025

Uh oh!

folkertdev Nov 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

s390x: use simd_shuffle! macro #1965

s390x: use simd_shuffle! macro #1965

Uh oh!

Conversation

folkertdev commented Nov 27, 2025

Uh oh!

rustbot commented Nov 27, 2025

Uh oh!

folkertdev Nov 27, 2025

Choose a reason for hiding this comment

Uh oh!

uweigand Nov 28, 2025

Choose a reason for hiding this comment

Uh oh!

folkertdev Nov 27, 2025

Choose a reason for hiding this comment

Uh oh!

uweigand Nov 28, 2025

Choose a reason for hiding this comment

Uh oh!

folkertdev Nov 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

s390x: use `simd_shuffle!` macro #1965

s390x: use `simd_shuffle!` macro #1965