Fix rounding mode check in SSE4.1 round functions #3124

eduardosm · 2023-10-16T16:40:51Z

Now it masks out the correct bit and adds some explanatory comments. Also extends the tests.

RalfJung · 2023-10-16T16:48:07Z

src/shims/x86/sse41.rs

+        // determined by the SSE status register. We do not support that case,
+        // since modifying it is unsupported in Miri (and Rust).


Is the SSE status register different from the float status register?

For the float status register, we just assume it is always in NearestTiesToEven mode... but that's the rounding mode for float-to-float operations, right? Is the integer rounding mode even controlled in a status register for regular floats?

So for the SSE status register, I assume we'd just say that it is always assumed to have a particular rounding mode. But I don't know which rounding mode e.g. LLVM will assume here.

Yes, it is the same register that specifies rounding mode for arithmetic operations, so I guess we can assume to be round-to-nearest (we already do that for float-to-int conversions).

It looks like LLVM does not const-fold it, even when the rounding mode does not depend on the status register.
https://godbolt.org/z/zdh3McGcb

Wait, don't the normal float-to-int casts do a round-to-zero?

SSE has two version: one that rounds to zero (_mm_cvttss_si32) and one that uses the rounding mode of the status register (_mm_cvtss_si32). We assume round-to-nearest for the later.

miri/src/shims/x86/sse.rs

Lines 168 to 176 in c8d4e83

let rnd = match unprefixed_name {

// "current SSE rounding mode", assume nearest

// https://www.felixcloutier.com/x86/cvtss2si

"cvtss2si" | "cvtss2si64" => rustc_apfloat::Round::NearestTiesToEven,

// always truncate

// https://www.felixcloutier.com/x86/cvttss2si

"cvttss2si" | "cvttss2si64" => rustc_apfloat::Round::TowardZero,

_ => unreachable!(),

};

SSE has two version: one that rounds to zero (_mm_cvttss_si32) and one that uses the rounding mode of the status register (_mm_cvtss_si32). We assume round-to-nearest for the later.

Ah okay, let's also do that here then.

But independently of that -- f32 as i32 casts also always round to zero, so I supposed they also ignore the status register? IOW, that status register really only affects SSE operations, not scalar float operations?

Yes, Rust's built-in cast will use the same instruction as _mm_cvttss_si32.

src/shims/x86/sse41.rs

RalfJung · 2023-10-17T07:47:48Z

src/shims/x86/sse41.rs

+        0b011 => rustc_apfloat::Round::TowardZero,
+        // When the third bit is 1, the rounding mode is determined by the
+        // SSE status register. Since we do not support modifying it from
+        // Miri (or Rust), we assume to be at its default mode (round-to-nearest).


Suggested change

// Miri (or Rust), we assume to be at its default mode (round-to-nearest).

// Miri (or Rust), we assume it to be at its default mode (round-to-nearest).

RalfJung · 2023-10-17T07:48:15Z

r=me with the last typo fixed.
@bors delegate+

bors · 2023-10-17T07:48:18Z

✌️ @eduardosm, you can now approve this pull request!

If @RalfJung told you to "r=me" after making some further change, please make that change, then do @bors r=@RalfJung

Now it masks out the correct bit and adds some explanatory comments. Also extends the tests.

eduardosm · 2023-10-17T15:23:19Z

@bors r=@RalfJung

bors · 2023-10-17T15:23:21Z

📌 Commit 2a88ae4 has been approved by RalfJung

It is now in the queue for this repository.

bors · 2023-10-17T15:24:28Z

⌛ Testing commit 2a88ae4 with merge aaaac66...

bors · 2023-10-17T16:11:33Z

☀️ Test successful - checks-actions
Approved by: RalfJung
Pushing aaaac66 to master...

RalfJung reviewed Oct 16, 2023

View reviewed changes

eduardosm force-pushed the fix-sse41-round branch from 075e582 to 3884d15 Compare October 16, 2023 17:39

RalfJung reviewed Oct 16, 2023

View reviewed changes

src/shims/x86/sse41.rs Show resolved Hide resolved

RalfJung reviewed Oct 17, 2023

View reviewed changes

Fix rounding mode check in SSE4.1 round functions

2a88ae4

Now it masks out the correct bit and adds some explanatory comments. Also extends the tests.

eduardosm force-pushed the fix-sse41-round branch from 3884d15 to 2a88ae4 Compare October 17, 2023 15:22

bors merged commit aaaac66 into rust-lang:master Oct 17, 2023
8 checks passed

eduardosm deleted the fix-sse41-round branch October 17, 2023 16:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix rounding mode check in SSE4.1 round functions #3124

Fix rounding mode check in SSE4.1 round functions #3124

eduardosm commented Oct 16, 2023

RalfJung Oct 16, 2023

eduardosm Oct 16, 2023

RalfJung Oct 16, 2023 •

edited

eduardosm Oct 16, 2023

RalfJung Oct 16, 2023

eduardosm Oct 16, 2023

RalfJung Oct 17, 2023

RalfJung commented Oct 17, 2023

bors commented Oct 17, 2023

eduardosm commented Oct 17, 2023

bors commented Oct 17, 2023

bors commented Oct 17, 2023

bors commented Oct 17, 2023

		// determined by the SSE status register. We do not support that case,
		// since modifying it is unsupported in Miri (and Rust).

	let rnd = match unprefixed_name {
	// "current SSE rounding mode", assume nearest
	// https://www.felixcloutier.com/x86/cvtss2si
	"cvtss2si" \| "cvtss2si64" => rustc_apfloat::Round::NearestTiesToEven,
	// always truncate
	// https://www.felixcloutier.com/x86/cvttss2si
	"cvttss2si" \| "cvttss2si64" => rustc_apfloat::Round::TowardZero,
	_ => unreachable!(),
	};

	// Miri (or Rust), we assume to be at its default mode (round-to-nearest).
	// Miri (or Rust), we assume it to be at its default mode (round-to-nearest).

Fix rounding mode check in SSE4.1 round functions #3124

Fix rounding mode check in SSE4.1 round functions #3124

Conversation

eduardosm commented Oct 16, 2023

RalfJung Oct 16, 2023

Choose a reason for hiding this comment

eduardosm Oct 16, 2023

Choose a reason for hiding this comment

RalfJung Oct 16, 2023 • edited

Choose a reason for hiding this comment

eduardosm Oct 16, 2023

Choose a reason for hiding this comment

RalfJung Oct 16, 2023

Choose a reason for hiding this comment

eduardosm Oct 16, 2023

Choose a reason for hiding this comment

RalfJung Oct 17, 2023

Choose a reason for hiding this comment

RalfJung commented Oct 17, 2023

bors commented Oct 17, 2023

eduardosm commented Oct 17, 2023

bors commented Oct 17, 2023

bors commented Oct 17, 2023

bors commented Oct 17, 2023

RalfJung Oct 16, 2023 •

edited