
[LangRef] Clarify semantics of masked vector load/store #82469

Open: wants to merge 1 commit into main
Conversation

RalfJung (Contributor)

This is based on what I think has to follow from the statement about preventing exceptions. But I don't actually know what LLVM IR passes will do with these intrinsics, so this requires careful review by someone who does. :)

@nikic do you know these passes / know who knows these passes to do the review?

Also, there's an open question that remains: for the purpose of noalias, do these operations access the masked-off lanes or not? I sure hope they don't, but I realized that while data races are mentioned, noalias is not.

llvmbot (Collaborator) commented Feb 21, 2024

@llvm/pr-subscribers-llvm-ir

Author: Ralf Jung (RalfJung)

Changes: same as the PR description above.


Full diff: https://github.com/llvm/llvm-project/pull/82469.diff

1 file affected:

  • (modified) llvm/docs/LangRef.rst (+2)
diff --git a/llvm/docs/LangRef.rst b/llvm/docs/LangRef.rst
index fd2e3aacd0169c..496773c4d3d386 100644
--- a/llvm/docs/LangRef.rst
+++ b/llvm/docs/LangRef.rst
@@ -23752,6 +23752,7 @@ Semantics:
 
 The '``llvm.masked.load``' intrinsic is designed for conditional reading of selected vector elements in a single IR operation. It is useful for targets that support vector masked loads and allows vectorizing predicated basic blocks on these targets. Other targets may support this intrinsic differently, for example by lowering it into a sequence of branches that guard scalar load operations.
 The result of this operation is equivalent to a regular vector load instruction followed by a 'select' between the loaded and the passthru values, predicated on the same mask. However, using this intrinsic prevents exceptions on memory access to masked-off lanes.
+In particular, this means that only the masked-on lanes of the vector need to be inbounds of an allocation (but all these lanes need to be inbounds of the same allocation).
 
 
 ::
@@ -23794,6 +23795,7 @@ Semantics:
 
 The '``llvm.masked.store``' intrinsics is designed for conditional writing of selected vector elements in a single IR operation. It is useful for targets that support vector masked store and allows vectorizing predicated basic blocks on these targets. Other targets may support this intrinsic differently, for example by lowering it into a sequence of branches that guard scalar store operations.
 The result of this operation is equivalent to a load-modify-store sequence. However, using this intrinsic prevents exceptions and data races on memory access to masked-off lanes.
+In particular, this means that only the masked-on lanes of the vector need to be inbounds of an allocation (but all these lanes need to be inbounds of the same allocation).
 
 ::
 
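As a reading aid (an editorial sketch, not part of the patch): the IR below shows how the two intrinsics are typically called. The function names, the v4i32 element type, and the alignment of 4 are illustrative assumptions. When every lane of the pointer is accessible, the masked load matches the plain load-plus-select shown second; the intrinsic additionally guarantees that masked-off lanes are not accessed.

declare <4 x i32> @llvm.masked.load.v4i32.p0(ptr, i32 immarg, <4 x i1>, <4 x i32>)
declare void @llvm.masked.store.v4i32.p0(<4 x i32>, ptr, i32 immarg, <4 x i1>)

define <4 x i32> @masked_load_sketch(ptr %p, <4 x i1> %mask, <4 x i32> %passthru) {
  ; Loads only the enabled lanes; disabled lanes of the result come from %passthru.
  %v = call <4 x i32> @llvm.masked.load.v4i32.p0(ptr %p, i32 4, <4 x i1> %mask, <4 x i32> %passthru)
  ret <4 x i32> %v
}

define void @masked_store_sketch(ptr %p, <4 x i32> %val, <4 x i1> %mask) {
  ; Stores only the enabled lanes; memory behind masked-off lanes is untouched,
  ; which is what rules out exceptions and data races on those lanes.
  call void @llvm.masked.store.v4i32.p0(<4 x i32> %val, ptr %p, i32 4, <4 x i1> %mask)
  ret void
}

; Only valid as a substitute when all four lanes at %p may be accessed:
define <4 x i32> @load_then_select_sketch(ptr %p, <4 x i1> %mask, <4 x i32> %passthru) {
  %wide = load <4 x i32>, ptr %p, align 4
  %v = select <4 x i1> %mask, <4 x i32> %wide, <4 x i32> %passthru
  ret <4 x i32> %v
}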

RalfJung (Contributor, Author) commented on the added line in llvm/docs/LangRef.rst:

In particular, this means that only the masked-on lanes of the vector need to be inbounds of an allocation (but all these lanes need to be inbounds of the same allocation).

Is "masked-on" the opposite of "masked-off"? Or is there some other term I could use?

nikic requested a review from topperc (February 21, 2024 08:07)
nikic (Contributor) commented Feb 21, 2024

I would rephrase this in terms of something like this:

However, these intrinsics behave as if the masked-off lanes are not accessed.

That should tell us everything necessary about their semantics. We can then go on to clarify that this means no exceptions / data races / etc.

RalfJung (Contributor, Author)

That doesn't quite say everything -- there's the question of whether this Rust PR should say offset (aka getelementptr inbounds) or wrapping_offset (aka getelementptr) when describing how the pointers to the individual elements being loaded are computed.

programmerjake (Contributor)

That doesn't quite say everything -- there's the question of whether this Rust PR should say offset (aka getelementptr inbounds)

There is the additional caveat that LLVM is allowed to create a poison value without UB (which is what happens with getelementptr inbounds and out-of-bounds indices), whereas Rust defines an out-of-bounds offset to be immediate UB rather than deferring it to the load/store.

A major difference between the two choices is that doing a masked load on a pointer before the beginning of its allocation is disallowed with inbounds, but allowed without inbounds as long as vector elements are masked off until the offset is big enough to be within the allocation's bounds.
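A minimal IR sketch of that distinction (the function name and the -8 offset are illustrative): the inbounds form yields poison when the computed pointer leaves the allocation, while the plain form always produces a defined pointer value; Rust's offset, by contrast, makes the out-of-bounds computation itself immediate UB.

define void @gep_flavours(ptr %p) {
  ; If %p - 8 does not stay within %p's allocated object, this inbounds GEP
  ; yields poison; UB only arises later, if the poison pointer is actually
  ; dereferenced or otherwise used in a way that requires a valid value.
  %a = getelementptr inbounds i8, ptr %p, i64 -8
  ; The plain GEP always produces a well-defined pointer value, even when it
  ; points outside the allocation; it just may not be dereferenceable.
  %b = getelementptr i8, ptr %p, i64 -8
  ret void
}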

RalfJung (Contributor, Author) commented Feb 21, 2024

A major difference between the two choices is that doing a masked load on a pointer before the beginning of its allocation is disallowed with inbounds, but allowed without inbounds as long as vector elements are masked off until the offset is big enough to be within the allocation's bounds.

Yes, that is indeed the key point: if the first half of the vector is masked off, and that first half is actually out-of-bounds, then the pointer itself is conceptually out-of-bounds, and "computing the pointer to the actually loaded element" would be a non-inbounds pointer computation. I expect this use case to be allowed, which is why I added the following in this PR:

Only the masked-on lanes of the vector need to be inbounds of an allocation (but all these lanes need to be inbounds of the same allocation).
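To make that concrete, here is a sketch of the use case under the proposed wording (the allocation size, mask, and names are illustrative assumptions): the base pointer sits 8 bytes before a 2-element i32 allocation, the two leading lanes are masked off, and only the two trailing lanes, which land inside the allocation, need to be inbounds.

declare <4 x i32> @llvm.masked.load.v4i32.p0(ptr, i32 immarg, <4 x i1>, <4 x i32>)

define <4 x i32> @load_with_leading_lanes_masked_off() {
  %buf = alloca [2 x i32], align 4
  ; Plain (non-inbounds) GEP: the result points 8 bytes before %buf.
  %base = getelementptr i8, ptr %buf, i64 -8
  ; Lanes 0 and 1 would fall before %buf but are masked off; lanes 2 and 3 land
  ; inside %buf, so under the clarified semantics only they must be inbounds,
  ; and no exception is raised for the masked-off lanes.
  ; (%buf is uninitialized here; the sketch only illustrates the bounds rule.)
  %v = call <4 x i32> @llvm.masked.load.v4i32.p0(ptr %base, i32 4, <4 x i1> <i1 false, i1 false, i1 true, i1 true>, <4 x i32> zeroinitializer)
  ret <4 x i32> %v
}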

RalfJung (Contributor, Author) commented May 2, 2024

@nikic I have updated the wording to

The result of this operation is equivalent to a regular vector load instruction followed by a 'select' between the loaded and the passthru values, predicated on the same mask, except that the masked-off lanes are not accessed.

This is followed by clarifications regarding exceptions, noalias, and data races. Does that work for you?

nikic (Contributor) left a review comment

LGTM, but a second opinion wouldn't hurt.

Two earlier review threads on llvm/docs/LangRef.rst were marked outdated and resolved.
nikic changed the title from "clarify semantics of masked vector load/store" to "[LangRef] Clarify semantics of masked vector load/store" (May 3, 2024)
nikic requested a review from preames (May 3, 2024 03:32)