Add workgroupuniform load built-in and analysis #3586

alan-baker · 2022-11-09T15:09:28Z

Add workgroupUniformLoad built-in function as
new synchronization built-ins
Rework synchronization built-ins to match style of other sections
Modify rvalue expression table to split variable identifier rows into
two
- the first for when the load rule is not invoked maintains the
  incoming uniformity (that is pointers and references are uniform
  while they remain memory views)
- the second for when the load rule is invoked retains the previous
  behaviour

github-actions · 2022-11-09T15:12:57Z

Previews, as seen when this build job started (6198f55):
WebGPU _{webgpu.idl | Explainer | Correspondence Reference}
WGSL _{grammar.js | wgsl.lalr.txt}

wgsl/index.bs

alan-baker · 2022-11-10T17:37:41Z

Open question: do we want a version of workgroupBroadcast based on local invocation index?

wgsl/index.bs

dneto0 · 2022-11-16T14:16:09Z

From the 2022-11-15 meeting, the committee agreed to proceed with workgroupUniformLoad.

wgsl/index.bs

dneto0

workgroupBroadcast is not yet agreed by the committee.

workgroupUniformLoad needs further qualification: The result is uniform only if the pointer argument is uniform. Details posted to the issue.

dneto0 · 2022-11-17T23:12:20Z

wgsl/index.bs

+    space.
+</table>
+
+### `workgroupBroadcast` ### {#workgroupBroadcast-builtin}


Committee 2022-11-15 did not (yet) agree to workgroupBroadcast

Make the synchronzation builtins section look like the other builtins sections. This takes the editorial-only changes from gpuweb#3586

Fixes gpuweb#2586 * Add workgroupUniformLoad built-in function as new synchronization built-ins * Rework synchronization built-ins to match style of other sections * Modify rvalue expression table to split variable identifier rows into two * the first for when the load rule is not invoked maintains the incoming uniformity (that is pointers and references are uniform while they remain memory views) * the second for when the load rule is invoked retains the previous behaviour

dneto0

Thanks!

dneto0 · 2022-11-29T17:19:38Z

wgsl/index.bs

@@ -9805,6 +9807,13 @@ The rules for analyzing expressions take as argument both the expression itself
      <td class="nowrap">*CF*, *CF*
      <td>
   <tr><td>identifier [=resolves|resolving=] to function-scope variable "x"
+      where the [=load rule=] is not invoked during [=type checking=]


Yep.
This aligns uniformity analysis with how we do type-checking and the conceptual model of how execution/evaluation occurs.
This is the end result of what @jimblandy suggested, and it works out really nicely.

dneto0 · 2022-11-29T17:21:17Z

Would also fix #3602

* Move function scope pointer desugaring to a higher level section * generalize to pointer desugaring * function-scope pointer parameter variable substitution * let-declaration desugaring * update modified identifier expression rules to require that "x" is also the root identifier

dneto0

I think there's a technical problem with the pointer desugaring, due to incomplete capture of values of variables that may change between the pointer declaration and its use.

wgsl/index.bs

dneto0 · 2022-11-30T20:55:06Z

wgsl/index.bs

+Each [=let-declaration=] with an [=effective-value-type=] that is a [=pointer
+type=] is desugared as follows:
+* The initializer expression of the declaration is recorded.
+* Each [=identifier=] that [=resolves=] is substituted with the recorded


Missing words here? Each identifier that resolves to a let-declaration ...

Interestingly, it doesn't matter what order the substitutions are made.

Might want to add a note:

The order of substitutions is not significant, always settling with the same final result.

Is there a technical problem here? Because the indexing into those desugared pointers could have its uniformity properties changed. They may refer to variables that have had full assignments on them in the meantime, or vice versa. Basically, you have to properly capture the values of variables at the point of recording the initializer for the let-decls.

E.g. see how 'foo' evolves through desugaring. Yet, variable i receives a full assignment that makes it uniform halfway through the function.

@group(0) @binding(0) var t: texture_2d<f32>; @group(0) @binding(1) var s: sampler; fn foo(parami:i32) -> vec4<f32> { var i = parami; var v = array<i32,2>(-1,1); let p = &v[i]; if (v[i] > 0) { return textureSample(t,s,vec2(.0)); // causes uniformity error. } i = 0; // full assignment if (v[i] > 0) { return textureSample(t,s,vec2(.0)); // no uniformity error } return vec4<f32>(); } fn foo_via_pointers(parami:i32) -> vec4<f32> { var i = parami; var v = array<i32,2>(-1,1); let p = &v[i]; if (*p > 0) { return textureSample(t,s,vec2(.0)); // should cause uniformity error. } i = 0; // full assignment if (*p > 0) { return textureSample(t,s,vec2(.0)); // should still cause uniformity error } return vec4<f32>(); } fn foo_via_pointers_desugared(parami:i32) -> vec4<f32> { var i = parami; var v = array<i32,2>(-1,1); let p = &v[i]; if (*(&v[i]) > 0) { return textureSample(t,s,vec2(.0)); // should cause uniformity error. } i = 0; // full assignment if (*(&v[i]) > 0) { return textureSample(t,s,vec2(.0)); // should still cause uniformity error // but it won't } return vec4<f32>(); } @fragment fn main(@builtin(position) p: vec4<f32>) -> @location(0) vec4<f32> { return foo(i32(trunc(p.x))); }

Good catch. The desugaring should be capturing the values of mutable variables other than the root identifier.

wgsl/index.bs

* Capture component subexpression values when desugaring pointers * fix references and typos * Added a note about type checking ordering * Improve wording of identifier expression uniformity rules

dneto0

LGTM.
Thanks for handling this delicate rule!

dneto0 · 2022-12-05T15:58:03Z

wgsl/index.bs

-* Each [=identifier=] that [=resolves=] is substituted with the recorded
-    initializer expression (wrapped in a
-    [[#parenthesized-expressions|parenthesized expression]]).
+* Visit each subexpression, *SE*, of the initializer expression of *L* in a postorder depth-first traversal:


Right. It has to be post-order because you want to capture the most deeply nested item first.

Then I started thinking: Does this have to specify left-to-right traversal as well? I don't think it does.
Interestingly, the desugaring doesn't have to produce and equivalent value, just an equivalent uniformity property.
And the only mutable variable that can have its uniformity affected is a function-scope var. And the analysis only updates that uniformity property in an assigment statement in the same function. So this desugaring is sufficient.

If we actually wanted to capture the right value, we'd also have to capture the values and side effects of function calls. That's a lot more work, and not needed here.

dneto0 · 2022-12-05T15:59:37Z

wgsl/index.bs

-      the [=root identifier=] of the [=memory view=]
+   <tr><td>identifier [=resolves|resolving=] to function-scope variable "x",
+      where the identifier appears as the [=root identifier=] of a [=memory view=]
+      expression, *MVE*, and the [=load rule=] is not invoked on *MVE* during


nit: Might be worth emphasizing the "not", either italics or bold.

An example suggested from internal review:

Consider let p = &y[x[10]];

In this case:

one MVE is x[10] and it does have the load rule applied. The root identifier is x.

the other MVEs are y[x[10]] and &y[x[10]], with root identifier y, but neither of those have the load rule applied during type checking.

jrprice

The mechanics of this LGTM, and aligns with how we plan to implement this in Tint.

We can also remove the special-case for arrayLength from the analysis, but happy for that to be a follow-up change if preferred.

dneto0 · 2022-12-05T22:50:25Z

Discussed internally at Google:

This can be expanded in future when more general pointers are added.
- In particular, when adding pointer-to-workspace params, they can be desugared into fresh module-scope variables in workgroup address space; where "fresh" means each pointer formal param in the source corresponds to a distinct module-scope variable. That's sufficient for analysis because we assume the aliasing restrictions are still in place. The variable would be seen as non-uniform because it's still mutable and shared among invocations. There may be other ways to handle this too.

alan-baker added the wgsl WebGPU Shading Language Issues label Nov 9, 2022

alan-baker added this to the V1.0 milestone Nov 9, 2022

alan-baker requested review from jimblandy, kdashg, jrprice, mehmetoguzderin and dneto0 November 9, 2022 15:09

alan-baker added the uniformity Issues / discussions around uniformity analysis label Nov 9, 2022

jrprice reviewed Nov 9, 2022

View reviewed changes

wgsl/index.bs Outdated Show resolved Hide resolved

wgsl/index.bs Outdated Show resolved Hide resolved

wgsl/index.bs Outdated Show resolved Hide resolved

Kangz reviewed Nov 14, 2022

View reviewed changes

wgsl/index.bs Outdated Show resolved Hide resolved

dneto0 reviewed Nov 17, 2022

View reviewed changes

wgsl/index.bs Show resolved Hide resolved

dneto0 mentioned this pull request Nov 17, 2022

Workgroup Broadcast #2586

Closed

dneto0 requested changes Nov 17, 2022

View reviewed changes

dneto0 pushed a commit to dneto0/gpuweb that referenced this pull request Nov 18, 2022

wgsl: Make sections for each barrier function

da71f56

Make the synchronzation builtins section look like the other builtins sections. This takes the editorial-only changes from gpuweb#3586

dneto0 mentioned this pull request Nov 18, 2022

wgsl: Make sections for each barrier function #3604

Merged

jrprice mentioned this pull request Nov 24, 2022

Designing a uniformity opt-out #3554

Closed

alan-baker force-pushed the workgroup-broadcast branch from a0c2516 to 8a2f225 Compare November 29, 2022 15:04

alan-baker changed the title ~~Add workgroup broadcast and uniform load built-ins~~ Add workgroupuniform load built-in and analysis Nov 29, 2022

alan-baker requested review from dneto0 and jrprice November 29, 2022 15:06

alan-baker mentioned this pull request Nov 29, 2022

Add uniformity analysis opt-out for derivative-using builtins #3644

Closed

dneto0 approved these changes Nov 29, 2022

View reviewed changes

alan-baker requested a review from dneto0 November 30, 2022 14:35

dneto0 requested changes Nov 30, 2022

View reviewed changes

Changes for review

6198f55

* Capture component subexpression values when desugaring pointers * fix references and typos * Added a note about type checking ordering * Improve wording of identifier expression uniformity rules

dneto0 approved these changes Dec 5, 2022

View reviewed changes

jrprice approved these changes Dec 5, 2022

View reviewed changes

dneto0 merged commit 43d9242 into gpuweb:main Dec 5, 2022

This was referenced Dec 5, 2022

wgsl: add workgroupUniformLoad, update uniformity analysis for pointers, references vs. their contents gpuweb/cts#2054

Closed

wgsl: full reference to module-scope variable is uniform, the value stored there may not be #3602

Closed

jrprice mentioned this pull request Dec 9, 2022

Distinguish between pointer and pointer contents for uniformity requirements on parameters #3677

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add workgroupuniform load built-in and analysis #3586

Add workgroupuniform load built-in and analysis #3586

alan-baker commented Nov 9, 2022 •

edited

Loading

github-actions bot commented Nov 9, 2022 •

edited

Loading

alan-baker commented Nov 10, 2022

dneto0 commented Nov 16, 2022

dneto0 left a comment

dneto0 Nov 17, 2022

dneto0 left a comment

dneto0 Nov 29, 2022

dneto0 commented Nov 29, 2022

dneto0 left a comment

dneto0 Nov 30, 2022

dneto0 Nov 30, 2022

alan-baker Dec 1, 2022

dneto0 left a comment

dneto0 Dec 5, 2022

dneto0 Dec 5, 2022

dneto0 Dec 5, 2022

jrprice left a comment

dneto0 commented Dec 5, 2022

Add workgroupuniform load built-in and analysis #3586

Add workgroupuniform load built-in and analysis #3586

Conversation

alan-baker commented Nov 9, 2022 • edited Loading

github-actions bot commented Nov 9, 2022 • edited Loading

alan-baker commented Nov 10, 2022

dneto0 commented Nov 16, 2022

dneto0 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dneto0 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dneto0 commented Nov 29, 2022

dneto0 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dneto0 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jrprice left a comment

Choose a reason for hiding this comment

dneto0 commented Dec 5, 2022

alan-baker commented Nov 9, 2022 •

edited

Loading

github-actions bot commented Nov 9, 2022 •

edited

Loading