Fragment output <-> attachment interface matching rules def #2013

shrekshao · 2021-08-03T17:58:00Z

Right now the WebGPU spec and WGSL spec are vague about the fragment output and attachment interface matching rules.

https://www.w3.org/TR/webgpu/#fragment-state

The pipeline output type must be compatible with colorState.format.

The rule should have two parts.

For BaseType: The scalar type (f32, i32, or u32) must match the sample type of the format. (Pretty clear).
For componentCounts: here are results shown by some tests (when the component count of output and channel count of fragment color attachment don't match)

	Metal	D3D12	Vulkan(Win)
`vec4<f32>` <-> `R8Unorm`	✔️	✔️	✔️
`f32` <-> `RGBA8Unorm`	❌Error	❌ Undefined value for unwritten channels (Error with backend validation)	❌ Undefined value ~~✔️(seems happy, pad with 0)~~

Here are some related spec text I found (please help add refs)

We have 2 options here:

State in the spec that component count of the fragment color output should be equal or bigger than the channel count of the colorState.format. The extra components are discarded. Otherwise there would be a validation error.
Do a transform of the user-input WGSL shader in our implementations to make sure the output is always expanded to vec4<T>, without validating the componentCount.

WDYT

The text was updated successfully, but these errors were encountered:

kainino0x · 2021-08-03T18:13:08Z

FWIW, reading the Vulkan spec I just realized why it's always allowed to output too many components: fragment outputs don't go directly into the attachment, they go to the blending unit. The blending unit may use the output alpha value even if the attachment doesn't have an alpha channel.

We need to determine what the blending unit sees in the alpha channel if the shader doesn't output it. If it's an undefined value (edit: see next comment), I think we have to choose one of these options:

Pad all outputs to vec4 (on such backends)
Validate outputs are vec4
Validate outputs are vec4 if they use blending
Validate outputs are vec4 if they use blending modes that read the alpha channel

~~If it's defined, then option 1 (the first option Shrek posted) is additionally still valid.~~

Aside: I found this in the Vulkan spec section you linked:

Any value that cannot be represented in the attachment’s format is undefined.

I guess this means there are technically undefined values if you write outside of the range of a unorm/snorm attachment type. That seems like a pretty minor hazard but I'm surprised by it.

kainino0x · 2021-08-03T18:15:15Z

Oh, finally found the relevant sentence in Vulkan (emphasis added):

The input values to blending or color attachment writes are undefined for components which do not correspond to a fragment shader output.

kvark · 2021-08-03T21:43:40Z

Injecting shader code is always extra work, and also incompatible with the idea of generating MTLLibrary ahead of time. So we should play safe and say that for now the shader output has to:

have at least as many components as the target format
be a vec4 if any of the blend factors for this attachment include one of the following:
- "src-alpha"
- "one-minus-src-alpha"
- "src-alpha-saturated"

kainino0x · 2021-08-03T22:10:04Z

note: I don't think it's incompatible with generating ahead of time. We don't have to pad to the size of the format, we can always pad to vec4 unconditionally.

kainino0x · 2021-08-09T19:18:29Z

Resolution: Add the precise validation rules (option 5, the one kvark described just above)

kainino0x · 2021-08-09T20:57:11Z

Adding to editor agenda to clarify a corner case: what if the format has alpha, but blending is used and the fragment output alpha actually isn't used?

kainino0x · 2021-08-09T21:16:22Z

Editors: That should also be valid. Unless the validation layers complain about that.

Buuut then what if the write mask prevents any actual undefined values? Then we should also allow that. Unless the validation layers complain about that. @shrekshao do you have things set up so you could check what the Metal and d3d12 debug layers think about these things?

shrekshao · 2021-08-09T21:38:59Z

With fragment output set to f32, attachment format RGBA8Unorm, writeMask GPUColorWrite::Red, D3D12 still treats it as invalid.

So seems that we don't have to consider write mask state at validation

kainino0x · 2021-08-09T21:45:09Z

Good to know! Could you also check if D3D12 considers the blend factors in its debug layer?

shrekshao · 2021-08-09T22:21:04Z

For blending, at least D3D12 doesn't care if blending is null, or enabled but srcFactor/dstFactor doesn't include src-alpha, when writing vec3<f32> to RGBA8Unorm. It simply said it's unmatched. This actually makes implementing the validation a bit easier.

ID3D12Device::CreateGraphicsPipelineState: Pixel Shader output 'SV_Target0' is writing to 3 components, while the corresponding RenderTarget slot [0] has format (R8G8B8A8_UNORM) with 4 component(s). This results in undefined contents in the unwritten channels of the render target. [ STATE_CREATION WARNING #974: CREATEGRAPHICSPIPELINESTATE_PS_OUTPUT_RT_OUTPUT_MISMATCH]

kainino0x · 2021-08-10T00:53:48Z

Hmm, that seems like there might be a deficiency in the D3D12 debug layer. Am I correct in understanding it allows the following case? Which should be invalid.

output vec3
attachment format is rg
blending is enabled, factors include src-alpha

In any case, we will make that invalid, because it's undefined according to the Vulkan spec. So I think the rules might be:

have at least as many components as the target format

be a vec4 if any of the [color] blend factors for this attachment include one of the following:

"src-alpha"

"one-minus-src-alpha"

"src-alpha-saturated"

be a vec4 if:
- the target format has alpha, and
- [edited] the .alpha.srcFactor is not 'zero' OR the .alpha.dstFactor includes src or src-alpha

Plus whatever is needed for #2025.

shrekshao · 2021-08-10T01:40:06Z

(Sorry I misunderstood here. I tested against attachment format rgba first)

For

output vec3

attachment format is rg

blending is enabled, factors include src-alpha

D3D12 debug layer (--enable-backend-validation in dawn) let it passed. We should validate this since the alpha channel would be undefined value.

The conclusion seems good.

kainino0x · 2021-08-10T02:12:04Z

(Sorry I misunderstood here. I tested against attachment format rgba first)

No misunderstanding, I just thought of this when I read your latest result at the time. :)

kainino0x · 2021-08-10T07:24:10Z

any of the alpha blend factors for this attachment include src or src-alpha.

Realized this is wrong, I think it should say

the .alpha.srcFactor is not 'zero' OR the .alpha.dstFactor includes src or src-alpha

kainino0x · 2021-08-12T05:22:52Z

the .alpha.srcFactor is not 'zero'

reading/thinking about this YET FURTHER, this may also be wrong. If src.alpha is undefined, then multiplying it by zero is still undefined, e.g. if src.alpha was NaN.

Unfortunately I can't find a concrete answer to either issue (#2013 or #2025) in the D3D11 functional spec - all language seems to ignore the possibility that the blend factor uses an alpha value that wasn't output by the shader. However I did find some interesting sections that very explicitly call out the use of write masks to protect against undefined results:

https://microsoft.github.io/DirectX-Specs/d3d/archive/D3D11_3_FunctionalSpec.htm#17.15%20Output%20Write%20Masks

Failure to provide sufficient data to the Output Merger for all of the RenderTarget(s)/component(s) enabled with the write masks results in undefined values being written out.

https://microsoft.github.io/DirectX-Specs/d3d/archive/D3D11_3_FunctionalSpec.htm#OutputWrites

Partial writes to a given o# output register (writing a nonempty proper subset of the declared components) will produce undefined results in the unwritten component(s) that were declared for output. i.e. Declaring o0.rga but only writing o0.r means the RenderTarget location for o0.ga will be written with undefined values. However the application can take advantage of the write-enable masks to prevent undefined values from being written out and thus vary outputs with flow control in a Shader, as long as the condition doesn't vary within a given Draw*() (since the write-enable masks can only be updated between Draw*() calls).

kainino0x · 2021-08-12T05:42:54Z

Ahh the addition I made to the rules earlier (rule 3) is actually redundant: For any format that has an alpha channel, the shader must output vec4 anyway due to rule 1.

(edited) So the first 2 rules in #2013 (comment) were correct, but revising rule 1 here for generality/preciseness:

Shader output must:

have a superset of the components of the target format
be a vec4 if any of the color blend factors for this attachment include one of the following:
- "src-alpha"
- "one-minus-src-alpha"
- "src-alpha-saturated"

shrekshao · 2021-08-12T17:02:30Z

Does the revised rule without rule 3 covers this case:
fragment output: vec3<f32>
attachemnt format: rg8unorm
blending:
alpha.dstFactor: src-alpha

The attachment doesn't actually have an alpha channel, so the src-alpha although reading from something undefined but the value is discarded anyway.

So seems the revised rule is correct...

kainino0x · 2021-08-12T17:12:55Z

My mental model from reading the D3D spec is that blending for all 4 channels always runs, what matters is whether undefined values make it into the output. So in this case, the alpha channel blending reads dst=undef, multiplies it with dstFactor=undef, resulting in a final value of undef, but that doesn't get written so it doesn't matter.

kainino0x · 2021-08-12T17:14:21Z

Ohh yes sorry, the revised rules I posted aren't actually the original 2 rules. They are still changed to say "color blend factors" instead of "blend factors"

Kangz · 2021-12-14T18:53:44Z

What still needs to happen to be able to close this issue?

kainino0x · 2021-12-15T03:10:52Z

I think we figured out what the spec needs to say, so just spec work.

kainino0x added this to Needs Discussion in Main Aug 3, 2021

kainino0x added this to the MVP milestone Aug 3, 2021

kainino0x moved this from Needs Discussion to Needs Specification in Main Aug 9, 2021

kainino0x added the for webgpu editors meeting label Aug 9, 2021

kainino0x removed the for webgpu editors meeting label Aug 9, 2021

kainino0x assigned shrekshao Aug 9, 2021

toji mentioned this issue Aug 9, 2021

Do pipeline blend states need to be validated against the pipline render targets? #2025

Closed

shrekshao mentioned this issue Aug 10, 2021

Add type validation tests for fragment outputs and attachments gpuweb/cts#676

Merged

5 tasks

shrekshao mentioned this issue Aug 12, 2021

Remove some alpha blending factor validation gpuweb/cts#708

Merged

4 tasks

kainino0x added the copyediting Pure editorial stuff (copyediting, bikeshed, etc.) label Dec 15, 2021

kainino0x assigned kainino0x and unassigned shrekshao Sep 2, 2022

kainino0x mentioned this issue Sep 2, 2022

Matching rules for fragment shader output and color attachments #3408

Merged

kainino0x closed this as completed in #3408 Oct 3, 2022

kainino0x mentioned this issue Jan 5, 2023

Define/document behavior of missing channels in blending #3720

Merged

kainino0x mentioned this issue Dec 8, 2023

Adjust fragment state blend factor and write mask validation gpuweb/cts#3217

Merged

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fragment output <-> attachment interface matching rules def #2013

Fragment output <-> attachment interface matching rules def #2013

shrekshao commented Aug 3, 2021 •

edited

kainino0x commented Aug 3, 2021 •

edited

kainino0x commented Aug 3, 2021 •

edited

kvark commented Aug 3, 2021

kainino0x commented Aug 3, 2021 •

edited

kainino0x commented Aug 9, 2021

kainino0x commented Aug 9, 2021 •

edited

kainino0x commented Aug 9, 2021

shrekshao commented Aug 9, 2021 •

edited by kainino0x

kainino0x commented Aug 9, 2021

shrekshao commented Aug 9, 2021 •

edited

kainino0x commented Aug 10, 2021 •

edited

shrekshao commented Aug 10, 2021

kainino0x commented Aug 10, 2021

kainino0x commented Aug 10, 2021

kainino0x commented Aug 12, 2021 •

edited

kainino0x commented Aug 12, 2021 •

edited

shrekshao commented Aug 12, 2021

kainino0x commented Aug 12, 2021

kainino0x commented Aug 12, 2021

Kangz commented Dec 14, 2021

kainino0x commented Dec 15, 2021

Fragment output <-> attachment interface matching rules def #2013

Fragment output <-> attachment interface matching rules def #2013

Comments

shrekshao commented Aug 3, 2021 • edited

kainino0x commented Aug 3, 2021 • edited

kainino0x commented Aug 3, 2021 • edited

kvark commented Aug 3, 2021

kainino0x commented Aug 3, 2021 • edited

kainino0x commented Aug 9, 2021

kainino0x commented Aug 9, 2021 • edited

kainino0x commented Aug 9, 2021

shrekshao commented Aug 9, 2021 • edited by kainino0x

kainino0x commented Aug 9, 2021

shrekshao commented Aug 9, 2021 • edited

kainino0x commented Aug 10, 2021 • edited

shrekshao commented Aug 10, 2021

kainino0x commented Aug 10, 2021

kainino0x commented Aug 10, 2021

kainino0x commented Aug 12, 2021 • edited

kainino0x commented Aug 12, 2021 • edited

shrekshao commented Aug 12, 2021

kainino0x commented Aug 12, 2021

kainino0x commented Aug 12, 2021

Kangz commented Dec 14, 2021

kainino0x commented Dec 15, 2021

shrekshao commented Aug 3, 2021 •

edited

kainino0x commented Aug 3, 2021 •

edited

kainino0x commented Aug 3, 2021 •

edited

kainino0x commented Aug 3, 2021 •

edited

kainino0x commented Aug 9, 2021 •

edited

shrekshao commented Aug 9, 2021 •

edited by kainino0x

shrekshao commented Aug 9, 2021 •

edited

kainino0x commented Aug 10, 2021 •

edited

kainino0x commented Aug 12, 2021 •

edited

kainino0x commented Aug 12, 2021 •

edited