
Pipeline cache API and implementation for Vulkan #5319

Open · DJMcNab wants to merge 30 commits into trunk

Conversation

@DJMcNab (Contributor) commented Feb 28, 2024

Connections

Description
Adds a PipelineCache resource to wgpu's API and wires it through to Vulkan's vkCreatePipelineCache.
The cache is added as a parameter to ComputePipelineDescriptor and RenderPipelineDescriptor.
This parameter is used as expected on Vulkan, and ignored on DirectX/Metal/WebGPU/GLES.

Testing
Testing has been performed on a Pixel 6, which validated that the cache works as expected between runs.
The test integration is in the branch in my fork of Vello.
This only exercises the cache for Vulkan pipelines.
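The between-runs behaviour being tested amounts to persisting the driver's opaque blob across executions. A minimal sketch of that save/load cycle using only the standard library (the function names and file path are illustrative, not wgpu API):

```rust
use std::fs;
use std::path::Path;

/// Load previously saved pipeline cache data, if any; `None` means a
/// cold start (or an unreadable file), and the cache starts empty.
fn load_cache_data(path: &Path) -> Option<Vec<u8>> {
    fs::read(path).ok()
}

/// Persist the data the driver handed back, writing via a temporary
/// file so a crash mid-write cannot leave a truncated cache behind.
fn save_cache_data(path: &Path, data: &[u8]) -> std::io::Result<()> {
    let tmp = path.with_extension("tmp");
    fs::write(&tmp, data)?;
    fs::rename(&tmp, path)
}

fn main() -> std::io::Result<()> {
    let path = Path::new("pipeline_cache.bin");
    let _ = fs::remove_file(path); // simulate a cold start
    assert!(load_cache_data(path).is_none());
    save_cache_data(path, b"opaque driver blob")?;
    // On the next run, this data would be passed back to the driver.
    assert_eq!(load_cache_data(path).as_deref(), Some(&b"opaque driver blob"[..]));
    println!("round trip ok");
    Ok(())
}
```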

Checklist

  • Run cargo fmt.
  • Run cargo clippy. If applicable, add:
    • --target wasm32-unknown-unknown
    • --target wasm32-unknown-emscripten - gets the same errors on main, so I assume it's fine
  • Run cargo xtask test to run tests - I get the same failures on main, so I assume it's fine
  • Add change to CHANGELOG.md. See simple instructions inside file.

Status
This PR is now fully ready for review

Future Work
I am planning to implement additional validation inside wgpu that the cache is suitable, as discussed in https://medium.com/@zeuxcg/creating-a-robust-pipeline-cache-with-vulkan-961d09416cda. That is mainly about storing the driver version and file size inline in the file. This should be done before the feature is released, but could happen in a different pull request. Please advise if you have a preference here.

This could also be extended to other platforms/APIs - see e.g. gfx-rs/gfx#2877.
I am not intending to do that work myself.

@DJMcNab (Contributor, Author) commented Mar 1, 2024

cc @expenses, as you previously investigated this in gfx-rs/gfx#3716

@cwfitzgerald (Member) left a comment

Thanks for this! The code itself is fine, keep it up like this, and the documentation is dreamy.

The API has a few issues at multiple layers. At some points wgpu-hal is expected to do validation; at others the end user is just expected to know how to pass the correct thing. I think we should unify the API as follows.

wgpu-hal

The wgpu-hal layer should just expect that the blob passed to it is correct for that platform and driver combination. It shouldn't do any additional validation. All of its internal code can expect that the cache key is honored.

It may append additional data structures to the blob which contain shader intermediate states (such as HLSL generated or DXBC/DXIL).

There is a method on the Adapter which returns a backend-specific string of bytes to serve as a "cache key". This includes the wgpu-hal version.

wgpu-core

Core does all of the validation that the key is honored and the file was not corrupted.

It adds metadata to the top of the blob that stores the magic number, file size, file hash, and pipeline key.

If this errors, it will create a new empty cache.
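The framing proposed for wgpu-core can be sketched in plain Rust. Everything here is illustrative (the magic number, field order, and use of `DefaultHasher` are assumptions, and a real file format would want a hash that is stable across Rust releases, which `DefaultHasher` is not):

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

const MAGIC: &[u8; 8] = b"WGPUPLCH"; // hypothetical magic number

fn hash_bytes(data: &[u8]) -> u64 {
    let mut h = DefaultHasher::new();
    data.hash(&mut h);
    h.finish()
}

fn read_u64(d: &[u8], at: usize) -> Option<u64> {
    d.get(at..at + 8)?.try_into().ok().map(u64::from_le_bytes)
}

/// Frame a raw driver blob with magic, size, hash, and pipeline key.
fn write_cache(key: &[u8], blob: &[u8]) -> Vec<u8> {
    let mut out = MAGIC.to_vec();
    out.extend_from_slice(&(blob.len() as u64).to_le_bytes());
    out.extend_from_slice(&hash_bytes(blob).to_le_bytes());
    out.extend_from_slice(&(key.len() as u64).to_le_bytes());
    out.extend_from_slice(key);
    out.extend_from_slice(blob);
    out
}

/// Validate the header; `None` means "fall back to a new empty cache".
fn read_cache(key: &[u8], data: &[u8]) -> Option<Vec<u8>> {
    if !data.starts_with(MAGIC) {
        return None;
    }
    let blob_len = read_u64(data, 8)? as usize;
    let blob_hash = read_u64(data, 16)?;
    let key_len = read_u64(data, 24)? as usize;
    let stored_key = data.get(32..32 + key_len)?;
    let blob = data.get(32 + key_len..)?;
    if stored_key != key || blob.len() != blob_len || hash_bytes(blob) != blob_hash {
        return None; // corrupted, or written for a different adapter/driver
    }
    Some(blob.to_vec())
}

fn main() {
    let framed = write_cache(b"vulkan;driver=1", b"blob bytes");
    assert_eq!(read_cache(b"vulkan;driver=1", &framed).as_deref(), Some(&b"blob bytes"[..]));
    // A key mismatch (e.g. after a driver update) rejects the cache.
    assert!(read_cache(b"vulkan;driver=2", &framed).is_none());
    println!("validation ok");
}
```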

wgpu

Passes through the core apis.

User

The user will pull the pipeline key from the adapter, hash it, and use it as a filename to store the data.

Does this all make sense?

@DJMcNab (Contributor, Author) commented Mar 14, 2024

Thanks for the review @cwfitzgerald

In terms of validation, my intent was to follow up on that separately, because I wasn't sure I was doing the right thing. Since the rest of the shape is approximately right, I can definitely add that validation in this PR.
One thing I need guidance on is how to hash the data in wgpu-core. Are you aware of a preferred library for this? It feels like the sort of thing which is quite opinionated.

I just want some clarification about what you mean by:

The user will pull the pipeline key from the adapter, hash it, and use it as a filename to store the data.

In my prototype integration, I wasn't hashing the filename. I find it unlikely that users would do so, either.
Should we pre-hash the value from the cache key method?

@DJMcNab (Contributor, Author) commented Mar 14, 2024

It shouldn't do any additional validation

This is also slightly troubling, as different APIs have different validation needs. For example, Vulkan probably needs to validate that the driver version is the same, whereas the concept of a driver version isn't exposed at the higher levels.

@xorgy commented Mar 14, 2024

For example, Vulkan probably needs to validate the driver version is the same, whereas the concept of a driver version isn't exposed at the higher levels

That's interesting... are there Vk drivers in the wild that accept incompatible pipeline caches from applications, or is this more preemptive (due to the quality issues with some drivers)?

@DJMcNab
Copy link
Contributor Author

DJMcNab commented Mar 14, 2024

According to the linked article, there have been cases in the past where a driver either:

  1. Doesn't validate the UUID which is meant to detect this at all, or
  2. Fails to update the UUID when a change would make old caches incompatible.
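For reference, the Vulkan specification prescribes a prelude at the start of pipeline cache data (VkPipelineCacheHeaderVersionOne: header length, header version, vendorID, deviceID, pipelineCacheUUID), and the UUID comes from VkPhysicalDeviceProperties. A defensive check against the running adapter, rather than trusting the driver, could look like this std-only sketch (little-endian layout assumed; the spec says fields use host endianness):

```rust
/// The prelude Vulkan requires at the start of pipeline cache data
/// (see VkPipelineCacheHeaderVersionOne in the Vulkan spec).
#[derive(Debug, PartialEq)]
struct VkCacheHeader {
    header_size: u32,
    header_version: u32, // VK_PIPELINE_CACHE_HEADER_VERSION_ONE == 1
    vendor_id: u32,
    device_id: u32,
    pipeline_cache_uuid: [u8; 16],
}

fn read_u32(d: &[u8], at: usize) -> Option<u32> {
    d.get(at..at + 4)?.try_into().ok().map(u32::from_le_bytes)
}

fn parse_header(data: &[u8]) -> Option<VkCacheHeader> {
    Some(VkCacheHeader {
        header_size: read_u32(data, 0)?,
        header_version: read_u32(data, 4)?,
        vendor_id: read_u32(data, 8)?,
        device_id: read_u32(data, 12)?,
        pipeline_cache_uuid: data.get(16..32)?.try_into().ok()?,
    })
}

/// Defensive check against the adapter's properties, for drivers
/// that fail to validate (or update) the UUID themselves.
fn cache_usable(data: &[u8], vendor_id: u32, device_id: u32, uuid: &[u8; 16]) -> bool {
    parse_header(data).map_or(false, |h| {
        h.header_version == 1
            && h.vendor_id == vendor_id
            && h.device_id == device_id
            && &h.pipeline_cache_uuid == uuid
    })
}

fn main() {
    // Synthetic blob: 32-byte header followed by driver data.
    let uuid = [7u8; 16];
    let mut blob = Vec::new();
    blob.extend_from_slice(&32u32.to_le_bytes()); // header size
    blob.extend_from_slice(&1u32.to_le_bytes()); // header version one
    blob.extend_from_slice(&0x10DEu32.to_le_bytes()); // vendor id
    blob.extend_from_slice(&0x1234u32.to_le_bytes()); // device id
    blob.extend_from_slice(&uuid);
    blob.extend_from_slice(b"opaque driver payload");
    assert!(cache_usable(&blob, 0x10DE, 0x1234, &uuid));
    assert!(!cache_usable(&blob, 0x10DE, 0x1234, &[0u8; 16]));
    println!("uuid check ok");
}
```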

@cwfitzgerald cwfitzgerald self-requested a review March 14, 2024 17:00
@DJMcNab DJMcNab marked this pull request as draft March 14, 2024 17:50
@cwfitzgerald (Member) commented

This is also slightly troubling, as different APIs have different validation needs. For example, Vulkan probably needs to validate the driver version is the same, whereas the concept of a driver version isn't exposed at the higher levels

The idea is that wgpu-hal decides on its pipeline key based on its validation needs, and wgpu-core is the layer that compares and rejects the pipeline key. This way the driver version will be part of the wgpu-hal pipeline key.

In my prototype integration, I wasn't hashing the filename. I find it unlikely that users would do so, either.
Should we pre-hash the value from the cache key method?

If we make the cache key an opaque type, we can put into_bytes and hash methods on it.
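A hypothetical sketch of such an opaque key type (the name, constructor inputs, and use of `DefaultHasher` are all assumptions; `DefaultHasher` is deterministic within a toolchain but not guaranteed stable across Rust releases, so a shipped version would want a fixed hash function):

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

/// Opaque, backend-specific cache key: users can serialize or hash it
/// but not inspect or forge it, leaving wgpu free to change its layout.
pub struct PipelineCacheKey(Vec<u8>);

impl PipelineCacheKey {
    /// What goes in here is up to the backend; these inputs are examples.
    fn new(backend: &str, driver_version: u32, hal_version: &str) -> Self {
        let mut bytes = backend.as_bytes().to_vec();
        bytes.extend_from_slice(&driver_version.to_le_bytes());
        bytes.extend_from_slice(hal_version.as_bytes());
        PipelineCacheKey(bytes)
    }

    pub fn into_bytes(self) -> Vec<u8> {
        self.0
    }

    /// Hash suitable for deriving a file name, as suggested above.
    pub fn hash(&self) -> u64 {
        let mut h = DefaultHasher::new();
        self.0.hash(&mut h);
        h.finish()
    }
}

fn main() {
    let key = PipelineCacheKey::new("vulkan", 42, "example-hal-version");
    // Hex-encoding the hash sidesteps the file-name sanitisation
    // concern raised later in this thread.
    let file_name = format!("{:016x}.bin", key.hash());
    assert_eq!(file_name.len(), 20); // 16 hex digits + ".bin"
    println!("cache file: {file_name}");
}
```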

@DJMcNab (Contributor, Author) commented Mar 14, 2024

The key factor which needs to be accounted for is:

  1. If the driver updates, the old cache you still have is useless.

I'm planning on implementing that with effectively two keys: a "device" key (which identifies the device/backend combination) and a further validation key. The device key should relate to the "file name" being used, whereas the validation key is just used to bail from using the cache. That is potentially what you are describing, but I think it's good to clarify.

@DJMcNab DJMcNab marked this pull request as ready for review March 18, 2024 10:56
@DJMcNab (Contributor, Author) commented Mar 18, 2024

This is now ready for follow-up review. I am now going to test this updated version in Vello, and I also need some guidance on computing the hash (it currently uses a hardcoded value).

I'm unlikely to make further functional changes before another review.

@DJMcNab (Contributor, Author) commented Apr 5, 2024

Having done some more thinking on this, I have decided to exclude the hashing of the cache data from this PR. The main reasons for this are:

  • I believe it to be unlikely that filesystems would corrupt the data in a way which keeps the same length but changes some of the content
  • Any attempt at meddling with the file by other processes/applications is already declared out of scope by our safety comment
  • We do not already have a dependency which would compute a suitable hash. We could likely produce a hash designed for a hashmap, but those have significantly different requirements from our hashing needs.
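Under that reasoning, the framing reduces to a stored-length check. An illustrative sketch (layout assumed, not wgpu's actual format):

```rust
/// Prefix the driver blob with its length.
fn frame_cache(blob: &[u8]) -> Vec<u8> {
    let mut out = (blob.len() as u64).to_le_bytes().to_vec();
    out.extend_from_slice(blob);
    out
}

/// Accept the blob only if the stored length matches. Truncation and
/// extension are caught; same-length content corruption is not, which
/// the reasoning above judges unlikely for a filesystem.
fn unframe_cache(data: &[u8]) -> Option<&[u8]> {
    let len = u64::from_le_bytes(data.get(..8)?.try_into().ok()?) as usize;
    let blob = data.get(8..)?;
    (blob.len() == len).then_some(blob)
}

fn main() {
    let framed = frame_cache(b"driver blob");
    assert_eq!(unframe_cache(&framed), Some(&b"driver blob"[..]));
    // A truncated file is treated as absent: start with an empty cache.
    assert_eq!(unframe_cache(&framed[..framed.len() - 1]), None);
    println!("length check ok");
}
```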

I also don't see the necessity of making the pipeline cache key an opaque set of bytes rather than a readable string.
These files should be in a location which is rarely accessed by end users.
Additionally, if using these as a file name, an application would have to sanitise the bytes to be reasonable for use as a file name. I suspect at least some applications would not do so, and would potentially lose some of the portability which wgpu provides.
But I also think it's plausible that I've missed some important reason and that this is an incorrect assessment.

@cwfitzgerald (Member) left a comment

This is absolutely fabulous work!

I think the setup for caching is perfectly reasonable, and the core/hal relationship is exactly what I wanted to see!

Have some comments. I would also like to see an execution test that actually uses the pipeline cache: make a cache, make a pipeline, serialize the cache, re-parse it into a new cache, and make an identical pipeline. Just to make sure everything actually works and there aren't any API-level issues.

@DJMcNab (Contributor, Author) commented Apr 19, 2024

Re-requesting review pending CI

@cwfitzgerald (Member) left a comment

Sorry for taking so long to get back to you on this - we're basically there, just some small nits at this point.

.as_ref()
.and_then(|it| it.cache)
.map(|it| it.raw);
// TODO: What should the behaviour be when both are set and different?
Member replied:

We should just assert they're the same, and document the requirement on the wgpu-hal interface - wgpu-core will always uphold it, so this is fine.

Contributor Author replied:

How should wgpu-core uphold it? With the way PipelineCompilationOptions currently works, the cache is specified directly on each of the vertex and fragment shaders.

The problem is that compilation options are per-shader, whereas the pipeline cache spans the entire pipeline (at least in Vulkan).
I think the least bad option is to specify that the vertex stage's cache is prioritised on platforms where this applies?

Member replied:

Ah! Core should check to make sure the caches are the same - honestly it sounds like it needs to be part of the base pipeline descriptor instead of a per-entrypoint argument.

Contributor Author replied:

Yeah, possibly adding it to the root descriptor is best.

But it's a bit annoying to ask every single user to add this field when they already have a defaulted PipelineCompilationOptions.

I can also imagine these being meaningfully distinct, such as for producing DXIL (although combining them would also work in that case).

An alternative is to merge the caches on Vulkan and make them alias the same raw cache object. There don't seem to be any good options, because of the semantics here.

I won't be addressing this until at least Thursday, but I suppose putting it in the top-level descriptor is actually best.

Hinting to users in the upgrade notes that the cache exists and that they should/could use it is possibly a good thing, anyway.

Member replied:

Sounds like root is the way to go. Give me a buzz when that's ready and we can get this in.

@DJMcNab DJMcNab force-pushed the pipeline-cache-api-vulkan branch from d3d4fb9 to 20906e9 Compare May 9, 2024 08:45
DJMcNab and others added 3 commits May 9, 2024 10:47
Co-authored-by: Connor Fitzgerald <connorwadefitzgerald@gmail.com>
@DJMcNab DJMcNab requested a review from cwfitzgerald May 14, 2024 08:08
@cwfitzgerald (Member) left a comment

Just need your thoughts on where the pipeline cache should go, then we're good to land.

@DJMcNab DJMcNab requested a review from cwfitzgerald May 16, 2024 11:53
Successfully merging this pull request may close these issues.

Pipeline Caching (between program executions)