
Conversation


@andriyDev andriyDev commented Nov 24, 2025

Objective

  • When processing assets, we first read the whole asset into memory, then process it from that in-memory representation. This means that large assets may simply not fit into memory, causing much bigger issues (e.g., the OS switching over to slow virtual memory).

Solution

  • Read the asset twice during processing: 1) once to determine the asset hash, 2) once to actually process the asset.

This means the processing code itself doesn't need to read the whole asset into memory at any point, meaning we can now process much bigger assets.
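
The two-pass scheme can be sketched roughly as follows. This is a synchronous stand-in using std I/O rather than Bevy's actual async `Reader`, and every name in it (`hash_asset`, `process_asset`, the FNV-1a hash, the file path) is illustrative, not the PR's real code. The point is that each pass streams the asset through a small fixed-size buffer, so only `CHUNK` bytes are ever resident:

```rust
use std::fs::File;
use std::io::{self, Read};

const CHUNK: usize = 1024;

/// Pass 1: compute a hash over the asset in fixed-size chunks.
/// FNV-1a here is just a stand-in for whatever hash the processor uses.
fn hash_asset(path: &str) -> io::Result<u64> {
    let mut file = File::open(path)?;
    let mut buffer = [0u8; CHUNK];
    let mut hash: u64 = 0xcbf29ce484222325;
    loop {
        let bytes_read = file.read(&mut buffer)?;
        if bytes_read == 0 {
            break; // EOF
        }
        for &b in &buffer[..bytes_read] {
            hash ^= b as u64;
            hash = hash.wrapping_mul(0x100000001b3);
        }
    }
    Ok(hash)
}

/// Pass 2: re-open the asset and stream it again, feeding each chunk
/// to a processing callback instead of collecting everything into one Vec.
fn process_asset(path: &str, mut on_chunk: impl FnMut(&[u8])) -> io::Result<()> {
    let mut file = File::open(path)?;
    let mut buffer = [0u8; CHUNK];
    loop {
        let bytes_read = file.read(&mut buffer)?;
        if bytes_read == 0 {
            break;
        }
        on_chunk(&buffer[..bytes_read]);
    }
    Ok(())
}

fn main() -> io::Result<()> {
    let path = "some_asset.bin"; // hypothetical asset path
    std::fs::write(path, vec![7u8; 5000])?; // stand-in asset contents
    let hash = hash_asset(path)?;
    let mut total = 0;
    process_asset(path, |chunk| total += chunk.len())?;
    println!("hash = {hash:#x}, processed {total} bytes");
    Ok(())
}
```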

However, there are some risks. Asset sources that can't read chunks of an asset, and so need to read the whole asset into memory anyway, now have to do so twice (though not both at the same time). Examples of this kind of asset source are the default Wasm source and the HTTP asset source. In practice, I don't think this is a big issue: processing is likely to happen on local assets anyway, and it seems unlikely that users will want to download large assets from an HTTP asset source multiple times.

Another risk is that copying files during asset processing is possibly slower. Previously we just read the whole asset in and then wrote the whole asset out; now, we read small 1k chunks from the file and write each chunk as we go. This introduces some scheduling overhead into the copy. We can tune the chunk size if we find it's too slow.
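
The chunked copy described above can be sketched like this. Again a synchronous stand-in, not the actual processor code; `BUF_SIZE` plays the role of the tunable 1k chunk size:

```rust
use std::fs::File;
use std::io::{self, Read, Write};

/// The tunable knob: 1024 mirrors the 1k chunks mentioned above,
/// but nothing here depends on that value.
const BUF_SIZE: usize = 1024;

/// Copy `src` to `dst` through a small fixed-size buffer instead of
/// reading the whole file into one Vec. Returns the number of bytes copied.
fn copy_chunked(src: &str, dst: &str) -> io::Result<u64> {
    let mut reader = File::open(src)?;
    let mut writer = File::create(dst)?;
    let mut buffer = [0u8; BUF_SIZE];
    let mut copied = 0u64;
    loop {
        let n = reader.read(&mut buffer)?;
        if n == 0 {
            break; // a zero-byte read signals EOF
        }
        writer.write_all(&buffer[..n])?;
        copied += n as u64;
    }
    writer.flush()?;
    Ok(copied)
}

fn main() -> io::Result<()> {
    std::fs::write("in.bin", vec![1u8; 3000])?; // stand-in source file
    let copied = copy_chunked("in.bin", "out.bin")?;
    println!("copied {copied} bytes");
    Ok(())
}
```

The "scheduling overhead" concern comes from the read/write calls alternating once per chunk; raising `BUF_SIZE` trades memory for fewer round trips.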

Testing

  • The processing tests all still pass.

@andriyDev andriyDev added A-Assets Load files from disk to use for things like images, models, and sounds C-Performance A change motivated by improving speed, memory usage or compile times D-Straightforward Simple bug fixes and API improvements, docs, test and examples S-Needs-Review Needs reviewer attention (from anyone!) to move forward labels Nov 24, 2025

JMS55 commented Nov 24, 2025

Nice step towards https://zeux.io/2025/09/30/billions-of-triangles-in-minutes/ !

Some questions:

  • What is the hash used for? And the duplicate reads are only for asset processing, and won't have any effect on release versions of games loading already-processed assets, right?
  • We still don't have a way to read a certain slice of bytes from the asset, right? E.g., parsing a glTF file to find the meshes in it, then reading those meshes one at a time from other parts of the file.

@andriyDev
Contributor Author

@JMS55 Yup haha, I think you posted this a while ago in the Discord and it's been rattling around in my brain as something we simply can't do with Bevy today. I intend to fix that!

  • The hash is used when the processor initializes to determine whether an asset is up-to-date.
  • You are correct, the duplicate reads only matter during asset processing. This has no effect for loading already-processed assets.
  • Sorta. Currently Reader only requires AsyncRead and AsyncSeekForward. In theory, you could write a glTF loader that cleverly seeks in such a way that it never goes backwards. That would be very painful though. So what I've been working on is allowing the loader to request features from the asset source. A glTF loader could then request the "seek from start" feature, and its reader could do whatever seeks it needs, which is much simpler to work with. We'll see how it pans out!
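
To illustrate the forward-only constraint in synchronous terms (the wrapper and method names here are illustrative, not Bevy's API): skipping forward can always be emulated over any byte stream by reading and discarding bytes, which is roughly what `AsyncSeekForward` permits, while jumping backwards or back to the start needs genuine support from the asset source:

```rust
use std::io::{self, Cursor, Read};

/// A reader that only supports the seek any byte stream can provide:
/// moving forward by consuming bytes. Going backwards is impossible here;
/// that is the capability a loader would have to request from the source.
struct SeekForwardReader<R: Read> {
    inner: R,
    position: u64,
}

impl<R: Read> SeekForwardReader<R> {
    fn new(inner: R) -> Self {
        Self { inner, position: 0 }
    }

    /// Skip `offset` bytes by reading and discarding them.
    /// Returns the new absolute position.
    fn seek_forward(&mut self, offset: u64) -> io::Result<u64> {
        let skipped = io::copy(&mut (&mut self.inner).take(offset), &mut io::sink())?;
        self.position += skipped;
        Ok(self.position)
    }
}

impl<R: Read> Read for SeekForwardReader<R> {
    fn read(&mut self, buf: &mut [u8]) -> io::Result<usize> {
        let n = self.inner.read(buf)?;
        self.position += n as u64;
        Ok(n)
    }
}

fn main() -> io::Result<()> {
    let mut reader = SeekForwardReader::new(Cursor::new(b"header....payload".to_vec()));
    reader.seek_forward(10)?; // skip a 10-byte header
    let mut payload = String::new();
    reader.read_to_string(&mut payload)?;
    println!("{payload}"); // prints "payload"
    Ok(())
}
```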


@JMS55 JMS55 left a comment


Looks pretty good to me, some last feedback.


```rust
// Inside `impl Process for Type`
let reader = context.asset_reader();
```
Contributor

Can we add a nice helper for this?

Contributor Author

I intentionally don't want to. We should be encouraging users to engage with the "buffered" API of reading and writing, to reduce how much memory we're using. Today a lot of our loaders and sources just read everything into memory, and that can really limit how much data we can process. Besides, this is exactly how users should be doing it in loaders anyway, so this is no different. The migration only feels weird because the previous behavior of having all the bytes up front was weird.

@andriyDev andriyDev requested a review from JMS55 November 26, 2025 01:06
@andriyDev andriyDev force-pushed the no-memory-process branch 2 times, most recently from 9ea552b to 67d34ca on November 26, 2025 02:56
```rust
loop {
    let bytes_read = asset_reader.read(&mut buffer).await?;
    if bytes_read == 0 {
        // This means we've reached EOF, so we're done consume asset bytes.
```
Contributor

Suggested change:

```diff
-// This means we've reached EOF, so we're done consume asset bytes.
+// This means we've reached EOF, so we're done consuming asset bytes.
```

Contributor Author

Done! I also realized this was slightly wrong: it should keep processing if we didn't fill the buffer.
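
For std's `Read` semantics (Bevy's async `Reader` may offer different guarantees), only a zero-byte read signals EOF; a read that returns fewer bytes than the buffer holds can still be followed by more data. A small synchronous sketch of that subtlety, using a chained pair of in-memory sources so the first read comes up short mid-stream:

```rust
use std::io::{Cursor, Read};

/// Drain a reader to EOF, counting bytes. Stops only on a zero-byte read;
/// a short read (n < buffer.len()) does NOT mean the stream is finished.
fn drain_total<R: Read>(mut reader: R) -> std::io::Result<usize> {
    let mut buffer = [0u8; 4];
    let mut total = 0;
    loop {
        let n = reader.read(&mut buffer)?;
        if n == 0 {
            break; // only a zero-byte read reliably signals EOF here
        }
        total += n; // n may be < buffer.len() with more data still to come
    }
    Ok(total)
}

fn main() {
    // Two 3-byte sources chained: the first fill of the 4-byte buffer
    // returns only 3 bytes even though more data follows.
    let reader = Cursor::new(vec![1u8, 2, 3]).chain(Cursor::new(vec![4u8, 5, 6]));
    let total = drain_total(reader).unwrap();
    println!("read {total} bytes"); // 6 bytes, across short reads
}
```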

@JMS55 JMS55 left a comment

Approved with some minor doc suggestions.

