Amortize the cost of freeing entities #22658
Conversation
Ah, this is playing with the order entities are allocated in, which some tests still depend on. I'll fix that real quick.
chescock left a comment
Looks good! We really need doc comments for `quick_free`, and I'm going to wait for those before approving, but the rest of my comments are just nits and possibilities.
```rust
/// This is in contrast to the [`RemoteAllocator`], which may be cloned freely.
pub(super) struct Allocator {
    shared: Arc<SharedAllocator>,
    quick_free: ArrayVec<Entity, 64>,
```
Why not use an ordinary `Vec`? You can use `Vec::with_capacity(64)` and then check `quick_free.len() == quick_free.capacity()`. Storing these on a separate heap allocation instead of inline in `World` might even be better for cache locality, since most uses of `World` won't need this data.
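For illustration, a rough sketch of the `Vec`-based variant being suggested here. The types mirror the diff above; the `flush_to_shared` helper is hypothetical, standing in for whatever the shared allocator actually exposes:

```rust
// Sketch of the suggestion: an ordinary Vec, allocated once and never grown.
// `flush_to_shared` is a hypothetical name for the shared-allocator handoff.
pub(super) struct Allocator {
    shared: Arc<SharedAllocator>,
    quick_free: Vec<Entity>, // created with Vec::with_capacity(64)
}

impl Allocator {
    fn free(&mut self, entity: Entity) {
        // Since the Vec is never grown past its initial capacity,
        // len() == capacity() plays the role of ArrayVec::is_full.
        if self.quick_free.len() == self.quick_free.capacity() {
            // drain(..) empties the Vec but keeps its capacity.
            self.shared.flush_to_shared(self.quick_free.drain(..));
        }
        self.quick_free.push(entity);
    }
}
```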
I was concerned about cache locality, but you're not wrong. IMO, we either do a bigger heap `Vec` or a smaller/similarly sized `ArrayVec`. I'm totally fine with either; it just depends on what we're more interested in optimizing.
Yeah, I'm also fine with either! This is just another "prefer the simpler thing" case, where `std` types like `Vec` feel simpler than crates like `arrayvec`.
I just tried boxing it and doubling its capacity:
- Much worse perf for freeing fewer than 64 entities.
- Much better perf for freeing between 64 and 128 entities.
- Broke even at 1,000 entities.

So I don't see a strong case against boxing it, so it's now boxed. I did also try a plain `Vec`, but that made performance ~50% worse, unfortunately.
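For anyone following along, the difference being benchmarked here is only where the buffer lives. A rough, hypothetical sketch of the two layouts (the 128 capacity is the "doubled" size from this comment, not necessarily what was merged, and `u64` stands in for `Entity`):

```rust
use arrayvec::ArrayVec;

// Inline: the whole 512-byte buffer lives directly inside `World`.
struct InlineFreeList {
    quick_free: ArrayVec<u64, 64>, // stand-in for `Entity`
}

// Boxed: `World` only holds a pointer; the buffer lives on the heap.
struct BoxedFreeList {
    quick_free: Box<ArrayVec<u64, 128>>,
}

fn main() {
    println!("inline: {} bytes", std::mem::size_of::<InlineFreeList>());
    println!("boxed:  {} bytes", std::mem::size_of::<BoxedFreeList>());
}
```

Boxing shrinks the field in `World` to a single pointer at the cost of an extra indirection on the free path.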
alice-i-cecile left a comment
No objections in principle and this is generally well-made. More docs are always good though, and @chescock's review suggestions are unsurprisingly excellent.
# Objective

The biggest drawback of bevyengine#18670 was that it made freeing `Entity`s back to the allocator 4x slower. That meant a 20% regression in despawn performance. This PR vastly improves the performance of the entity allocator for freeing entities.

## Solution

Add a local free list in place in the main entity allocator. This is an `ArrayVec` called `quick_free`. When an entity is freed, add it to `quick_free`. If it is full, flush the array to the full shared allocator.

Currently the array has a capacity of 64, taking 512 bytes. Since this is directly included in the already massive `World` type, I don't think this is an issue, and I would guess boxing it would hurt performance here. It also means that there will be at most 64 freed entities that simply can't be allocated. This reduces the worst-case maximum entity count from 4,294,967,296 to 4,294,967,232 (big deal).

This also adds a new `free_many` function that is very fast compared to freeing entities one by one.

## Testing

- CI and benches.

---

## Showcase

Here are some rough benchmarks on my M2 Max:

```txt
group                                       post_quick_free_list           pre_quick_free_list            pre_remote_reservation
-----                                       --------------------           -------------------            ----------------------
entity_allocator_free/10000_entities        1.00    29.7±0.48µs ? ?/sec    1.31    38.9±0.97µs ? ?/sec    1.00    29.8±0.85µs ? ?/sec
entity_allocator_free/100_entities          1.00  393.3±26.21ns ? ?/sec    1.35  531.8±26.34ns ? ?/sec    1.14  446.7±11.32ns ? ?/sec
entity_allocator_free/1_entities            1.00     4.6±2.17ns ? ?/sec    42.27 195.3±32.49ns ? ?/sec    4.25    19.6±8.67ns ? ?/sec
entity_allocator_free_bulk/10000_entities   1.00     8.7±0.36µs ? ?/sec
entity_allocator_free_bulk/100_entities     1.00  240.9±31.01ns ? ?/sec
entity_allocator_free_bulk/1_entities       1.00  206.8±39.95ns ? ?/sec
```

Looking at the cost of freeing 10,000 entities, this makes the new allocator exactly as fast as the pre-bevyengine#18670 one, and 30% faster than main. The new `free_many` takes 8.7µs to free 10,000 entities where the optimized `free` takes 29.7µs, so another big win there.

This should make up for the 20% regression in despawn performance. It might be even faster than pre-bevyengine#18670 if we increase 64 to 128 or so, but I think that's unnecessary. This could also greatly improve performance for despawning scenes if we can find a way to make use of `free_many`, but that's a different task.
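For readers skimming, a minimal sketch of the quick-free pattern described in the Solution section above. The `SharedAllocator::free_many` call and the exact buffer handling are assumptions for illustration, not the code that was merged:

```rust
// Sketch only: amortizing frees through a small local buffer, as described
// in the Solution section. `SharedAllocator::free_many` and the buffer
// details are assumptions for illustration.
impl Allocator {
    /// Buffer a freed entity locally; only touch the shared allocator when
    /// the buffer fills up, amortizing its cost over many frees.
    pub fn free(&mut self, entity: Entity) {
        if self.quick_free.is_full() {
            // One flush pays the shared-allocator cost for 64 frees.
            self.shared.free_many(self.quick_free.drain(..));
        }
        self.quick_free.push(entity);
    }

    /// Free a whole batch at once, handing it straight to the shared
    /// allocator instead of going through the local buffer one by one.
    pub fn free_many(&mut self, entities: impl IntoIterator<Item = Entity>) {
        self.shared.free_many(entities);
    }
}
```

The "at most 64 freed entities that simply can't be allocated" caveat in the description comes from entities parked in this local buffer between flushes.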