Reduce V2 serializer allocations by ~23% via fast path#583

Merged
jhollinger merged 2 commits into procore-oss:jh/release-2.0-faster from
scottmyron:sm/release-2.0-faster-tweaks
Apr 28, 2026

Conversation

@scottmyron

Add a context-free serialization path for the common case where no extension hooks, conditionals, default values, formatters, or Proc extractors are configured.

Previously, every call to `serialize` allocated a `Context::Field` and a `Context::Parent` struct unconditionally — even when nothing in the serialization loop ever read them. Together these accounted for ~22% of all object allocations and caused V2 to trigger 2× more GC runs than V1 under the same workload.
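To make the shape of the problem concrete, here is a minimal sketch of the old behavior. The Struct names below are stand-ins that only mirror the PR description; they are not Blueprinter's actual classes:

```ruby
# Illustrative sketch: both context structs are built up front on every
# serialize call, whether or not any hook ever reads them.
FieldCtx  = Struct.new(:blueprint, :field, :object)
ParentCtx = Struct.new(:blueprint, :object)

def serialize_old(object, fields)
  field_ctx  = FieldCtx.new(self, nil, object)  # allocated unconditionally
  parent_ctx = ParentCtx.new(self, object)      # allocated unconditionally
  fields.each_with_object({}) do |name, out|
    field_ctx.field = name
    out[name] = object[name]  # in the common case, the contexts are never read
  end
end

serialize_old({ id: 1, name: "widget" }, [:id, :name])
```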

Changes:

  • Add `Extractors::Property.extract_simple` to extract field values directly from an object/hash without a `Context::Field`
  • Precompute `@needs_field_ctx` at blueprint load time in `find_used_hooks!` (requires `finalize_fields!` to run first, hence the reorder in `initialize`)
  • Branch to `serialize_fast` when `@needs_field_ctx` is false, skipping the `Context::Field` allocation entirely
  • Make `Context::Parent` lazy in both paths — created at most once per `serialize` call, only when an association field is actually encountered
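The changes above can be sketched roughly as follows. Only the names `extract_simple`, `@needs_field_ctx`, `serialize_fast`, `finalize_fields!`, and `find_used_hooks!` come from the PR description; the class layout and slow path are simplified placeholders, and the lazy `Context::Parent` handling is omitted for brevity:

```ruby
module Extractors
  module Property
    # Extract a value directly from a hash or object, no Context::Field needed.
    def self.extract_simple(object, name)
      object.is_a?(Hash) ? object[name] : object.public_send(name)
    end
  end
end

class MiniBlueprint
  Field = Struct.new(:name, :options)

  def initialize(fields)
    @fields = fields
    finalize_fields!   # must run before find_used_hooks! (hence the reorder)
    find_used_hooks!
  end

  def serialize(object)
    @needs_field_ctx ? serialize_slow(object) : serialize_fast(object)
  end

  private

  def finalize_fields!
    @fields.freeze
  end

  # Precomputed once at blueprint load time: does any field configure
  # a conditional, formatter, default, etc.?
  def find_used_hooks!
    @needs_field_ctx = @fields.any? { |f| !f.options.empty? }
  end

  def serialize_fast(object)
    @fields.each_with_object({}) do |field, out|
      out[field.name] = Extractors::Property.extract_simple(object, field.name)
    end
  end

  def serialize_slow(object)
    serialize_fast(object) # stand-in for the full context-building path
  end
end

bp = MiniBlueprint.new([MiniBlueprint::Field.new(:id, {}),
                        MiniBlueprint::Field.new(:name, {})])
p bp.serialize(id: 1, name: "widget")
```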

Result on 500 widgets × 50 iterations (30 fields, 10 object associations, 5 collection associations):

  • Allocations: −23% (3.3M → 2.56M objects)
  • GC runs: −26% (85 → 63)
  • Context::Field: eliminated from hot path (75k → 0 samples)
  • Context::Parent: eliminated from hot path (75k → ~10 samples)

V2 now allocates fewer objects than V1 for the common case.

Made-with: Cursor

Checklist:

  • I have updated the necessary documentation
  • I have signed off all my commits as required by DCO
  • My build is green

@scottmyron scottmyron requested review from a team and ritikesh as code owners April 24, 2026 19:41
@jhollinger
Contributor

Thanks @scottmyron! I'm still studying it but there's definitely something here. I do wonder if we could keep a single serialize method and conditionally create the Field Context (like you're doing with Parent Context)? Could ease maintenance and testing. Just a thought - YMMV.
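For illustration, one possible shape of that suggestion is a single loop where the field context is created lazily, on first need, mirroring the lazy Parent Context. All names here are hypothetical, not the gem's API:

```ruby
# Sketch: one serialize path, with the field context allocated at most once
# per call and only when a field actually needs it.
LazyFieldCtx = Struct.new(:object, :field)

def serialize_single_path(object, fields, needs_field_ctx)
  ctx = nil
  fields.each_with_object({}) do |name, out|
    if needs_field_ctx
      ctx ||= LazyFieldCtx.new(object, nil)  # lazy: skipped on the fast case
      ctx.field = name
    end
    out[name] = object[name]
  end
end

serialize_single_path({ id: 1 }, [:id], false)  # no context allocated
```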

Also, something we've been talking about is better automation around speed/memory/etc checks to prevent future regressions. (V1 saw huge perf degradation over time b/c nothing was checking it.) Would you be able to push up your memory perf code so we could eventually incorporate it into whatever system we eventually come up with? Automatically seeing "this PR adds a bunch of allocations" would be invaluable IMO.
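As a sketch of what such a check could measure (this is not the PR's profiling code; it just uses CRuby's built-in `GC.stat` allocation counter), a CI job could fail when a serializer's per-run allocation count regresses past a threshold:

```ruby
# Count the objects allocated by a block using CRuby's cumulative
# allocation counter. GC is disabled so a collection mid-block
# doesn't skew the numbers.
def allocations_for
  GC.disable
  before = GC.stat(:total_allocated_objects)
  yield
  GC.stat(:total_allocated_objects) - before
ensure
  GC.enable
end

heavy = allocations_for { 1_000.times.map { |i| [i, i.to_s] } }  # arrays + strings
light = allocations_for { 1_000.times { |i| i + 1 } }            # almost nothing
puts "heavy=#{heavy} light=#{light}"
```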

@jhollinger
Contributor

Going to merge this and riff on it a bit. Thanks!

@jhollinger jhollinger merged commit 30abb0f into procore-oss:jh/release-2.0-faster Apr 28, 2026
1 check failed
jhollinger added a commit that referenced this pull request Apr 28, 2026
Reduce V2 serializer allocations by ~23% via fast path
jhollinger added a commit that referenced this pull request Apr 28, 2026
Reduce V2 serializer allocations by ~23% via fast path
@scottmyron
Author

Apologies for the delay... I'll get another PR going soon with the memory profiling turned into a GitHub Action (hopefully).
