fix x86_64 performance regression #43

davidhewitt · 2025-11-14T19:50:24Z

This appears to fix the performance regression on x86_64 CPUs (at least on my local dev box).

CPU profiles suggested that v.is_null involves a fair amount of dynamic dispatch indirection (v is an &Arc<dyn Array>, which casts itself to &dyn Array on each call). Extracting the logical_nulls up front (which should cost roughly one reference count increment and decrement for simple arrays) allows better inlining.
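To illustrate the shape of the change, here is a minimal sketch. It does not use the real arrow-rs types: the `Array` trait, `Int32Array` struct, and both `count_nulls_*` functions are hypothetical stand-ins, and the validity mask is modeled as a plain `Arc<[bool]>` instead of a bit-packed null buffer. The only point is the contrast between one virtual call per row and one virtual call per batch.

```rust
use std::sync::Arc;

// Hypothetical stand-in for an Arrow-style array trait (NOT the arrow-rs API).
trait Array {
    fn len(&self) -> usize;
    fn is_null(&self, i: usize) -> bool;
    // Returns a cheaply clonable validity mask (true = valid),
    // or None when the array has no nulls. Cloning only bumps a refcount.
    fn logical_nulls(&self) -> Option<Arc<[bool]>>;
}

// Hypothetical concrete array used only for this illustration.
struct Int32Array {
    values: Vec<i32>,
    validity: Option<Arc<[bool]>>,
}

impl Array for Int32Array {
    fn len(&self) -> usize {
        self.values.len()
    }
    fn is_null(&self, i: usize) -> bool {
        self.validity.as_ref().map_or(false, |v| !v[i])
    }
    fn logical_nulls(&self) -> Option<Arc<[bool]>> {
        self.validity.clone()
    }
}

// Before: every row goes through the vtable via `v.is_null(i)`,
// so the compiler cannot inline the null check into the loop.
fn count_nulls_dyn(v: &Arc<dyn Array>) -> usize {
    (0..v.len()).filter(|&i| v.is_null(i)).count()
}

// After: a single virtual call fetches the mask, then the loop runs
// over a concrete slice that the compiler can inline and optimize.
fn count_nulls_hoisted(v: &Arc<dyn Array>) -> usize {
    match v.logical_nulls() {
        Some(valid) => valid.iter().filter(|&&is_valid| !is_valid).count(),
        None => 0,
    }
}
```

Both functions return the same count; the hoisted version just moves the dynamic dispatch out of the per-row loop, which is the property the change above relies on.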

I guess this might have somehow upset the x86_64 branch predictor? Maybe the dynamic dispatch made the call chain complex enough that the branch predictor was mispredicting the result of is_null.

fix x86_64 performance regression

c3c0bc1

github-actions bot added the physical-expr label Nov 14, 2025

davidhewitt mentioned this pull request Nov 14, 2025

Refactor InListExpr to support structs by re-using existing hashing infrastructure apache/datafusion#18449

Open

adriangb merged commit 2db9927 into pydantic:refactor-in-list Nov 14, 2025
14 checks passed

2 participants
