perf: Optimize `lower`, `upper` for sliced arrays by neilconway · Pull Request #21814 · apache/datafusion

neilconway · 2026-04-23T20:45:07Z

Which issue does this PR close?

Closes #21804 (`lower`, `upper` is inefficient for sliced arrays).

Rationale for this change

`case_conversion_ascii_array` operates directly on the underlying values buffer, but it does not restrict itself to the bytes within the visible slice. For sliced arrays, this can lead to substantial unnecessary work.
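The idea can be sketched with a simplified, std-only model of a sliced string array. This is an illustration under stated assumptions, not the code from this PR: `SlicedStrings`, `lower_visible_ascii`, and `demo_slice` are hypothetical names, and the real implementation works on arrow-rs arrays and buffers. The sketch reads the first and last visible offsets, converts only that byte range, and rebases the offsets so they index into the new, smaller buffer.

```rust
/// Simplified stand-in for a sliced string array (hypothetical; not the arrow-rs type).
struct SlicedStrings<'a> {
    /// Offsets for the visible rows only: `len + 1` entries pointing into `values`.
    offsets: &'a [i32],
    /// The parent's full values buffer, shared by every slice of the parent.
    values: &'a [u8],
}

/// Lowercase only the bytes the slice can see and rebase the offsets to start at 0.
/// Assumes the visible bytes are ASCII, as in the ASCII fast path.
fn lower_visible_ascii(arr: &SlicedStrings) -> (Vec<i32>, Vec<u8>) {
    let first = arr.offsets[0];
    let last = arr.offsets[arr.offsets.len() - 1];
    // Only this range belongs to the slice; bytes outside it are never touched.
    let visible = &arr.values[first as usize..last as usize];
    let values = visible.to_ascii_lowercase();
    let offsets: Vec<i32> = arr.offsets.iter().map(|o| o - first).collect();
    (offsets, values)
}

/// Parent rows are "AB", "CDE", "F", "GH"; the slice exposes rows 1 and 2.
fn demo_slice() -> (Vec<i32>, Vec<u8>) {
    let parent_values: &[u8] = b"ABCDEFGH";
    let parent_offsets = [0i32, 2, 5, 6, 8];
    let slice = SlicedStrings {
        offsets: &parent_offsets[1..4],
        values: parent_values,
    };
    lower_visible_ascii(&slice)
}
```

In this model the output buffer holds 4 bytes rather than the parent's 8, and the rebased offsets describe the two visible rows `"cde"` and `"f"`.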

What changes are included in this PR?

- Optimize `case_conversion_ascii_array` for sliced arrays
- Add a unit test
- Add a benchmark. Because the "sliced array" case can be made arbitrarily extreme, the raw benchmark numbers matter less than what the benchmark confirms: the work done now scales with the visible size of a sliced array.
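To make the scaling claim concrete, here is a rough byte-count model, assuming every row is exactly `str_len` bytes long (as the `str_len` parameter of the sliced benchmark configurations reported in this thread suggests). `bytes_scanned` and `scan_ratio` are hypothetical helpers for illustration only. The model counts bytes converted and ignores fixed per-call overhead, so it should not be read as a prediction of measured speedups.

```rust
/// Bytes converted when processing `rows` rows of `str_len` bytes each.
fn bytes_scanned(rows: usize, str_len: usize) -> usize {
    rows * str_len
}

/// Ratio of bytes converted when scanning the whole parent buffer
/// versus only the visible slice.
fn scan_ratio(parent_rows: usize, slice_rows: usize, str_len: usize) -> f64 {
    bytes_scanned(parent_rows, str_len) as f64 / bytes_scanned(slice_rows, str_len) as f64
}
```

For `parent=8192, slice=128, str_len=32`, the whole buffer is 262144 bytes while the visible slice is 4096 bytes, a 64× difference in bytes converted. For `parent=65536, slice=128` the ratio is 512×, and for `parent=65536, slice=1024` it is 64×.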

Are these changes tested?

Yes.

Are there any user-facing changes?

No.

comphead

Thanks @neilconway
It would have been great if the benchmark details had been included.

neilconway · 2026-04-24T03:16:26Z

@comphead

  Sliced ASCII
  - parent=8192, slice=128, str_len=32: main 11.83 µs → branch 344 ns — ~34× faster (−97.2%)
  - parent=65536, slice=128, str_len=32: main 93 µs → branch 337 ns — ~275× faster (−99.65%)
  - parent=65536, slice=1024, str_len=32: main 94 µs → branch 1.66 µs — ~57× faster (−98.3%)

  Non-sliced ASCII (lower_all_values_are_ascii)
  - size 1024: main 1.68 µs → branch 1.62 µs — −4.1%
  - size 4096: main 6.27 µs → branch 6.27 µs — −2.0%
  - size 8192: main 12.28 µs → branch 12.21 µs — −0.5%

  Non-sliced non-ASCII, first row non-ASCII (lower_the_first_value_is_nonascii)
  - size 1024: main 20.89 µs → branch 19.79 µs — −5.3%
  - size 4096: main 84.54 µs → branch 79.90 µs — −5.5%
  - size 8192: main 156.10 µs → branch 160.77 µs — +2.5%

  Non-sliced non-ASCII, middle row non-ASCII (lower_the_middle_value_is_nonascii)
  - size 1024: main 20.72 µs → branch 20.30 µs — +0.06% (noise)
  - size 4096: main 84.68 µs → branch 85.42 µs — +1.3%
  - size 8192: main 167.11 µs → branch 165.70 µs — +0.8% (noise)

dde5dbd

github-actions bot added the `functions` label (Changes to functions implementation) Apr 23, 2026

comphead approved these changes Apr 24, 2026

comphead added this pull request to the merge queue Apr 24, 2026

Merged via the queue into apache:main with commit 7d5ddca Apr 24, 2026
31 checks passed

neilconway deleted the neilc/perf-lower-upper-sliced-arrays branch April 24, 2026 17:28

comphead merged 1 commit into apache:main from neilconway:neilc/perf-lower-upper-sliced-arrays