8356993: ArrayDeque should use Arrays.fill() instead of for() loops #25237

archiecobbs · 2025-05-14T19:37:37Z

Please review this small performance tweak to ArrayDeque.

ArrayDeque has an invariant in which any unused elements in the array must be null. In a couple of places, the code is setting contiguous ranges of elements to null using for() loops. This can be both simplified and sped up by using Arrays.fill() instead.
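To make the proposed change concrete, here is a minimal sketch of the two styles side by side. It is not the PR's actual diff: the class name `CircularClearSketch` and both method names are hypothetical, and the wrap-around handling is an assumption based on ArrayDeque storing its elements in a circular array.

```java
import java.util.Arrays;

// Hypothetical illustration only; not the code in java.util.ArrayDeque.
final class CircularClearSketch {

    /** Nulls the circular range [from, to) with explicit for() loops. */
    static void clearWithLoops(Object[] es, int from, int to) {
        if (from <= to) {
            for (int i = from; i < to; i++) es[i] = null;
        } else {
            // The range wraps past the end of the array: clear both pieces.
            for (int i = from; i < es.length; i++) es[i] = null;
            for (int i = 0; i < to; i++) es[i] = null;
        }
    }

    /** Same effect, expressed with Arrays.fill(). */
    static void clearWithFill(Object[] es, int from, int to) {
        if (from <= to) {
            Arrays.fill(es, from, to, null);
        } else {
            Arrays.fill(es, from, es.length, null);
            Arrays.fill(es, 0, to, null);
        }
    }

    public static void main(String[] args) {
        Object[] a = {"a", "b", "c", "d", "e", "f"};
        Object[] b = a.clone();
        clearWithLoops(a, 4, 2);   // clears slots 4, 5, 0, 1
        clearWithFill(b, 4, 2);
        System.out.println(Arrays.toString(a)); // [null, null, c, d, null, null]
        System.out.println(Arrays.equals(a, b)); // true
    }
}
```

Both variants leave the array in the same state; whether one is faster than the other is an empirical question.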

Progress

Change must be properly reviewed (1 review required, with at least 1 Reviewer)
Change must not contain extraneous whitespace
Commit message must refer to an issue

Issue

JDK-8356993: ArrayDeque should use Arrays.fill() instead of for() loops (Enhancement - P4)

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk.git pull/25237/head:pull/25237
$ git checkout pull/25237

Update a local copy of the PR:
$ git checkout pull/25237
$ git pull https://git.openjdk.org/jdk.git pull/25237/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 25237

View PR using the GUI difftool:
$ git pr show -t 25237

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/25237.diff

Using Webrev

Link to Webrev Comment

bridgekeeper · 2025-05-14T19:38:24Z

👋 Welcome back acobbs! A progress list of the required criteria for merging this PR into master will be added to the body of your pull request. There are additional pull request commands available for use with this pull request.

openjdk · 2025-05-14T19:39:26Z

❗ This change is not yet ready to be integrated.
See the Progress checklist in the description for automated requirements.

openjdk · 2025-05-14T19:40:06Z

@archiecobbs The following label will be automatically applied to this pull request:

core-libs

When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing list. If you would like to change these labels, use the /label pull request command.

mlbridge · 2025-05-14T19:43:20Z

Webrevs

AlanBateman · 2025-05-15T06:07:06Z

Are you planning to add some JMH benchmarks to go with this?

archiecobbs · 2025-05-15T14:12:21Z

Are you planning to add some JMH benchmarks to go with this?

I wasn't planning to, but I'm inferring from your question that you'd prefer to see one.

Which also makes me curious. I'd be shocked if this were slower, but even if not, I wonder how much faster it would be.

I will work on creating one.

RogerRiggs · 2025-05-15T14:45:51Z

I'm curious to know whether C2 turns the loop into a vectorized operation. The Arrays.fill might be more expressive, but not necessarily faster.

archiecobbs · 2025-05-15T17:45:50Z

I added a benchmark to the PR (hopefully I did that right).

It shows a decrease in performance. I have no idea why. I did this on my laptop so who knows, but if the effect is real then it kind of raises a lot of larger questions.

jdk-25+22-94-g0318e49500e (master):
Benchmark                                       Mode  Cnt   Score   Error  Units
ArrayDeque.ClearBenchmarkTestJMH.fillAndClear  thrpt   50  37.064 ± 0.225  ops/s

jdk-25+22-95-g84fb0903be0 (JDK-8356993):
Benchmark                                       Mode  Cnt   Score   Error  Units
ArrayDeque.ClearBenchmarkTestJMH.fillAndClear  thrpt   50  35.528 ± 0.180  ops/s

forax · 2025-05-15T17:50:30Z

test/jdk/java/util/ArrayDeque/ClearBenchmarkTestJMH.java

+    @Benchmark
+    @Measurement(iterations = 10)
+    @Warmup(iterations = 3)
+    public void fillAndClear() {


I think you need to return the collection or send it to a BlackHole

I think you need to return the collection or send it to a BlackHole

I'm fairly new to the benchmark game so I would not be surprised if this is broken.

Previously I was adding them to a list but that caused OOMs.

Can you clarify what you mean? By 'return' do you just mean returning the deque from the method? Also I don't know what a BlackHole is.

Apologies for not knowing what I'm doing here. Thanks.

Here are many examples of how to use JMH correctly.

A blackhole prevents the compiler from optimizing away your code.

Here are many examples of how to use JMH correctly.

A blackhole prevents the compiler from optimizing away your code.

Thanks for the tip. FWIW after doing that the numbers came out about the same - which is not surprising given that Arrays.fill() is just the same for() loop...

#2 - After adding Blackhole

jdk-25+22-94-g0318e49500e (master):
Benchmark                                       Mode  Cnt   Score   Error  Units
ArrayDeque.ClearBenchmarkTestJMH.fillAndClear  thrpt   50  35.663 ± 0.163  ops/s

jdk-25+22-97-g9f0c5fe1f90 (JDK-8356993):
Benchmark                                       Mode  Cnt   Score   Error  Units
ArrayDeque.ClearBenchmarkTestJMH.fillAndClear  thrpt   50  35.112 ± 0.501  ops/s

ExE-Boss · 2025-05-15T19:53:03Z

Note that Arrays.fill(…) is simply a for(…) loop with an additional range check and is potentially subject to profile pollution due to JDK‑8015417:

jdk/src/java.base/share/classes/java/util/Arrays.java

Lines 3449 to 3453 in c59debb

    
    public static void fill(Object[] a, int fromIndex, int toIndex, Object val) {
        rangeCheck(a.length, fromIndex, toIndex);
        for (int i = fromIndex; i < toIndex; i++)
            a[i] = val;
    }
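The extra range check is observable from the outside. The following sketch (the class name `FillRangeCheckSketch` and the helper `outcome` are hypothetical) probes the documented behavior of `Arrays.fill(Object[], int, int, Object)`: an inverted range throws `IllegalArgumentException`, and an out-of-bounds index throws `ArrayIndexOutOfBoundsException` before any element is written.

```java
import java.util.Arrays;

// Hypothetical helper for illustration; only Arrays.fill() is real JDK API here.
final class FillRangeCheckSketch {

    /** Reports how Arrays.fill() reacts to the given range on array a. */
    static String outcome(Object[] a, int fromIndex, int toIndex) {
        try {
            Arrays.fill(a, fromIndex, toIndex, null);
            return "ok";
        } catch (IllegalArgumentException e) {
            return "IllegalArgumentException";       // fromIndex > toIndex
        } catch (ArrayIndexOutOfBoundsException e) {
            return "ArrayIndexOutOfBoundsException"; // index outside the array
        }
    }

    public static void main(String[] args) {
        Object[] a = {"w", "x", "y", "z"};
        System.out.println(outcome(a, 1, 3)); // ok
        System.out.println(outcome(a, 2, 2)); // ok (empty range, nothing written)
        System.out.println(outcome(a, 3, 1)); // IllegalArgumentException
        System.out.println(outcome(a, 0, 5)); // ArrayIndexOutOfBoundsException
        System.out.println(Arrays.toString(a)); // [w, null, null, z]
    }
}
```

A hand-written loop over the same indices would skip the up-front check and rely only on the per-element bounds checks of array stores.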

archiecobbs · 2025-05-15T20:05:23Z

Note that Arrays.fill(…) is simply a for(…) loop with an additional range check

Interesting... I was assuming that most of the "bulk" methods in Arrays were being hand-optimized with special hardware magic (e.g., vector instructions), and that the opportunity to do this was part of the motivation for adding them in the first place.

If C2 is already able to automatically optimize this into the maximum possible hardware performance, then great! But is that actually the case?

archiecobbs · 2025-05-16T23:53:33Z

I'm closing the PR because it's gone into low-level optimization details that are beyond me.

However I'm still unclear on whether bulk memory set operations are being fully optimized. By that I mean doing something like this on arm64 at least. Any insights from the experts would be appreciated.

Thanks for the interesting discussion.

Use Arrays.fill() instead of for() loops to null out array elements.

84fb090

openjdk bot added the rfr label (Pull request is ready for review) May 14, 2025

openjdk bot added the core-libs label (core-libs-dev@openjdk.org) May 14, 2025

Add benchmark for ArrayDeque.clear().

1b8fb83

forax reviewed May 15, 2025

Blackhole ArrayDeque to ensure it's not ignored by the compiler.

9f0c5fe

archiecobbs closed this May 16, 2025


Review comments, in order:

forax May 15, 2025
archiecobbs May 15, 2025
rgiulietti May 15, 2025
archiecobbs May 15, 2025


6 participants
