Apply `Array#push`'s resizing heuristic to `#unshift` #10750

HertzDevil · 2021-05-27T02:08:45Z

The following illustrates some examples of the heuristic in action:

```crystal
class Array(T)
  def to_s(io : IO) : Nil
    root = root_buffer
    io << '['
    @capacity.times do |i|
      io << ", " if i > 0
      if @offset_to_buffer <= i < @offset_to_buffer + @size
        root[i].inspect(io)
      else
        io << '_'
      end
    end
    io << ']'
  end
end

x = Array(Int32).new(5)
x.push(1, 2); puts x
x.unshift(3); puts x
x.push(4); puts x

# before:
# [1, 2, _, _, _]
# [_, _, _, _, 3, 1, 2, _, _, _]
# [_, _, _, _, 3, 1, 2, 4, _, _]

# after:
# [1, 2, _, _, _]
# [_, 3, 1, 2, _] # there is enough space; move 1 and 2 to the middle of the root buffer
# [_, 3, 1, 2, 4]

x = Array(Int32).new(4)
x.unshift(0); puts x
x.shift; x.unshift(1); puts x
x.shift; x.unshift(2); puts x
x.unshift(4, 3); puts x

# before:
# [_, _, _, 0, _, _, _, _]
# [_, _, _, _, _, _, _, 1, _, _, _, _, _, _, _, _]
# [_, _, _, _, _, _, _, _, _, _, _, _, _, _, _, 2, _, _, _, _, _, _, _, _, _, _, _, _, _, _, _, _]
# [_, _, _, _, _, _, _, _, _, _, _, _, _, 4, 3, 2, _, _, _, _, _, _, _, _, _, _, _, _, _, _, _, _]

# after:
# [_, 0, _, _] # there is pre-allocated space; start inserting elements from the middle
# [_, 1, _, _]
# [_, 2, _, _]
# [_, 4, 3, 2] # there is enough space; move 3 and 2 to the middle of the root buffer
```

stakach

LGTM

stakach · 2021-05-27T02:30:52Z

Hopefully this can make it into v1.0.1

straight-shoota

This is probably fine for a quick fix.

But I think it might be beneficial to take a closer look at the applied heuristics. I could see some room for improvement there.
A worst-case scenario with this is adding an element each to the front and back of an array that's filled to capacity (a typical example would be literals). Adding just these two elements quadruples the capacity: it's doubled by both push and unshift, regardless of their order. This means allocating a lot of unnecessary memory.
If the resize operations for push and unshift did not fill the buffer all the way to the front and back, respectively, this could be avoided.
I don't know what would be a better solution and I'm not even sure this needs to be changed, but it probably deserves an evaluation and discussion. Obviously, the best solution depends on a fair balance between different use cases.
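
The worst case described above can be sketched with a toy model. This is only an illustration: `BufferModel` is a hypothetical class that tracks capacity, offset and size, and its resizing rules (move the elements when they fill at most half of the buffer, otherwise double the capacity) are assumptions chosen to match the examples in this PR, not a copy of the stdlib code.

```crystal
# Hypothetical toy model of an array's root buffer. It stores no elements,
# only the three numbers that decide when a reallocation happens.
# The rules below are assumptions, not the actual Array implementation.
class BufferModel
  getter capacity : Int32
  getter offset : Int32
  getter size : Int32

  def initialize(@capacity : Int32, @size : Int32 = 0, @offset : Int32 = 0)
    raise ArgumentError.new("capacity must be positive") if @capacity < 1
  end

  def push : self
    if @offset + @size == @capacity
      if @offset > 0 && @size <= @capacity // 2
        @offset = 0 # enough room: move the elements to the front
      else
        @capacity *= 2 # reallocate; the new free space goes to the back
      end
    end
    @size += 1
    self
  end

  def unshift : self
    if @offset == 0
      half = @capacity // 2
      if half > 0 && @size <= half
        @offset = half # enough room: move the elements toward the middle
      else
        @offset = @capacity # reallocate; the elements land in the back half
        @capacity *= 2
      end
    end
    @offset -= 1
    @size += 1
    self
  end
end

# A buffer filled to capacity, such as a 4-element literal.
a = BufferModel.new(capacity: 4, size: 4)
a.push.unshift
puts a.capacity # => 16

b = BufferModel.new(capacity: 4, size: 4)
b.unshift.push
puts b.capacity # => 16
```

Under these assumed rules, two insertions take the capacity from 4 to 16 in either order, because each operation finds its own end of the buffer full and the elements already occupy more than half of it.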

spec/std/array_spec.cr

asterite · 2021-05-27T12:03:09Z

> I could see some room for improvement there

I see what you did here 😄 😉

asterite · 2021-06-18T10:01:20Z

We should probably merge this since it's an important fix

beta-ziliani

Let's get this for 1.1.0. There's the unanswered question about the need for 10k iterations in tests, when a handful would probably suffice. But we can change that later (I'd like to hear @HertzDevil's opinion on the matter).

Apply Array#shift's resizing heuristic to #unshift

505efba

stakach approved these changes May 27, 2021

HertzDevil changed the title ~~Apply Array#shift's resizing heuristic to #unshift~~ Apply Array#push's resizing heuristic to #unshift May 27, 2021

straight-shoota approved these changes May 27, 2021

Review thread on spec/std/array_spec.cr (outdated, resolved)

straight-shoota added the kind:bug and topic:stdlib:collection labels Jun 18, 2021

Merge branch 'master' into bug/array-unshift-resize

8d202ce

beta-ziliani approved these changes Jun 25, 2021

asterite added this to the 1.1.0 milestone Jun 25, 2021

HertzDevil added 2 commits June 26, 2021 09:27

Update array_spec.cr

bcfae5e

Update array_spec.cr

c52cfdb

straight-shoota approved these changes Jun 26, 2021

asterite merged commit a03539c into crystal-lang:master Jun 26, 2021

HertzDevil deleted the bug/array-unshift-resize branch June 27, 2021 03:37
