Improve components that accept forward iterators and compute the size #169

Closed
13 tasks done
Morwenn opened this issue Aug 22, 2020 · 2 comments
Morwenn commented Aug 22, 2020

Several components in the library still accept forward or bidirectional iterators/iterables and start by computing std::distance(first, last). This is sub-optimal when the container can already provide its size in O(1), as std::list does, which is why some sorters use utility::size to take advantage of it.

For version 1.8.0 we should inspect the library for sorters, adapters and other algorithms that could be improved with direct size information but are not.

Components that could obviously be improved:

  • equal_range: seldom used, unclear whether it's worth it.
  • lower_bound
  • probe::exc
  • probe::ham
  • probe::inv
  • probe::max
  • schwartz_adapter
  • stable_adapter
  • upper_bound
  • vergesort

Components where improvement is more involved if possible at all:

  • inplace_merge: passing the size might allow improvements for verge_sort with bidirectional iterators.
  • probe::dis
  • probe::rem
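For reference, the kind of dispatch a size utility performs can be sketched as follows. This is a minimal illustration with hypothetical names (`range_size`, `size_impl`), not the library's actual utility::size implementation: it calls the container's O(1) `.size()` when one exists and falls back to std::distance otherwise.

```cpp
#include <cassert>
#include <forward_list>   // containers used in the usage example
#include <iterator>
#include <list>
#include <type_traits>

// Preferred overload: picked when the iterable has a .size() member,
// which is O(1) for standard containers such as std::list.
template<typename Iterable>
auto size_impl(const Iterable& iterable, int)
    -> decltype(iterable.size())
{
    return iterable.size();
}

// Fallback overload: O(n) std::distance for iterables without .size(),
// e.g. std::forward_list.
template<typename Iterable>
auto size_impl(const Iterable& iterable, long)
    -> typename std::iterator_traits<decltype(std::begin(iterable))>::difference_type
{
    return std::distance(std::begin(iterable), std::end(iterable));
}

template<typename Iterable>
auto range_size(const Iterable& iterable)
    -> decltype(size_impl(iterable, 0))
{
    // Passing 0 (an int) makes the int overload preferred when it is viable
    return size_impl(iterable, 0);
}
```

With such a helper, a sorter taking an iterable can obtain the size once, cheaply when possible, and pass it down to size-aware internal algorithms.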

I hope that counted sentinels (#45) will make the situation even better and allow us to take advantage of size information even when the algorithm only operates on part of a collection.

@Morwenn Morwenn added this to the 1.8.0 milestone Aug 22, 2020
Morwenn added a commit that referenced this issue Aug 22, 2020
The inplace_merge algorithm that works for forward iterators when no
heap memory is available was passing next(i, n) to upper_bound and
lower_bound, while both of those start by computing the size again from
the passed iterators. Introducing lower_bound_n and upper_bound_n
avoids traversing the subrange twice to compute information we already
have.

This change makes the forward iterator version of merge_sort way faster
than it used to be when no heap memory is available.
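The idea behind lower_bound_n can be sketched like this — a binary search over [first, first + size) that takes the size directly instead of recomputing std::distance(first, last). This is my own minimal sketch, not the library's exact code:

```cpp
#include <cassert>
#include <forward_list>   // container used in the usage example
#include <iterator>

// Binary search taking an explicit size: with forward iterators this avoids
// a full O(n) traversal just to measure the search space before searching it.
template<typename ForwardIterator, typename T>
ForwardIterator lower_bound_n(
    ForwardIterator first,
    typename std::iterator_traits<ForwardIterator>::difference_type size,
    const T& value)
{
    while (size > 0) {
        auto half = size / 2;
        auto middle = std::next(first, half);
        if (*middle < value) {
            // value is in the upper half: skip middle and everything before it
            first = std::next(middle);
            size -= half + 1;
        } else {
            // value is in the lower half, middle included
            size = half;
        }
    }
    return first;
}
```

A matching upper_bound_n only differs by using `!(value < *middle)` as the descend-right condition.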

Morwenn commented Aug 23, 2020

Introducing lower_bound_n and upper_bound_n, which take an iterator and a size, yielded some surprisingly good results: the graph below shows the result of benchmarking the old version of merge_sort against the new one when sorting an std::forward_list with no extra heap memory available. As you can see, the new version is significantly faster than the previous one in almost every scenario:

Old vs. new merge_sort benchmark

It's now story time, fellow mortals: that improvement comes from the use of lower_bound_n and upper_bound_n in the fallback algorithm used by inplace_merge to merge forward iterators when no extra heap memory is available. There were places where I called lower_bound(it, std::next(it, n), ...); however, lower_bound starts by computing std::distance(first, last) to get the size of the search space. With forward iterators, that meant traversing the same sequence twice to get information we already had to start with, hence the poor results.

Now those of you who learnt the README by heart will remember this piece of attribution trivia:

The library internally uses an inplace_merge function that works with forward iterators. Its implementation uses a merge algorithm proposed by Dudziński and Dydek, and implemented by Alexander Stepanov and Paul McJones in their book Elements of Programming.

As far as I know, Stepanov consistently recommends fully using the information we have and not recomputing what we already know (see for example the Law of Useful Return). With that in mind, I wondered how Elements of Programming could contain suboptimal code like this, so I decided to go back and have a look at the original code.

...

It turns out that Elements of Programming uses functions named lower_bound_n and upper_bound_n too, so it does not waste time traversing the search space twice. My guess is that I originally took that piece of code back when I was still trying not to reimplement half of the standard library's algorithms in cpp-sort and just "patched" the algorithm on the fly with std::next so that I could use std::lower_bound directly, and never gave it a second thought or tried to benchmark how bad the impact would be.

Once again Stepanov and McJones were right, and the mistakes were mine.

Morwenn added a commit that referenced this issue Aug 27, 2020
probe::dis was accidentally O(n^3) with forward iterators and
bidirectional iterators. This commit changes the algorithm a bit to
avoid a lot of unnecessary calls to std::distance, and gives it its
marketed O(n^2) complexity for all categories of iterators.

Loosely related to issue #169.

Morwenn commented Aug 27, 2020

probe::dis got some love too because of this issue and the improvement is just ridiculously huge (to the point that the bars for the new speed are almost invisible):

Old vs. new probe::dis

Long story short: probe::dis was accidentally cubic instead of quadratic for forward and bidirectional iterators. The latest commit made it O(n²), as it was expected to be. I've still got a few ideas to make probe::dis a bit faster again, but none would be that big: issue #85 tracks the measures of presortedness for which the time complexity could be improved according to the literature, and Dis(X) is not among them.
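As a point of reference, the quadratic behaviour is easy to picture with the usual definition Dis(X) = max { j − i : i < j and X[i] > X[j] } (0 for a sorted sequence). The sketch below is only an illustration of that definition with a hypothetical name (`dis_probe`), not the library's actual probe::dis:

```cpp
#include <algorithm>
#include <cassert>
#include <forward_list>   // container used in the usage example
#include <iterator>

// O(n^2) Dis: for every pair (i, j) with i < j, record the largest
// distance j - i over which an inversion occurs.
template<typename ForwardIterator>
auto dis_probe(ForwardIterator first, ForwardIterator last)
    -> typename std::iterator_traits<ForwardIterator>::difference_type
{
    using diff_t = typename std::iterator_traits<ForwardIterator>::difference_type;
    diff_t max_dist = 0;
    diff_t i = 0;
    for (auto it = first; it != last; ++it, ++i) {
        diff_t j = i + 1;
        for (auto it2 = std::next(it); it2 != last; ++it2, ++j) {
            if (*it > *it2) {
                max_dist = std::max(max_dist, j - i);
            }
        }
    }
    return max_dist;
}
```

Note that only O(1) std::next steps are performed per inner iteration; the accidental cubic behaviour came from extra std::distance calls on top of a loop like this.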

EDIT: with the addition of a small heuristic that short-circuits parts of the computation when we know that we can't find a better result, we get an even faster algorithm on average:

Even newer probe::dis

The orange bar in this graph is the O(n²) version from the previous graph (where it was also an orange bar), and the green bar is the O(n²) version with the new optimization. It is worth noting that this second benchmark uses a collection 10 times bigger than that of the previous benchmark.
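To illustrate the kind of short-circuit described above — this is my own guess at such a heuristic with a hypothetical name (`dis_with_cutoff`), not necessarily the library's actual one — note that for a given position i, only positions j > i + current_max can improve the result, so the inner scan can skip everything that cannot beat the best distance found so far:

```cpp
#include <cassert>
#include <iterator>
#include <vector>   // container used in the usage example

// O(n^2) worst case, but each inner scan stops as soon as no remaining
// position could yield a distance larger than the current maximum.
template<typename RandomAccessIterator>
auto dis_with_cutoff(RandomAccessIterator first, RandomAccessIterator last)
    -> typename std::iterator_traits<RandomAccessIterator>::difference_type
{
    auto n = std::distance(first, last);
    decltype(n) max_dist = 0;
    for (decltype(n) i = 0; i < n; ++i) {
        // Scan right-to-left: the first inversion found for this i is
        // automatically the farthest one, so we can stop immediately,
        // and positions j <= i + max_dist cannot improve the result
        for (auto j = n - 1; j > i + max_dist; --j) {
            if (first[i] > first[j]) {
                max_dist = j - i;
                break;
            }
        }
    }
    return max_dist;
}
```

On average this prunes a large part of the inner loops, which is consistent with the kind of speedup shown in the graph above.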

Morwenn added a commit that referenced this issue Aug 30, 2020
Add a new inplace_merge overload for bidirectional iterators that
allows passing the sizes of the subranges to merge without passing a
buffer. Use this information to avoid recomputing size information as
often in the vergesort overload that handles bidirectional iterators.

This commit also cleans up the bidirectional overload of vergesort,
which had been kind of neglected.
@Morwenn Morwenn closed this as completed Sep 3, 2020