Strange benchmark results with std::string #4

Morwenn · 2019-01-29T18:45:17Z

Hi,

I recently tried to rerun the benchmarks to check another simpler Inv-adaptive algorithm against drop-merge-sort. I got expected results with int, but the std::string benchmark gives the following result:

In the graph above, pdq_sort is an algorithm similar to std::sort, split_sort is the algorithm I was working on, and drop_merge_sort is similar to the implementation from this repository (I also tried the implementation from here, and it gave me similar results). The relative curves of pdq_sort and split_sort, while not as pretty as the ones in your README, are roughly what I expected. However, I can't explain the one for drop_merge_sort. Do you have any idea what might be happening here, keeping in mind that the benchmark for integers gives the expected results?

EDIT: it says "sorting 10^6 int" but that's only because I forgot to change the graph title. These are actually the results of the string benchmark.

Morwenn · 2019-01-29T20:55:43Z

I just investigated a bit and apparently when drop-merge-sort is benchmarked with std::string, the resulting dropped vector never contains even a single element prior to the merge. I'd say that the strange behaviour comes from the data generation in the benchmark more than from the algorithm itself.

So far it seems that the condition if (begin != write && comp(*read, *(write - 1))) never evaluates to true in the benchmark. However, a check showed that randomize doesn't produce an already sorted collection, so I don't really know what's going on there.
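To tell a data generation problem apart from an algorithm problem, it can help to validate the output of every run. Below is a minimal sketch of such a check; is_valid_sort_result is a hypothetical helper, not something from this repository or its benchmarks, and it assumes the element type supports operator< and operator==.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical helper (not from the repository): returns true when `output`
// is sorted and holds exactly the same elements as `input`.
template <typename T>
bool is_valid_sort_result(std::vector<T> input, const std::vector<T>& output) {
    if (!std::is_sorted(output.begin(), output.end())) {
        return false;
    }
    // Sort the copy of the input with a trusted algorithm, then compare.
    std::sort(input.begin(), input.end());
    return input == output;
}
```

Comparing against the sorted input, rather than only calling std::is_sorted, also catches outputs whose elements were altered or lost along the way.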

Morwenn · 2019-01-30T11:30:57Z

Ok, I eventually found the issue: when I uncomment the "sanity check" in the benchmarks, it throws an exception. The issue does come from dmsort and not from the benchmark itself. It is in the following piece of code:

} else {
    *write = std::move(*read);
    ++read;
    ++write;
    num_dropped_in_row = 0;
}

Here *write = std::move(*read); actually triggers a self-move when write == read. Adding a self-assignment check solves the problem and I get coherent results again:

} else {
    if (read != write) {
        *write = std::move(*read);
    }
    ++read;
    ++write;
    num_dropped_in_row = 0;
}
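For illustration, here is a small standalone sketch of the same read/write pattern with the guard in place. compact_if is a hypothetical function written for this comment, not the dmsort code, and the assumption is that a self-move-assigned std::string is only guaranteed to be in a valid but unspecified state (some standard library implementations may leave it empty).

```cpp
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for the read/write loop: keeps the elements for which
// `keep` returns true, compacted at the front, and returns the new logical end.
template <typename Iterator, typename Predicate>
Iterator compact_if(Iterator begin, Iterator end, Predicate keep) {
    Iterator write = begin;
    for (Iterator read = begin; read != end; ++read) {
        if (keep(*read)) {
            // Until an element has been skipped, read == write: moving then
            // would be a self-move, so it is skipped entirely.
            if (read != write) {
                *write = std::move(*read);
            }
            ++write;
        }
    }
    return write;
}
```

As long as no element has been dropped, the two iterators advance together, which is exactly the case where the unguarded version would move an element onto itself.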

For performance reasons, I don't think the algorithm needs to perform that check above for trivially copyable types.
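One possible way to express that, as a sketch only: move_unless_self is a hypothetical helper name, and the idea is that the type trait is a compile-time constant, so an optimizing compiler can typically drop the iterator comparison for trivially copyable types.

```cpp
#include <string>
#include <type_traits>
#include <utility>

// Hypothetical helper: moves *read into *write. The read != write comparison
// only matters for types that are not trivially copyable; for trivially
// copyable types the first operand is a constant `true`.
template <typename Iterator>
void move_unless_self(Iterator read, Iterator write) {
    using value_type = typename std::remove_reference<decltype(*read)>::type;
    if (std::is_trivially_copyable<value_type>::value || read != write) {
        *write = std::move(*read);
    }
}
```

The else branch in the snippet above would then call move_unless_self(read, write) in place of the explicit if. Whether this makes a measurable difference would need to be benchmarked.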

adrian17 · 2019-02-04T14:30:30Z

Thank you for investigating this! Just curious, which compiler version and flags triggered the exception? I'd like to experiment with it a bit myself.

Next weekend I'll test it out, rerun the benchmarks and submit the patch.

Morwenn · 2019-02-04T16:21:44Z

I'm using MinGW-w64, either 32 or 64 bits. I otherwise still have issues (not with your code this time) with libc++ on OSX with other std::string tests which don't fail on Linux with libstdc++, so... Maybe check either libc++ on OSX or MinGW-w64 depending on what you have.

adrian17 · 2019-02-04T16:36:02Z

Alright. If I remember correctly, back then I tested it on Linux with g++ 5.4 and clang++ 3.8. Will try to repeat tests on Windows and on newer Linux VMs.

This was referenced Jan 30, 2019

Self-moves break several sorting algorithms Morwenn/cpp-sort#141

Closed

Some algorithms don't work with long strings Morwenn/cpp-sort#142

Closed

Morwenn mentioned this issue Mar 10, 2019

Guard against self-move (fixes #4) #5

Merged

adrian17 closed this as completed in 4d9318f Mar 10, 2019
