adds R.stableSort and tests #1456

ConorLinehan · 2015-10-20T22:23:06Z

Addresses #1135.

buzzdecafe · 2015-10-20T22:29:24Z

src/stableSort.js

+    if (comparator(a, b) === 0) {
+      return -(copyList.indexOf(a) - copyList.indexOf(b));
+    } else {
+      return comparator(a, b);


why call comparator twice?

davidchambers · 2015-10-20T22:31:26Z

Should we update R.sort rather than define a new function?

scott-christopher · 2015-10-21T01:12:10Z

src/stableSort.js

+
+  return copyList.sort(function(a, b) {
+    if (comparator(a, b) === 0) {
+      return -(copyList.indexOf(a) - copyList.indexOf(b));


An alternative to scanning the list twice with indexOf here would be to zip up the input list with the index of each field and then use the index to perform the secondary comparison.

e.g. something like:

var stableSort = R.curry(function (comparator, list) { var withIndex = R.zip(list, R.range(0, list.length)); var sortedWithIndex = withIndex.sort(function(a, b) { var order = comparator(a[0], b[0]); return order === 0 ? a[1] - b[1] : order; }); return R.map(R.head, sortedWithIndex); });

I'd be interested to see how these two compare for different sized lists.

CrossEye · 2015-10-21T01:29:46Z

Should we update R.sort rather than define a new function?

👍

CrossEye · 2015-10-21T01:31:01Z

An alternative to scanning the list twice with indexOf here would be to zip up the input list with the index of each field and then use the index to perform the secondary comparison. [ ... ]

I'd be interested to see how these two compare for different sized lists.

Agreed. I've used that before for stable sorts.

ConorLinehan · 2015-10-21T21:47:05Z

@scott-christopher Heres a benchmark of the different methods http://jsperf.com/stable-sort-ramda

oskarkv · 2016-03-22T13:19:10Z

Hey guys, where are you on this?

CrossEye · 2016-03-23T01:59:13Z

@oskarkook:

Nowhere at the moment. It's fallen between the cracks.

@ConorLinehan: Any interest in addressing the concerns brought up in comments here? I would like to include this.

ConorLinehan · 2016-03-23T16:19:19Z

@CrossEye Yup will do :)

ConorLinehan · 2016-03-23T17:20:17Z

Should we update R.sort rather than define a new function?

After doing the benchmarks (http://jsperf.com/stable-sort-ramda) with both discussed methods do we still want to update R.sort or create the new R.stableSort function with the following implementation

   var stableSort1 = R.curry(function(comparator, list) {
      var withIndex = R.zip(list, R.range(0, list.length));
      var sortedWithIndex = withIndex.sort(function(a, b) {
        var order = comparator(a[0], b[0]);
        return order === 0 ? a[1] - b[1] : order;
      });
      return R.map(R.head, sortedWithIndex);
    });

CrossEye · 2016-03-23T18:45:48Z

I would look for ways to optimize the solution, but at a guess, we may need to have a separate stableSort version.

scott-christopher · 2016-03-23T20:36:29Z

I've added an extra benchmark case here which uses the same method, but hand-rolls the zip-with-index, etc. It shows a slight boost of performance in Chrome, but negligible in FF. 🤷

    var stableSort3 = R.curry(function(comparator, list) {
      var sorted = [];
      var idx;

      idx = 0;
      while (idx < list.length) {
        sorted.push([list[idx], idx]);
        idx += 1;
      }

      sorted.sort(function(a, b) {
        var order = comparator(a[0], b[0]);
        return order === 0 ? a[1] - b[1] : order;
      });

      idx = 0;
      while (idx < list.length) {
        sorted[idx] = sorted[idx][0];
        idx += 1;
      }
      return sorted;
    });

davidchambers · 2016-03-24T15:27:24Z

src/stableSort.js

+ * @memberOf R
+ * @category List
+ * @sig (a,a -> Number) -> [a] -> [a]
+ * @param {Function} comparator A sorting function :: a -> b -> Int


Let's remove a -> b -> Int here. It's confusing that it doesn't match (a,a -> Number) above, and at any rate it's unnecessary to duplicate the information.

davidchambers · 2016-03-24T15:38:57Z

I would look for ways to optimize the solution, but at a guess, we may need to have a separate stableSort version.

I don't like the sound of this. Users would then need to choose between the "fast" sorting function which doesn't always do the right thing and the "slow" sorting function which does. This is incidental complexity. I'm imagining the entries in @svozza's I want to… table. I want to sort a list quickly and I want to stably sort a list (and I'm not in a hurry). ;)

CrossEye · 2016-03-24T18:33:47Z

I don't like the sound of this. Users would then need to choose between the "fast" sorting function which doesn't always do the right thing and the "slow" sorting function which does.

Which could also be written as, I want to sort a list and I want to sort a list so that equivalent items retain their original order, without mentioning speed or with just trailing parenthetical (faster) and (slower) notations.

We can certainly look for more performant sorters. There is no reason to rely on Array.prototype.sort. If we want to take the complexity hit, we can use Timsort, or we could try a mergesort. There are plenty of options.

Or we can live with the performance hit. That might well be the best, but it certainly would seem worth pursuing the options before making that decision.

davidchambers · 2016-03-24T18:43:47Z

My view is that we should make a decision on behalf of our users. If we consider stable sorting to be important enough to justify a significant performance hit, we should update R.sort. If not, we should include a recipe for stable sorting in the cookbook and reference it in the description of R.sort.

CrossEye · 2016-03-24T18:56:23Z

I can probably agree to that. It's probably not worth having a second function. But I think there is more investigation necessary to find if we can have our cake and still eat at least some of it.

ConorLinehan · 2016-03-24T19:15:06Z

Paraphrasing from this, http://stackoverflow.com/questions/3026281/array-sort-sorting-stability-in-different-browsers, it seems chrome uses a none stable sort and are sticking to it(https://bugs.chromium.org/p/v8/issues/detail?id=90)
Last post

Project Member Comment 58 by jkummerow@chromium.org, Nov 10, 2015
Using O(n) stack space is out of the question, as the stack is way too limited (less than 1 MB). Observe how the existing implementation limits itself to O(log n) stack space even in the worst case (i.e. pathologically unlucky pivot elements) by avoiding the recursive descent into the larger half of the array (lines 1036-1042).

All the other browsers seem to use a stable sort. So from the above benchmarks here we would only be able to get near to firefox. Currently we're roughly 50% slower than firefox 47, So I suppose it's picking an acceptable tolerance?

CrossEye · 2016-03-24T19:27:19Z

But see also a comparison with various merge-sort implementation:

http://jsperf.com/stable-sort-comparison/5

Timsort is quite a bit faster, but it also is substantially more complex to implement. But most merge-sort implementation are stable, I believe. That might be enough.

If we want stability, we obviously cannot use the native sort. Even if all the current crop of browsers did use a stable version, there is simply no guarantee. We could build atop the native version as above, or we could use our own, mergesort, timsort, or one of many other choices.

CrossEye · 2016-03-25T13:36:29Z

So, should we consider an implementation like this instead?:

module.exports = _curry2(function sort(comparator, list) {

  return msort(_slice(list), 0, list.length);

  function msort(list, begin, end) {
    var size = end - begin;
    if (size < 2) return;
    var middle = begin + Math.floor(size / 2);

    msort(list, begin, middle);
    msort(list, middle, end);
    merge(list, begin, middle, end);
    return list;
  }

  function merge(list, begin, middle, end) {
    while (begin < middle) {
      if (comparator(list[begin], list[middle]) > 0) {
        var val = list[begin];
        list[begin] = list[middle];
        insert(list, middle, end, val);
      }
      begin += 1;
    }
  }

  function insert(list, begin, end, val) {
    while (begin + 1 < end && comparator(list[begin + 1], val) < 0) {
      swap(list, begin, begin + 1);
      begin += 1;
    }
    list[begin] = val;
  }

  function swap(list, a, b) {
    var tmp = list[a];
    list[a] = list[b];
    list[b] = tmp;
  }
});

(Perhaps this would need to be pulled into an _internal version to be reused by sortBy.)

Of course this depends hugely on internal mutation, but that doesn't escape the function, so I don't see any issues.

This is a really basic merge sort and maybe one of the ones at http://jsperf.com/stable-sort-comparison/5 would be faster, but anything based on the same ideas might not only add stability but also actually be more performant (for now) than Array.prototype.sort.

ajhyndman · 2018-02-12T01:13:30Z

I'd be keen to see this move ahead. I think a not-guaranteed-to-be-stable sort implementation is a bit of a foot-gun in JavaScript, and I think Ramda would do well to guard users from that trap.

davidchambers · 2018-02-12T01:20:24Z

It's worth mentioning that Z.sort performs a stable sort. Ramda would get stable sorting as part of the package if #2347 is merged one day.

wojpawlik · 2019-12-23T18:36:50Z

#2926 (comment)

adds R.stableSort and tests

874b7c4

buzzdecafe reviewed Oct 20, 2015
View reviewed changes

scott-christopher reviewed Oct 21, 2015
View reviewed changes

ConorLinehan added 2 commits March 23, 2016 22:21

changes to more performant implementation

92d7c2d

changes to more performant implementation

e3f12c1

davidchambers reviewed Mar 24, 2016
View reviewed changes

CrossEye mentioned this pull request Nov 23, 2016

sortWith, ascend and descend methods #1946

Merged

CrossEye added the 1.0 discussion label Oct 16, 2017

wojpawlik closed this Dec 23, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adds R.stableSort and tests #1456

adds R.stableSort and tests #1456

ConorLinehan commented Oct 20, 2015

buzzdecafe Oct 20, 2015

davidchambers commented Oct 20, 2015

scott-christopher Oct 21, 2015

CrossEye commented Oct 21, 2015

CrossEye commented Oct 21, 2015

ConorLinehan commented Oct 21, 2015

oskarkv commented Mar 22, 2016

CrossEye commented Mar 23, 2016

ConorLinehan commented Mar 23, 2016

ConorLinehan commented Mar 23, 2016

CrossEye commented Mar 23, 2016

scott-christopher commented Mar 23, 2016

davidchambers Mar 24, 2016

davidchambers commented Mar 24, 2016

CrossEye commented Mar 24, 2016

davidchambers commented Mar 24, 2016

CrossEye commented Mar 24, 2016

ConorLinehan commented Mar 24, 2016

CrossEye commented Mar 24, 2016

CrossEye commented Mar 25, 2016

ajhyndman commented Feb 12, 2018

davidchambers commented Feb 12, 2018

wojpawlik commented Dec 23, 2019

adds R.stableSort and tests #1456

adds R.stableSort and tests #1456

Conversation

ConorLinehan commented Oct 20, 2015

buzzdecafe Oct 20, 2015

Choose a reason for hiding this comment

davidchambers commented Oct 20, 2015

scott-christopher Oct 21, 2015

Choose a reason for hiding this comment

CrossEye commented Oct 21, 2015

CrossEye commented Oct 21, 2015

ConorLinehan commented Oct 21, 2015

oskarkv commented Mar 22, 2016

CrossEye commented Mar 23, 2016

ConorLinehan commented Mar 23, 2016

ConorLinehan commented Mar 23, 2016

CrossEye commented Mar 23, 2016

scott-christopher commented Mar 23, 2016

davidchambers Mar 24, 2016

Choose a reason for hiding this comment

davidchambers commented Mar 24, 2016

CrossEye commented Mar 24, 2016

davidchambers commented Mar 24, 2016

CrossEye commented Mar 24, 2016

ConorLinehan commented Mar 24, 2016

CrossEye commented Mar 24, 2016

CrossEye commented Mar 25, 2016

ajhyndman commented Feb 12, 2018

davidchambers commented Feb 12, 2018

wojpawlik commented Dec 23, 2019