Add Specializations for SortedRange #3534

nordlow · 2015-08-07T15:45:09Z

Add new predicate function std.range.primitives.isSortedRange: DONE

Add specializations for SortedRange in the followings algorithms:

sort: just return input argument if pred parameter matches: DONE
sort-alikes: just return input argument if pred parameter matches:
isSorted: return true: DONE
find and alikes, should do some kind of binary search. These could take an extra param SearchPolicy for sorted ranges that defaults to binary search.
Some of the functions that return a sub-range or a mutated range could probably be returning sorted ranges as well if its input is a sorted range, remove, strip and split at least could.
minPos: return input as is, if pred matches: DONE

In compliance with C++ STL also add

minElement: O(1) for SortedRange, O(n) otherwise: DONE
maxElement: O(1) for SortedRange!BidirectionalRange, O(n) otherwise: DONE
minmaxElement: O(1) for SortedRange!BidirectionalRange, O(n) otherwise.

See also: http://forum.dlang.org/post/yenezfjjteokzyvgmzcf@forum.dlang.org
See also: https://issues.dlang.org/show_bug.cgi?id=11667
See also: http://forum.dlang.org/post/mqskao$28vh$1@digitalmars.com

See also: http://en.cppreference.com/w/cpp/algorithm/min_element

DmitryOlshansky · 2015-08-10T06:57:48Z

O(2)

Hilarious. It's sill O(1), the number just means it's some constant factor times 1 so it doesn't matter if this number is 1, 2, 1000 since multiplier in front of it is unspecified.

DmitryOlshansky · 2015-08-10T07:02:30Z

std.range.primitives.isSortedRange

I recall what the problem we had with it was - literally things like SortedRange!(int[], binaryFun!" a < b") and SortedRange!(int[], (a,b) => a < b) and SortedRange!(int[], (b,c) => b < c) are distinct types b/c respective predicates are not identical.

It doesn't block anything but should be taken into consideration, a compiler ought to try and match lambas with identical bodies at least token-wise identical.

nordlow · 2015-08-13T15:47:35Z

Hilarious. It's sill O(1)

I kind of the new that ;)

nordlow · 2015-08-13T15:55:11Z

Any suggestions on how to compare pred lambdas?

For instance

static assert(is(typeof(binaryFun!"a<b") ==
                        typeof(binaryFun!"a<b")));
static assert(is(typeof(binaryFun!"a<b") ==
                        typeof(binaryFun!"a < b")));

both passes but

static assert(!is(typeof(binaryFun!"a<b") ==
                         typeof(binaryFun!"a>b")));

fails. What to do about this?...should we extract and reuse the CT-parsing logic of "a < b" which I presume is happening in std.functional?

It doesn't block anything but should be taken into consideration, a compiler ought to try and match lambas with identical bodies at least token-wise identical.

Is this lambda-comparison just a wish or is there something that can be used already?

Should minElement, maxElement and minmaxElement return a range instead? Or perhaps a Nullable!ElementType!R. The corresponding C++ algorithms all return iterators.
Should we add a specialization for reduce aswell so that the user isn't force to use minElement to get these optimizations?

JakobOvrum · 2015-09-01T06:39:54Z

std/algorithm/searching.d

@@ -2777,23 +2777,30 @@ smallest element and with the same ending as $(D range). The function
 can actually be used for finding the maximum or any other ordering
 predicate (that's why $(D maxPos) is not provided).
 */
-Range minPos(alias pred = "a < b", Range)(Range range)
+auto minPos(alias pred = "a < b", Range)(Range range)


The return type is Range for all paths; explicit return type is better for documented functions as it provides important information to readers.

JakobOvrum · 2015-09-01T07:02:07Z

Functions that return subranges often already them as the same type as the whole range; this already works with SortedRange.

Comparing predicates has been discussed a lot before, particularly in the context of string lambdas. One such thread:
http://forum.dlang.org/post/jnlqesrwxfekdsxjerlp@forum.dlang.org (Sorry, I have a hard time digging up the older ones)

quickfur · 2016-01-06T20:06:53Z

I suggest to break this PR up into smaller pieces so that it's easier to review, and the non-controversial parts can be merged first while we work on the other parts. Otherwise this will go very slowly and take too long to get merged.

quickfur · 2016-01-06T20:07:36Z

As a start, I'd say do a separate PR per overload set, as a rough guideline. I think that's much easier to review, and safe to merge piecemeal.

quickfur · 2016-02-11T19:43:41Z

ping @nordlow
Let's move forward with this?

nordlow · 2016-02-29T09:16:12Z

@quickfur Should I start with an initial pull for the new std.range.primitives.isSortedRange eventhough I have not not found an ideal way to detect which pred argument that was given as template argument to isSortedRange?

One way to move forward with this PR is to restrict logic to a specific set of expressions such as

"a < b"
"a > b"
"a <= b"
"a >= b"

which should cover most uses. All other inputs should trigger a compile-time error with a nice descriptive message.

quickfur · 2016-02-29T17:55:03Z

Hmm. On second thoughts, there is no good way to compare two lambdas (whether string or function literal -- the latter because the compiler has no implementation of such a comparison). So there is no good solution for moving this PR forward.

The problem is that comparing two arbitrary lambdas is, in the most general case, uncomputable. For most practical purposes, though, we can reduce the problem to something tractable by ignoring non-trivial equivalences such as (a < b) == !(a >= b), and just looking at AST equivalence. Of course, that also needs some further restriction, since a < b+x may mean different things if x is bound to the surrounding context of the lambda, so if they are bound to two different contexts they should not compare as equal.

But in any case, this requires compiler support, and I agree with the sentiment that we should not promote string lambdas anymore so it's kinda pointless to support these functions just for string lambdas. Let's wait until we have a DIP on how to implement lambda comparisons.

MetaLang · 2016-02-29T19:54:31Z

Of course, that also needs some further restriction, since a < b+x may mean different things if x is bound to the surrounding context of the lambda, so if they are bound to two different contexts they should not compare as equal.

I'm not sure if you're saying that this is bad, but I'd consider that to be a pretty good thing.

quickfur · 2016-02-29T20:16:12Z

I was just thinking aloud. Obviously, it would be very bad if the two lambdas in the following code compared equal!

auto sortedRange1(int[] a)
{
    auto x = 1;
    return a.sort!((a,b) => a < x*b);
}
auto sortedRange2(int[] a)
{
    auto x =  -1;
    return a.sort!((a,b) => a < x*b);
}

sigod · 2016-03-01T14:54:12Z

I have a question: Why not to add specializations directly into SortedRange?

nordlow · 2016-03-01T14:59:52Z

That might be cleaner, thanks to UFCS and because isSortedRange doesn't have to be imported.

AFAIK: Lambda comparison problem will remain, though.

quickfur · 2016-03-01T16:20:30Z

Actually, if the specializations are added as member functions of SortedRange, then we don't need isSortedRange and lambda comparisons, because the member functions will have direct access to the sorting predicate and the range can safely be assumed to be sorted (since otherwise it wouldn't be a SortedRange to begin with).

Given the current state of things, that could potentially be the better way to go right now.

nordlow · 2016-03-01T16:36:47Z

Ok, I'll look into it.

I guess we should be begin with calls such as:

sr.sort
sr.find(element)
sr.isSorted
sr.minPos

where sr is an instance of a SortedRange.

I guess find doesn't need the `SearchPolicy´ argument then, right?

Anymore low-hanging fruits?

But note that this, of course, only works for calls to sort, isSorted, minPos without explicit lambda-argument pred. But I guess that's ok, right?

sigod · 2016-03-01T16:45:12Z

then we don't need ... lambda comparisons

There's slight problem. For example, if range were sorted with "a < b" and user wants to sort it again with "a > b". Ideally it's just a call to retro.

sigod · 2016-03-01T16:56:35Z

Anymore low-hanging fruits?

minCount, probably.

quickfur · 2016-03-01T18:13:08Z

@sigod We can't possibly know about all these special cases. It would be up to user code to detect such a case and use retro instead of sort. (Of course, ideally, sort should be using an algorithm that behaves pretty closely to calling retro when the incoming range is sorted in reverse order.) But there's also the question of how common is it to need to call sort on a sorted range with the reverse predicate, to justify including this special case in the standard library. If this is a rare use case, it doesn't justify the cost of additional complexity in SortedRange.

(Not to mention that determining whether two predicates are the opposite of each other, generally speaking, is uncomputable. It's easy to detect built-in operators < and >, but what if the user type implements a non-trivial opCmp? What if opCmp is a partial order, rather than a total order? Etc.)

sigod · 2016-03-01T19:51:37Z

(Not to mention that determining whether two predicates are the opposite of each other, generally speaking, is uncomputable. It's easy to detect built-in operators < and >, but what if the user type implements a non-trivial opCmp? What if opCmp is a partial order, rather than a total order? Etc.)

This didn't occurred to me. Then I think such detection is out of question.

But note that this, of course, only works for calls to sort, isSorted, minPos without explicit lambda-argument pred. But I guess that's ok, right?

We probably could easily add support for the same pred with which SortedRange were constructed.

wilzbach · 2016-04-29T09:25:35Z

@nordlow {min,max}Element are now in Phobos (crowd goes woohoo!). I actually didn't know you also had them in this PR ;-)
Thus could you please remove them from your PR?

Actually, if the specializations are added as member functions of SortedRange, then we don't need isSortedRange and lambda comparisons

What else was blocking this PR?

nordlow · 2016-04-29T09:55:54Z

But we still need SortedRange-specializations for {min,max,minmax}Element, right? The default implementations of them don't care about sortedness (SortedRange).

Should these overloads be implemented as free functions take a SortedRange as argument or via SortedRange-members?

I'll restrict the overloads in this pull to only operate on SortedRange with a predicate that matches the predicate of minElement.

wilzbach · 2016-04-29T11:27:09Z

I'll restrict the overloads in this pull to only operate on SortedRange with a predicate that matches the predicate of minElement.

Please use static if - we should put such details to the implementation. Have a look at the post by big boss.
You should also have a look at std.algorithm.search: expose extremum #4257 - it's an "optimization" for the default case of an empty lambda.
How do you know that the mapping function if given and the sorting pred match? -> imho we can only make the static if if no mapping function is used (see 2))

sigod · 2016-04-29T11:35:17Z

Should these overloads be implemented as free functions take a SortedRange as argument or via SortedRange-members?

I vote for SortedRange-members. It's easier and cleaner.

wilzbach · 2016-04-29T11:43:30Z

I vote for SortedRange-members. It's easier and cleaner.

How do you avoid that SortedRanges match the other template? Won't you have to add something like !isSortedRange!R

sigod · 2016-04-29T12:01:33Z

How do you avoid that SortedRanges match the other template?

It's intended for SortedRange-members to "override" some of the common templates.

Won't you have to add something like !isSortedRange!R

I hope not.

nordlow · 2016-04-29T12:59:25Z

But to support overloading of minElement(Range, Seed) don't we need to override behaviour through a free function overload

minElement(R)(R r)
    if (isSortedRange!R)
{ ... }

?

So if is defined as member function in SortedRange a call minElement(sortedRange) won't pick the overload. or does D's UFCS work bothways? I don't remember. I only make use of it as: r.f(...) => f(r, ...), not vice versa.

sigod · 2016-04-29T13:19:04Z

minElement(Range, Seed)

Yes, for such use we need free function overload. To be honest I didn't consider it. I always use UFCS.

or does D's UFCS work bothways?

No. Only f(r, ...) => r.f(...).

nordlow · 2016-04-29T13:42:56Z

So, AFAICT, the only current solution that harmonizes with Phobos' own {min,max}Element is to provide specialization via member-call syntax only then, right? I hope I'm wrong...

Update: Can't we still use if (isInstanceOf(SortedRange, R) as restriction for free function overload solution?

sigod · 2016-04-29T13:58:08Z

So, AFAICT, the only current solution is to provide specialization is via member-call syntax then, right?

That or we need to deal with comparison of predicates. I'd go with former.

JackStouffer · 2017-05-14T20:22:22Z

Closing due to inactivity. @nordlow If you want to reopen this and continue working on it, please let me know.

nordlow force-pushed the sortedrange-specializations branch from 7ac09c3 to 644087d Compare August 18, 2015 13:01

JakobOvrum reviewed Sep 1, 2015
View reviewed changes

nordlow added 22 commits September 3, 2015 09:33

Improve comment

9e13329

First try at isSortedRange

990f5d9

First working version of isSortedRange

a435e4a

Add alias fun in unittest

a0c62d9

Move isSortedRange to primitives

79ae662

Add failing use of isSortedRange in sort()

95e8d55

Make isSortedRange more robust

513c680

Add comment

ac89671

Add specialization of isSorted

07063e0

Remove import std.range.primitives: isSortedRange;

0670174

Support minPos

144c686

Bugfix

cb2c335

Add minElement and maxElement

54ee64a

Move empty check in minPos to else clauses

9f3e555

Add space

88ac1b0

Only calculate TemplateArgsOf!T once

88f586d

Use __traits(isSame, )

dbcf473

Fix bug in isSortedRange

d6d9c1f

Robuster isSortedRange

681d476

More general standardizePredicate

f6b9df7

Better naming comments for standardizePredicate*

2812b99

Change return type of minPos from auto to Range

b7bf452

quickfur added the Merge:Blocked label Feb 29, 2016

nordlow mentioned this pull request Dec 7, 2016

Issue 8829 - std.algorithm.find fails to take advantage of SortedRange #4907

Merged

JackStouffer closed this May 14, 2017

wilzbach mentioned this pull request Jul 24, 2017

Fix Issue 17679 - SortedRange.contains should be deprecated in favor of the generic canFind #5651

Closed

Uh oh!

Add Specializations for SortedRange #3534

Add Specializations for SortedRange #3534

Uh oh!

Conversation

nordlow commented Aug 7, 2015

Uh oh!

DmitryOlshansky commented Aug 10, 2015

Uh oh!

DmitryOlshansky commented Aug 10, 2015

Uh oh!

nordlow commented Aug 13, 2015

Uh oh!

nordlow commented Aug 13, 2015

Uh oh!

JakobOvrum Sep 1, 2015

Choose a reason for hiding this comment

Uh oh!

nordlow Sep 3, 2015

Choose a reason for hiding this comment

Uh oh!

JakobOvrum commented Sep 1, 2015

Uh oh!

quickfur commented Jan 6, 2016

Uh oh!

quickfur commented Jan 6, 2016

Uh oh!

quickfur commented Feb 11, 2016

Uh oh!

nordlow commented Feb 29, 2016

Uh oh!

quickfur commented Feb 29, 2016

Uh oh!

MetaLang commented Feb 29, 2016

Uh oh!

quickfur commented Feb 29, 2016

Uh oh!

sigod commented Mar 1, 2016

Uh oh!

nordlow commented Mar 1, 2016

Uh oh!

quickfur commented Mar 1, 2016

Uh oh!

nordlow commented Mar 1, 2016

Uh oh!

sigod commented Mar 1, 2016

Uh oh!

sigod commented Mar 1, 2016

Uh oh!

quickfur commented Mar 1, 2016

Uh oh!

sigod commented Mar 1, 2016

Uh oh!

wilzbach commented Apr 29, 2016

Uh oh!

nordlow commented Apr 29, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wilzbach commented Apr 29, 2016

Uh oh!

sigod commented Apr 29, 2016

Uh oh!

wilzbach commented Apr 29, 2016

Uh oh!

sigod commented Apr 29, 2016

Uh oh!

nordlow commented Apr 29, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sigod commented Apr 29, 2016

Uh oh!

nordlow commented Apr 29, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sigod commented Apr 29, 2016

Uh oh!

JackStouffer commented May 14, 2017

Uh oh!

Uh oh!

nordlow commented Apr 29, 2016 •

edited

Loading

nordlow commented Apr 29, 2016 •

edited

Loading

nordlow commented Apr 29, 2016 •

edited

Loading