Revive pull request #1348: "Issue 9882 - Implement a "tee" style InputRange... #1965

MetaLang · 2014-02-25T00:02:51Z

...so that a function can be called during a chain of InputRanges."

brad-anderson · 2014-02-25T00:07:53Z

JakobOvrum · 2014-02-25T00:12:37Z

std/range.d

+template tee(alias func, Flag!"pipeOnFront" pipeOnFront = No.pipeOnFront)
+    if (is(typeof(unaryFun!func)))
+{
+    auto tee(Range)(Range inputRange) if (isInputRange!(Range))


nitpick: isInputRange!Range

brad-anderson · 2014-02-25T00:14:56Z

I'd change the commit message to describe what is changed rather than discuss GitHub processes. Also the Pull Request description should say what this is for.

JakobOvrum · 2014-02-25T00:15:27Z

std/range.d

+        {
+            private Range _input;
+
+            this(Range r)


This constructor is basically dead code.

JakobOvrum · 2014-02-25T00:22:40Z

The result should propagate length.

Perhaps it should also propagate bidirectionality and random access (and slicing?), with pipeOnFront taken into account on back/popBack, of course. It sounds like it would be a lot easier if it could use sub-typing.

As for the idea itself, as I've stated before, I personally don't want this in Phobos because it encourages mixing functional-style and imperative-style code. tee, as presented, is essentially imperative code disguised as functional code. In particular, I think how tee is intended for extra side-effects, masked as functional code, hurts readability.

edit:

It looks like the commit should have retained the original author as an act of courtesy.

MetaLang · 2014-02-25T00:35:19Z

"It looks like the commit should have retained the original author as an act of courtesy."

Whoops, that was completely unintended. Is there a way to fix that?

JakobOvrum · 2014-02-25T00:42:08Z

The easiest way would be to simply check out the relevant branch on the author's fork, then rebase it onto a new branch of your own, branched off master. Optionally, you could use interactive rebase's squash command to merge the original three commits into one.

MetaLang · 2014-02-25T00:45:21Z

@eco Edited with the original commit message.

@JakobOvrum Is it possible to make a new commit with the updated author? I originally checked out the author's branch, cherry-picked it into a new branch, then made a few changes and rebased, which squashed the original commit with them as the author... I think.

JakobOvrum · 2014-02-25T00:51:40Z

You'll want to remake the commit that introduces the relevant changes. The easiest way to do that is with rebase. In this case, it's also not terribly tedious to use cherry-pick, as there are only three commits. You can also use tools such as git commit --amend --author to do the same thing "manually". Note that you don't have to make a new pull request; reuse your branch then force push. If you're unfamiliar with the aforementioned tools, I'd be happy to post step-by-step instructions :)

MetaLang · 2014-02-25T00:55:59Z

git commit --amend --author sounds a lot simpler. I'll just do that with a commit that removes the outer template.

JakobOvrum · 2014-02-25T01:00:04Z

As I said, git-rebase really is the easiest way... but it does presuppose familiarity with that command.

MetaLang · 2014-02-25T01:02:38Z

I am not comfortable enough with git that I'm confident I wouldn't mess anything up.

MetaLang · 2014-02-25T01:06:01Z

Removed outer template and the redundant constructor. As for tee's usefulness, it is intended to aid in debugging range code. It is somewhat dirty inserting imperative code into a mostly-functional range chain, but we can also use debug in pure functions for debugging. This has an advantage over std.algorithm.map in that the user can choose whether to call the lambda function when front is accessed, or only on popFront.

JakobOvrum · 2014-02-25T01:20:35Z

The proposed documentation does not state that this is intended for debug code. With debug statements in pure functions, it is also obvious at the call site that the code is debug code, which is not the case for tee.

MetaLang · 2014-02-25T01:47:59Z

I have modified the documentation to make it clear that tee is useful for debugging.

monarchdodra · 2014-02-25T09:12:29Z

> The proposed documentation does not state that this is intended for debug code.

Implying tee can't be used for things other than debugging? If that is really the case, then tee should be a no-op in release builds, which is not the case at all.

monarchdodra · 2014-02-25T09:16:33Z

std/range.d

+    assert(equal(newValues2, values));
+
+    int count = 0;
+    auto newValues3 = filter!(a => a < 10)(values)


You should UFCS from the start:

auto newValues3 = values
    .filter!(a => a < 10)()
    .tee!(a => count++)()
    .map!(a => a + 1)()
    .filter!(a => a < 10)();

monarchdodra · 2014-02-25T09:22:40Z

> Perhaps it should also propagate bidirectionality and random access (and slicing?)

Propagating random access with pipeOnPop could be problematic, since you could iterate the entire range without ever popping it. You could break the interface. You can always forward length though: that's always useful, even without RA, for things like a call to reserve in array or Appender.

I think you can propagate RA safely iff pipeOnPop == false.

monarchdodra · 2014-02-25T09:27:51Z

One of the issues I am seeing is this pattern of

int sentinel;
auto range = tee!(++sentinel)( ... );
assert(range.equal([...]));
assert(sentinel == ...);

I see this as a problem, because sentinel will only be modified if the range is actually "walked". Yet all I see is an equality test inside an assert. This is problematic, since code inside an assert should have no side effects. I know it's a unittest, but it is also a documented unittest, so we don't want to promote wrong code.

I'd recommend you use a piece of code that will unconditionally walk your range, and then test the sentinel. I kind of wish we had a walkRange primitive that simply does that, but in the meantime, a call to array should work just as well (and you can then test the array result for equality). reduce!"1" could work as an alternative to walkRange.

That, or just:

bool b = r.equal([...]); //Side effect
assert (b); //Test result
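
A minimal sketch of the "walk first, then test the sentinel" suggestion, assuming tee's default behaviour of calling the function on popFront as described in this thread. The helper name countWalked and the sample values are made up for illustration; they are not taken from the pull request.

```d
import std.array : array;
import std.range : tee;

// Hypothetical helper: consumes the whole range first, then reports how
// many times the tee'd function ran.
int countWalked(int[] values)
{
    int sentinel;
    // `array` eagerly walks the range, so the side effect happens here,
    // outside of any assert.
    auto walked = values.tee!(a => sentinel++).array;
    // The assert itself only compares already-computed data.
    assert(walked == values);
    return sentinel;
}

void main()
{
    assert(countWalked([1, 2, 3]) == 3);
}
```

Because the range is consumed by array rather than inside the assert, the sentinel is updated even when asserts are compiled out.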

JakobOvrum · 2014-02-25T09:32:03Z

> Implying tee can't be used for things "other" than debug? If this is really the case, then tee should be a no-op in release, which is not the case at all.

I already stated that I don't think it's appropriate for other purposes (or for debug purposes for that matter).

monarchdodra · 2014-02-25T10:15:51Z

> I already stated that I don't think it's appropriate for other purposes (or for debug purposes for that matter).

Ah... I failed to correctly read the totality of that post. Apologies. I think your "In particular, I think how tee is intended for extra side-effects, masked as functional code, hurts readability." is relevant to the comment I made regarding "I see this as a problem, because sentinel will only be modified if the range is actually "walked"."

JakobOvrum · 2014-02-25T10:25:31Z

> Propagating random access with pipeOnPop could be problematic, since you could iterate the entire range without ever popping it. You could break the interface.

I don't see it as an issue because you can also call front (and back) freely without ever invoking the call, and I see that as equivalent of indexing. Perhaps the flag could be called pipeOnMutate to emphasise the behaviour. Regardless though, at least the behaviour of bidirectionality should be completely intuitive, so I don't see any reason not to propagate it.

monarchdodra · 2014-02-25T13:08:04Z

> I don't see it as an issue because you can also call front (and back) freely without ever invoking the call, and I see that as equivalent of indexing.

Right, but you can't iterate with front/back alone. You have to iterate on it using popFront()/popBack(). With indexing, you can iterate the entire range, but without ever popping anything.

> Perhaps the flag could be called pipeOnMutate to emphasise the behaviour.

I'd be confused by that specific name, since I'd understand it as mutating the elements themselves.

> Regardless though, at least the behaviour of bidirectionality should be completely intuitive, so I don't see any reason not to propagate it.

Agreed. That's already done though.

Poita · 2014-02-25T21:56:19Z

Agreed with @JakobOvrum. I'm worried about the side-effect nature of the way you are supposed to use this, and I'm incredibly worried about what that means when save is being forwarded. With save there, you have no idea how many times popFront or popBack will be called, so using it for things like counting is just asking for bugs.

I do think the tee problem needs to be solved, but I don't think this is the way to solve it (with or without save).

MetaLang · 2014-07-14T04:54:29Z

I'd like it if someone could comment on my use of DDOC. I'm not familiar with it at all, and I don't know if I'm using it correctly. Also, Andrei suggested the following above:

> A thought on the issues with tee: it should maintain a flag thisItemHasBeenTeed which is initialized to false in the constructor. Then all of empty, front, and popFront, when called, will examine the flag and tee the item if it is false, then set the flag to true.

My response was:

> I've already changed tee to do this when pipeOnPop is false, so fun will only be called on front once per item. Is it really necessary to duplicate this check in popFront and empty? At the very least, I don't think it's a good idea to make empty have side-effects. It doesn't make much sense for popFront, either, as the value of front is discarded with each call to popFront anyway.

Can somebody comment on this?

mihails-strasuns · 2014-07-14T10:18:29Z

> Can somebody comment on this?

I actually think what Andrei proposes is a fundamentally wrong approach. We do have a problem of unspecified standard range consumption, and this proposal is an attempt to hide it via a hack.

There was recently a similar discussion in one of the other pull requests, which I can't find right now :( I don't know how best to proceed with it: there seems to be no agreement between @WalterBright and @andralex on this topic, and keeping it as permissive as it is creates collateral damage in Phobos.

MetaLang · 2014-07-14T13:20:05Z

Perhaps we should leave it for now, then, and I can make further pull requests in the future once everyone can agree on an answer.

quickfur · 2014-07-14T14:04:39Z

I'm not sure I understand the context of @andralex's proposal. What advantage does checking the flag 3 times, in empty, front, and popFront, have over checking it only in front? I don't see how it adds anything of value, only needless overhead from what I can tell.

MetaLang · 2014-07-14T15:34:46Z

I think his desire was to have the input "tee'd" whether the user accesses front first, empty, or popFront. I did not really understand at the time either, but it seems this is in the context of trying to hash out what the conventions for range semantics should be (there was a thread on the newsgroup about it some time ago).

mihails-strasuns · 2014-07-14T21:19:33Z

@quickfur The problem is that currently one does not know which range methods will be called in each iteration cycle, or how many times each. This is unspecified, which forces all kinds of weird hacks when you try to implement something like "do this exactly once for each range element".

@MetaLang Meh, I am afraid that if we do nothing it will just get lost again. I don't know what to do :(

quickfur · 2014-07-14T22:04:10Z

Ah, I see. So basically if you chain tee to another range adaptor that skips over N elements then starts processing the rest, the sink won't receive the first N elements.
So looks like the issue here is whether the sink is supposed to receive every element in the input, no matter what, or only those that are subsequently processed downstream, etc., and whether it should be called exactly once per element, or once per downstream access, etc., and what order it will be (which is not obvious if you're dealing with a bidirectional or random access range, for example).
Based on the name, I'm thinking the idea of tee is to pipe every element of the input into the sink exactly once, in that order, since every other semantics just seems so counterintuitive it would make the resulting code extremely difficult to reason about. That being the case, I'd argue for exposing only an input range interface in the output -- so that all downstream accesses are confined to linear access.
Otherwise, I don't see how you can define sane semantics for it. For example, a forward range can be saved arbitrarily often downstream, and re-iterated over an unknown number of times. A bidirectional range can get elements on either end popped off in arbitrary order. A random access range can randomly skip around and fail to access all elements. All of these run counter to the idea of tee.
If we want to support anything more than an input range in the result, I'd argue we should have a different range, maybe called trace, that doesn't care how many times any given element is accessed and just sends it to the sink every time front is accessed. The idea being, of course, that you're using it to trace how downstream filters are accessing the original range, which is useful for debugging, etc. Perhaps the source of our woes comes from conflating tee with trace, when they are logically two different, though similar, functions.

quickfur · 2014-07-14T22:09:07Z

Bah, looks like I'm talking nonsense again: tee already always returns an input range. The issue is whether the sink should receive the elements when empty is called, or front, or popFront. I'm inclined to say it should only do this in the ctor (if the input is non-empty) and in popFront. Then the sink is guaranteed to receive each element exactly once. There's no need to wrap around empty or front, and no need for boolean flags.
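
A rough sketch of the "sink in the ctor and in popFront" idea described above. This is not the Phobos implementation: the names EagerTee and collectEager are hypothetical, and a plain delegate stands in for the sink to keep the example short.

```d
import std.range.primitives : ElementType, empty, front, isInputRange, popFront;

// Hypothetical wrapper: hands each element to the sink as soon as it becomes
// the current front, so no flag and no wrapping of empty/front is needed.
struct EagerTee(R)
if (isInputRange!R)
{
    private R _input;
    private void delegate(ElementType!R) _sink;

    this(R input, void delegate(ElementType!R) sink)
    {
        _input = input;
        _sink = sink;
        if (!_input.empty)
            _sink(_input.front); // first element is sent on construction
    }

    @property bool empty() { return _input.empty; }
    @property auto front() { return _input.front; }

    void popFront()
    {
        _input.popFront();
        if (!_input.empty)
            _sink(_input.front); // each later element is sent exactly once
    }
}

// Hypothetical helper: walks the wrapper and returns what the sink received.
int[] collectEager(int[] values)
{
    int[] seen;
    auto r = EagerTee!(int[])(values, (int a) { seen ~= a; });
    foreach (x; r) {}
    return seen;
}

void main()
{
    assert(collectEager([1, 2, 3]) == [1, 2, 3]);
}
```

One consequence of this design is that the sink sees an element before downstream code has decided whether to read it, which is the trade-off debated in the following comments.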

MetaLang · 2014-07-15T03:33:16Z

It will do so in either front or popFront, configurable by the user with pipeOnPop.
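
To illustrate the difference between the two modes, here is a small sketch assuming the behaviour described in this thread: with pipeOnPop the function runs on popFront, and without it the function runs at most once per element on front. The helper callsAfterFrontOnly is a made-up name for this example.

```d
import std.range : tee;
import std.typecons : Flag, No, Yes;

// Hypothetical helper: reads `front` twice without popping and reports how
// often the tee'd function was invoked.
int callsAfterFrontOnly(Flag!"pipeOnPop" mode)(int[] values)
{
    int calls;
    auto r = values.tee!(a => calls++, mode);
    auto first = r.front;
    auto again = r.front;
    return calls;
}

void main()
{
    // Function is only called on popFront, so reading front triggers nothing.
    assert(callsAfterFrontOnly!(Yes.pipeOnPop)([1, 2, 3]) == 0);
    // Function is called on the first front access for the current element.
    assert(callsAfterFrontOnly!(No.pipeOnPop)([1, 2, 3]) == 1);
}
```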

quickfur · 2014-07-15T04:18:30Z

Ah, I see. That makes sense. I'm inclined to say this is good to merge. I don't agree with Andrei that we should check the flag in all 3 places, especially in empty: what if the downstream range stops processing after checking empty? There is no guarantee it's actually going to continue just because empty returns false; it may have other conditions to check. In that case the element shouldn't be sent to the sink.

MetaLang · 2014-07-15T15:22:16Z

> I'm inclined to say this is good to merge.

Then what are we waiting for? @Dicebot @monarchdodra

mihails-strasuns · 2014-07-16T16:28:51Z

Ok, let's do it and see how badly it breaks :)

mihails-strasuns · 2014-07-16T16:29:05Z

Auto-merge toggled on

JakobOvrum · 2014-07-16T16:33:19Z

Blergh. I bet later we'll consider tee a mistake.

quickfur · 2014-07-16T16:44:02Z

I don't think the breakage will be very bad, if any at all, since it returns a non-forward input range, and there are only a limited number of things you can do with an input range.

mihails-strasuns · 2014-07-16T17:38:25Z

> Blergh. I bet later we'll consider tee a mistake.

I agree it is a terrible name, but I want this functionality in Phobos and am tired of endless bikeshedding.

MetaLang · 2014-07-16T17:42:16Z

I didn't get a chance to squash my commits before the merge. Does a squash after merge still work?

quickfur · 2014-07-16T18:10:15Z

It's too late now. While in theory you can modify history in master after the fact, doing so will break pretty much everyone's repo the next time they pull from master, which is a very bad idea and will make everyone hate you.

But in the end unsquashed commits don't matter, since git tracks the branching structure of the commits, so git log --graph for example will show a nice structure of exactly which commits came from which pulls. And there are other ways of filtering history so that irrelevant stuff is filtered out. No big deal.

MetaLang · 2014-07-16T18:56:26Z

Righto. Thanks.

JakobOvrum · 2014-07-17T04:36:31Z

> I agree it is a terrible name, but I want this functionality in Phobos and am tired of endless bikeshedding.

The functionality is the problem.

monarchdodra · 2014-08-29T10:48:04Z

std/range.d

+        .tee!(a => writefln("pre-map: %d", a))
+        .map!(a => a + 1)
+        .tee!(a => writefln("post-map: %d", a))
+        .filter!(a => a < 10);


This unittest is producing stdout output. This is especially bad since the unittest is undocumented, so the output is completely useless.

@MetaLang: Can it be rewritten in a way that does not write to stdout, or does so only conditionally? Could you write to an output range instead, maybe? By printing to an output range, you can also assert the correct output of your range.

In any case, the printing is disruptive to unit-testing:
https://auto-tester.puremagic.com/pull.ghtml?projectid=1&runid=1127154

MetaLang · 2014-08-29T11:28:11Z

@monarchdodra Will do. I'll make another pull request after work.

MetaLang · 2014-09-02T04:51:17Z

Pull request: #2480

Revive dlang#1348: "Issue 9882 - Implement a "tee" style InputRange so that a function can be called during a chain of InputRanges."

45c4e51

Updated documentation to say that tee is useful for debugging long UFCS chains, and that the default behaviour is to call func on popFront

512edca

MetaLang added 2 commits February 24, 2014 23:29

Forward back, popBack, and length if available. If _input.length is available statically, make it available statically in the result as well. Change pipeOnFront back to pipeOnPop because _input may have back and popBack.

8bdb382

Change documentation to reflect pipeOnPop, fix unittest failure.

1cebf18

Add explanation for overload of tee that can take a template lambda or a function.

5717690

Change comments on tee overload taking functions as OutputRanges to hide implementation details from users.

064375d

mihails-strasuns pushed a commit that referenced this pull request Jul 16, 2014

Merge pull request #1965 from MetaLang/std-range-tee-fixup

6d5ab30

Revive pull request #1348: "Issue 9882 - Implement a "tee" style InputRange...

mihails-strasuns merged commit 6d5ab30 into dlang:master Jul 16, 2014

MetaLang deleted the std-range-tee-fixup branch July 16, 2014 18:56

mihails-strasuns mentioned this pull request Jul 17, 2014

fix Issue 12409 - Add "each" function as found in Ruby and jQuery #2024

Merged

MetaLang mentioned this pull request Oct 4, 2014

Implement issue 13433 - add option for using coarser realtime clock. #2584

Merged
