First draft at tee algorithms, for critique #302
Conversation
Wait, upon re-reading #271 I realized this is contradictory. What I meant to say is that if both …
Glad to see this part getting spec'd, but I have a few questions:
I guess I'm thinking of how our native C++ stream does tee'ing when I read this. Our nsPipe class internally maintains a single buffer. Every tee'd stream then has its own cursor into that shared buffer. The buffer is split up into segments, and as the slowest reader finishes a segment it is freed back to the allocator. I think this is more efficient than duplicating buffers in two queues, etc. Do you anticipate us being able to implement streams returned from DOM APIs using this kind of native code?
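The segmented-buffer-plus-cursors scheme described above can be sketched in a few lines. This is a hypothetical illustration (the class and method names are made up, not nsPipe's actual API): all branches share one queue of segments, each branch is just a cursor, and a segment is freed once the slowest reader has passed it.

```javascript
// Hypothetical sketch of the nsPipe-style tee: one shared segmented
// buffer, one cursor per branch, segments freed behind the slowest reader.
class SharedBufferTee {
  constructor() {
    this.segments = []; // queue of not-yet-fully-consumed chunks
    this.base = 0;      // absolute index of segments[0] in the stream
    this.cursors = [];  // one absolute read position per branch
  }
  addBranch() {
    const id = this.cursors.length;
    this.cursors.push(this.base);
    return id;
  }
  enqueue(chunk) {
    this.segments.push(chunk);
  }
  read(branchId) {
    const pos = this.cursors[branchId];
    if (pos - this.base >= this.segments.length) return undefined; // nothing buffered
    const chunk = this.segments[pos - this.base];
    this.cursors[branchId] = pos + 1;
    this.release();
    return chunk;
  }
  // free segments that every branch has consumed, as nsPipe does
  release() {
    const slowest = Math.min(...this.cursors);
    while (this.base < slowest) {
      this.segments.shift();
      this.base++;
    }
  }
}
```

Note there is no duplication at all here; the cost is that memory is only reclaimed at the pace of the slowest branch.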
@wanderview thanks for jumping into this!!
```js
const [branch1, branch2] = SpeculativeTeeReadableByteStream(rbs);

const reader1 = branch1.getReader({ feedBuffers: true });
const reader2 = branch2.getReader({ feedBuffers: true });

const view1 = new Uint8Array(512);
const view2 = new Uint8Array(1024);

reader1.read(view1).then(({ value, done }) => {
  // value uses same backing memory as view1, via transference
  // we were able to pass view1 directly to the `reader` for `rbs`
});

reader2.read(view2).then(({ value, done }) => {
  // value uses same backing memory as view2
  // however, how the data gets there was a bit different, and less efficient:
  // once rbs was finished with view1, we cloned it into a new buffer that we enqueued
  // into branch2's internal queue. then, the call to reader2.read(view2) caused us to
  // copy the queued buffer into the backing memory used by view2.
  // hmm, there's a redundant copy here :-/ ... might be able to avoid ...
});
```
Hmm, very interesting. It sounds like the tee'd streams are not really of the same "type" as the original stream? That is, they are not very general purpose, but instead there's a cooperation between the branches and the original, where the branches largely consist of a cursor and not much else? The duplication is not really avoidable when dealing with the no-data-races-in-JS mandate. But I'll try to think on this more to see if there's something we can learn.
Do you mean "ReadableByteStream.prototype.tee() then returns SpeculativeTeeReadableByteStream(this)"? So SpeculativeTeeReadableByteStream() conforms to the same semantics as TeeReadableStream()? I think it's the word "speculative" that's throwing me off here. Defining the contract that all tee() functions must conform to separately might help me. I like to see interface separate from implementation, etc.
Well, if you do … The one case I see where the clone argument makes sense for a single JS context like this is if the ReadableStream chunks are mutable. In that case branch1 could read chunks, modify them, and then branch2 would see the modifications when it later reads. Doing the clone would avoid that mutation from being observed in branch2.
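The mutation hazard described here is easy to demonstrate with plain objects. This is a standalone illustration (the queues are made up for the sketch, not part of any tee implementation): without a clone, both branches' queues hold the same chunk object, so branch1's mutation is visible to branch2.

```javascript
// Illustrating the hazard: a naive tee shares the chunk object itself.
const chunk = { data: [1, 2, 3] };

const queue1 = [chunk];                        // branch1's internal queue
const queue2NoClone = [chunk];                 // branch2, no clone: same object
const queue2Cloned = [structuredClone(chunk)]; // branch2 with the clone option

// branch1's consumer mutates the chunk after reading it
queue1[0].data.push(4);

console.log(queue2NoClone[0].data.length); // 4: mutation observed in branch2
console.log(queue2Cloned[0].data.length);  // 3: the clone shielded branch2
```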
Well, no. From the consumer's point of view they are exactly the same as the original. The consumer just sees a stream interface with a read() method. For this nsPipe case the original stream was originally just a cursor with an underlying buffer; the buffer just wasn't shared yet. Adding another branch to the nsPipe doesn't affect the original stream at all. Of course, this is a purely byte stream, so maybe it only applies to ReadableByteStream.
Yes, sorry. My "speculative" adjective is indeed confusing things; it's only meant there as "if we actually had ReadableByteStream in the spec/reference implementation, I think this is what it would look like." It's not about the semantics of the tee. So, implicitly, if we actually had a ReadableByteStream to add a ReadableByteStream.prototype.tee to, I would probably have removed the "Speculative" prefix at that point. And yes, the semantics should be the same.
Definitely. Roughly, I think it would be: …
I think I see. So, I think postMessaging a stream actually doesn't use the tee functionality. Instead it just grabs a reader and sends the bytes over the wire to the counterpart stream. So similar to how when you … So in particular, …
If you do this, …
Right, that's pretty much all objects in JS, including …
Yeah, I meant, in terms of them being different implementations of the same interface (and thus in JS, different classes). This might manifest as creating a new stream class of some sort, TeeBranchReadableByteStream or something, which reaches into the innards of its parent as arranged by the tee algorithm. I am unsure this makes that much sense given that we need to clone anyway for pretty much all cases, so it's not like we gain efficiency by having multiple pointers into the same buffer. But, maybe it can help me avoid that extra allocation I noted in the example...
Ok, this is where I had different expectations. I was thinking … Is it consumed because of the way this is written as an external function right now? It seems like particular stream implementations could be smart enough to copy their internal details to produce a new branch. In Gecko we actually do something similar:
The constraints that force us to use the external function in native code don't seem to apply to streams exposed in JS, though. So I'm trying to understand why we can't require stream implementations to produce a tee without any observable state change in the original stream handle.
In my opinion it is more natural from the Streams POV, though it looks a bit strange from the Fetch API POV (…).
```js
function maybeCancelSource() {
  if (canceled1 && canceled2) {
    reader.cancel([cancelReason1, cancelReason2]);
```
Should we return the promise obtained from reader.cancel(...) here?
Fixed. It turns out to be a bit more complicated since we need to return a promise even to the first person to cancel, not just the second.
Unless we think the first should fulfill ASAP and the second should signal? I will mention below so it doesn't get lost in this old diff.
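The fix being described (returning a promise even to the first canceller) can be sketched roughly as follows. This is a hedged sketch, not the actual reference-implementation code: `makeTeeCancel` and the shape of `reader` are assumptions; only the `canceled1`/`canceled2`/`cancelReason1`/`cancelReason2` names come from the diff excerpt above.

```javascript
// Sketch: both branches' cancel() calls return one shared promise that
// settles only when (and how) the underlying source's cancel settles.
function makeTeeCancel(reader) {
  let canceled1 = false, canceled2 = false;
  let cancelReason1, cancelReason2;
  let resolvePromise, rejectPromise;
  const cancelPromise = new Promise((resolve, reject) => {
    resolvePromise = resolve;
    rejectPromise = reject;
  });

  function maybeCancelSource() {
    // only once BOTH branches cancel do we cancel the source,
    // and we forward the result to everyone who is waiting
    if (canceled1 && canceled2) {
      reader.cancel([cancelReason1, cancelReason2])
            .then(resolvePromise, rejectPromise);
    }
  }

  return [
    reason => { canceled1 = true; cancelReason1 = reason; maybeCancelSource(); return cancelPromise; },
    reason => { canceled2 = true; cancelReason2 = reason; maybeCancelSource(); return cancelPromise; },
  ];
}
```

The key point is that the first canceller also gets `cancelPromise`, which stays pending until the second cancel arrives.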
Sort of. It's consumed because it's meant to be general functionality that can work with any stream. It's true that if you created a stream implementation with a focus on being clone-able, you could make it work without consumption. For example, you could create a stream implementation that keeps track of all clones that have been produced, and every time a chunk is enqueued into the stream by the underlying source, it also enqueues it into the clones. (Or, for byte streams, each read---from either the original stream or the clone---also copies the data into an internal buffer. You then ref-count which memory regions have been consumed by which subset of [original stream, ...clones], and only release a region when all clones have consumed it.)

This seems a lot more complex, though. In particular, we either have certain streams being complex and cloneable and others not, or we add complexity to all streams to track their clonability. Compared to a simple operation that can work with any stream through its preexisting interface, I'm not sure it gives much. What is the advantage?
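The "stream that tracks its clones" idea can be made concrete with a minimal sketch. This is a hypothetical class (not part of the Streams spec or reference implementation): every enqueue is mirrored into each clone's queue, with a structured clone to avoid shared mutable state.

```javascript
// Minimal sketch of a clone-tracking stream: clones are first-class
// streams, and enqueues fan out to all of them.
class CloneTrackingStream {
  constructor() {
    this.queue = [];
    this.clones = [];
  }
  clone() {
    const c = new CloneTrackingStream();
    // a new clone starts with a copy of whatever is still unread here
    c.queue = this.queue.map(chunk => structuredClone(chunk));
    this.clones.push(c);
    return c;
  }
  enqueue(chunk) {
    this.queue.push(chunk);
    // mirror into every clone (recursively covers clones of clones)
    for (const c of this.clones) c.enqueue(structuredClone(chunk));
  }
  read() {
    return this.queue.shift();
  }
}
```

This illustrates the complexity trade-off in the comment above: either only streams like this one are cloneable, or every stream carries this extra machinery.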
To keep the original stream available and its state unchanged even after teeing, we need to add one more layer in addition to the stream and the reader. Introduction of the new layer doesn't necessarily require definition of a new class, but we need to at least clearly define a concept to explain the new layer. Like … For … But once there's a stream on it, the stream doesn't fit this approach well. It's unfortunate, but we shouldn't let that fact drag us into introducing unnecessary complexity.
Right. To be clear, the impact on … is:

```js
var s = res.body;
var res2 = res.clone();
var s1 = res.body;
var s2 = res2.body;

assert(s !== s1);
assert(s !== s2);
assert(s1 !== s2);

// s is locked (cannot be really used)
// s1, s2 are unlocked and usable
```

I think it is totally OK for …
An even more aggressive approach is making …
By "leave it unchanged" in the last post, I meant keep …
I basically agree with the idea in Domenic's #302 (comment) so far.
@annevk agreed. One thing I forgot to mention: I think people who are "response focused" will likely not touch body, and will do …
Anne: it's just a thought experiment. It's not realistic, right. Sorry for the confusion; I wanted to clarify where this mismatch came from. Domenic: nice justification. It's not hard to expect that users of …
New revision up that is starting to get to be what we might put in the real spec. Including tests, taking care of many edge cases, and using internal methods to avoid accessing public APIs. (I was tempted to try to use only public APIs and make this generically applicable to any ReadableStream. But I think we can save that for later if there is a good use case. I have proven that you can write it using only public APIs, and that is enough to satisfy my no-magic impulse for now.)
One question @yutakahirano brought up in #302 (comment) is a good one. Consider the following situation:

```js
const rs = new ReadableStream({
  cancel() {
    throw new Error('wheee');
  }
});

const [branch1, branch2] = rs.tee();

branch1.cancel().then(/* what should happen here? (1) */);
branch2.cancel().catch(/* what should happen here? (2) */);
```

The semantics with regard to cancel are that only if you cancel both branches will we communicate back to the original stream that it should be cancelled, and perform an action. That action might fail, as shown here.

**First cancel fulfills, second rejects**

Maybe …

**Both cancels reject**

Maybe they should both reject: that is, maybe we should not resolve or reject either promise until the ref count has decreased to zero, and we can tell whether the original underlying source cancel succeeded or failed. One possibly-unintuitive consequence of this is that if you only ever cancel branch1, then …

**Both cancels fulfill**

Maybe they should both fulfill: maybe we treat the success or failure of the actual underlying source cancellation as irrelevant, and say that the promise returned by … The downside of this is that you lose any information about errors canceling the original underlying source.

It's important to keep in mind that this is a pretty small point. Many consumers will not care if canceling fails (or succeeds). The whole point of cancel() is to communicate "I don't care about this stream anymore," so it's kind of rare that you care about how well your not-caring went.

But, we do need to pick one. I think I have a slight preference for both reject, but could also go with one fulfills, second one rejects. Both fulfills seems bad.
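As a data point from today's perspective: the both-cancels-reject option is what can be observed from the built-in `tee()` in modern runtimes (ReadableStream is a global in Node 18+ and in browsers). This is a check of current behavior, not part of the draft under discussion:

```javascript
// Running the same scenario against the built-in tee(): neither promise
// settles until BOTH branches have canceled, and the failing underlying
// cancel() is then reported through both returned promises.
const rs = new ReadableStream({
  cancel() {
    throw new Error('wheee');
  }
});
const [branch1, branch2] = rs.tee();

const p1 = branch1.cancel();
const p2 = branch2.cancel();
p1.catch(err => console.log('branch1 cancel rejected:', err.message));
p2.catch(err => console.log('branch2 cancel rejected:', err.message));
```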
I think this is really bad and unexpected. Why can't body be a wrapper around a concrete stream? Then clone() would swap out the underlying stream, but the external users of .body would not be able to observe it. |
To clarify, I don't think anyone expects …
It seems to me we should be able to wrap the underlying source in a TeeSource that does this work. The original stream just replaces its source with the new TeeSource and hands the TeeSource to the new stream. The new stream then attaches itself to the TeeSource as well. Any new tee() requests do the same thing (without creating a new TeeSource wrapper again, ideally). But I haven't read the spec recently enough to write this code, so maybe I am missing something. This would keep the mutation non-observable to the immediate user of the original stream.
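The TeeSource idea can be sketched very roughly as follows. This is a hypothetical shape (the class, `attach`, and the synchronous `pull` are simplifications for illustration, not the spec's underlying source interface): the real source is wrapped once, and each pull fans the chunk out to every attached stream's queue.

```javascript
// Rough sketch of a TeeSource wrapper: one pull from the inner source
// feeds the queues of all streams attached to the wrapper.
class TeeSource {
  constructor(innerSource) {
    this.innerSource = innerSource;
    this.sinks = []; // one queue per attached stream
  }
  attach() {
    const queue = [];
    this.sinks.push(queue);
    return queue;
  }
  pull() {
    const chunk = this.innerSource.pull();
    for (const queue of this.sinks) queue.push(chunk);
  }
}
```

In the proposal above, the original stream would swap its source for the `TeeSource` and keep reading through its own attached queue, so the swap is not observable to its consumer.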
It's an interesting idea. The wrapper has … methods but doesn't have …
I don't want to create a whole new type of stream just to satisfy an esoteric equality test that nobody should be doing in practice. What is the real hazard here?
Here is a broader take on the issue, which maybe will be helpful. I think we have simply run into an ordering issue with our design process. While designing … Now that we have streams, it becomes clear that clone-while-leaving-unconsumed is not a very natural thing to do with streams as specified. You can probably make it work, but doing so would be invasive to the stream implementation. Whereas teeing is perfectly natural, and falls out of basic usage patterns of the public stream APIs.

The question at hand, I think, is whether we believe clone-while-leaving-unconsumed is a core use case for streams as a primitive. If it is, we should do appropriate re-design of stream internals and algorithms to support it. There are several possibilities here mentioned already: @tyoshino's three-tiered approach; the stream-wrapper idea; the hooking-and-unhooking underlying source path; or the all-streams-keep-track-of-clones-internally path.

But my perspective is that, given how natural teeing is (and not just here, but in other streaming or reactive or iterator APIs), and how unnatural clone-without-consuming is, it really isn't worth the added complexity. Basically, we goofed a bit by choosing …
Ok. I think you convinced me. The zero-copy clone of the stream is an optimization that can be added later if there is a perf need for it.
In regards to Response.body changing in Response.clone(), I have a question: does reading .body set the bodyUsed flag to prevent Response.clone()? If "yes", then I'm ok with .body being swapped to a different stream. If "no", then how do we prevent something like this? …
@wanderview great question. Fortunately I think we took care of it with the reader design :). To read from a stream, you need to acquire a reader. If anyone else wants to read from the stream---say, the clone procedure---they will need to also get a reader. But, they cannot get a reader at the same time as you. So, you need to release your reader first. AND! You're not allowed to release your reader until all of the read-promises you've asked the reader to create have settled. So, the example is more like:

```js
var reader = resp.body.getReader();
var readPromise = reader.read();

// this will throw, since it needs to get a reader, but the stream is already locked
try {
  var resp2 = resp.clone();
} catch (e) { }

// this will also throw, since we haven't waited for readPromise to settle
try {
  reader.releaseLock();
} catch (e) { }

// this will work:
readPromise.then(_ => {
  reader.releaseLock();
  var resp2 = resp.clone();
});
```

Make sense?
And if they try to use …?

Sounds good. Thanks!
Once you release the reader it acts like a closed stream, is the idea. (We could make it act like an errored stream, but in the rare case that you care, closed seems more likely to go down the right path.) |
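The locking half of this exchange can be demonstrated with a bare ReadableStream in today's runtimes (no Response needed); note that the "can't release while reads are pending" rule discussed above was later relaxed in the shipped spec, so only the lock-exclusivity part is shown here.

```javascript
// Only one reader may hold the lock at a time; releasing it lets a
// new reader be acquired (ReadableStream is global in Node 18+).
const rs = new ReadableStream({
  start(c) { c.enqueue('hi'); c.close(); }
});

const reader = rs.getReader();
console.log(rs.locked); // true: no one else can get a reader now

let secondReaderThrew = false;
try {
  rs.getReader(); // throws TypeError: the stream is already locked
} catch (e) {
  secondReaderThrew = true;
}

reader.releaseLock();
console.log(rs.locked); // false: the lock is available again
const reader2 = rs.getReader();
```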
In regard to this: …

I think it's important to make it work such that a library consuming a stream: …

So, from an interface point of view, I think this should work exactly how normal stream cancellation works. If cancel() normally rejects, then it should reject. If cancel() normally fulfills, then it should fulfill.
Hmm, yeah. I think those are good principles. They point me to both-cancels-fulfill or both-cancels-reject. Since both-cancels-fulfill loses information, I think that means we want both-cancels-reject. Although, even one-fulfills-one-rejects wouldn't necessarily violate any of those constraints. It would tell you whether you were the first to cancel, but not whether you were the "left" side of the tee or the right side. But, even that is a kind of information leakage between the two branches, so yeah, let's not do that. (BTW my mismatched .then + .catch was not intentional, both should be .then) |
If the consumer of either branch really wants to interact with the original stream via …, I basically think such needs should be taken care of by more complex custom tee-ing code. But if we're to build some helper for convenience (so that no change on the consumer code is necessary), I'd propose something like the following: …
I think that is the right approach, especially for now. I do want to decide on a default, though---both for other specs to use (like …). Reading your …
For the tee algorithm being drafted in #302, it's important to have an abstract operation CloseReadableStream that does all the things that ReadableStreamController.prototype.close does. So, we factor that out. The old CloseReadableStream is renamed to "FinishClosingReadableStream".
Again, needed for formalizing the tee algorithm from #302.
Superseded by #311!
First shot at addressing #271. Want to get these up early for review before formalizing them.

For `TeeReadableStream`: as chunks are pulled from `branch1` or `branch2` (triggering their underlying source's `pull` method), they in turn "pull" from `stream` (using `reader.read()`). Once a chunk has been pulled from `stream`, it gets enqueued in both branches---no matter which branch initiated the pull. This should preserve backpressure, and also has the "OOM" property discussed in #271 ("Define 'tee'ing a stream"), such that if the consumer for `branch1` is slow and the consumer for `branch2` is fast, then `branch1` ends up with a lot of unconsumed chunks in its queue.

For `SpeculativeTeeReadableByteStream`: … `read()` will happen. The underlying sources for `branch1` and `branch2` both have `pull` methods like in `TeeReadableStream`, which might be used if someone goes for an auto-reader on either of them. But they also have `read` methods, such that if someone does e.g. `reader1.read(view1)`, it will call through to `us1.read(view1)`, which does two things: (1) `reader.read(view1)`, pulling a chunk from `stream` and using it to fulfill the direct read call; (2) enqueue the view into the queue for `branch2`. Thus, later, if someone does `reader2.read(view2)`, there will be a copy/transfer from `branch2`'s queue into `view2`.

All of these include a `clone` boolean argument which, if present, will result in structured clones happening for anything enqueued. This is important if we envision the two branches being consumed on different threads, e.g. as in #244 or #276.

Would love a review to validate!
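The `TeeReadableStream` shape sketched above can be condensed into a few lines against today's ReadableStream API. This is a hedged sketch, not the PR's actual algorithm: `teeReadableStream` and its `shouldClone` option are names invented here, and error/cancel handling is omitted. It shows the core property that a pull from either branch reads once from the source and enqueues into both.

```javascript
// Condensed sketch of tee via the public reader API (Node 18+ globals):
// whichever branch pulls, the chunk lands in both branches' queues.
function teeReadableStream(stream, { shouldClone = false } = {}) {
  const reader = stream.getReader();
  let pulling = null; // dedupes concurrent pulls from the two branches

  function pullFromSource(c1, c2) {
    if (pulling) return pulling;
    pulling = reader.read().then(({ value, done }) => {
      pulling = null;
      if (done) {
        c1.close();
        c2.close();
        return;
      }
      // enqueue into BOTH branches, no matter who initiated the pull
      c1.enqueue(value);
      c2.enqueue(shouldClone ? structuredClone(value) : value);
    });
    return pulling;
  }

  let ctrl1, ctrl2;
  const branch1 = new ReadableStream({
    start(c) { ctrl1 = c; },
    pull() { return pullFromSource(ctrl1, ctrl2); }
  });
  const branch2 = new ReadableStream({
    start(c) { ctrl2 = c; },
    pull() { return pullFromSource(ctrl1, ctrl2); }
  });
  return [branch1, branch2];
}
```

If only `branch2`'s consumer reads, chunks accumulate in `branch1`'s queue, which is exactly the "OOM" property from #271.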