Blocking buffer until experiment #864

akarnokd · 2014-02-12T13:22:46Z

This is a solution to the time gap problem for #844.

I've added an subscribeOn overload where the user can explicitly request a buffering behavior. In addition, SubscribeOn checks the type of the Observable and enters buffering mode for GroupedObservable and PublishSubject. I think these code options should be mutually exclusive:

either we only check for Observable type, but then new kinds of observables or hidden observables won't work,
or ask the programmer in the documentation/tutorial to explicitly request buffering in certain operator compositions.

I personally favor option 2).

A drawback is that this blocking subscribeOn deadlocks on pools with a single thread. We can, of course, check for Trampoline, Test and Immediate schedulers, but not schedulers created via Schedulers.executor, or the computation scheduler on a single-core machine.

cloudbees-pull-request-builder · 2014-02-12T13:29:42Z

RxJava-pull-requests #792 FAILURE
Looks like there's a problem with this pull request

cloudbees-pull-request-builder · 2014-02-12T13:50:32Z

RxJava-pull-requests #793 SUCCESS
This pull request looks good

akarnokd · 2014-02-12T13:55:52Z

Test testRepeatTakeWithSubscribeOn passed locally.

I guess there is a race issue with repeat() as it schedules a new repeat after take unsubscribes. I guess adding a child.isUnsubscribed test before L85 should do the trick.

benjchristensen · 2014-02-12T18:01:44Z

Reviewing code ... considering the drawbacks, what do you think is worse, possibility of (deterministic?) deadlock with this solution? or possibility of non-deterministic data-loss when using subscribeOn on hot Observables?

Is the deadlock deterministic (it would always happen in dev so it gets found) or could it happen if a Scheduler becomes saturated, or the buffer size is higher than available threads?

akarnokd · 2014-02-12T18:08:16Z

Non-deterministic data loss is definitely worse.

Deadlock due to the computation scheduler being single threaded is worrying, but might affect other concurrent operators as well regardless. I think the documentation could mention that if pushback or blocking behavior is expected, one should use NewThread or IO scheduler for the unblocking operation.

benjchristensen · 2014-02-12T18:10:33Z

Non-deterministic data loss is definitely worse.

I agree, so let's continue down this path :-) I'll review through your code in a bit.

benjchristensen · 2014-02-12T21:50:34Z

First pass through reading this code it seems good, and mature enough to handle the different scenarios we could throw against it. I'm going to spend some more time playing but nothing right now suggests that this should not be the path we take.

benjchristensen · 2014-02-13T00:14:53Z

rxjava-core/src/main/java/rx/Observable.java

+     *         on the specified {@link Scheduler}
+     * @see <a href="https://github.com/Netflix/RxJava/wiki/Observable-Utility-Operators#wiki-subscribeon">RxJava Wiki: subscribeOn()</a>
+     */
+    public final Observable<T> subscribeOn(Scheduler scheduler, int bufferSize) {


This overload should help mitigate issues when subscribing to a PublishSubject (and derivatives such as GroupedObservable in operator groupBy) and events fired between the original and actual subscriptions are lost.

That doesn't seem to need this overload since we special-case those two Observable instances even if bufferSize is not passed in and we say false for dontLoseEvents.

Ah, I see ... the automatic has an unbounded buffer so it never blocks. This overload allows for blocking as well.

Are you suggesting subscribeOn by itself never use the blocking form but any operator implementation that uses subscribeOn would choose to do so?

In that case, should subscribeOn do any buffering by default or only on demand?

My suggestion is that subscribeOn(Scheduler) never blocks with any source and events may be lost, and subscribeOn(Scheduler, int) does not lose events and may block depending on the buffer size.

benjchristensen · 2014-02-13T05:24:39Z

I added some things on top of this at #869.

benjchristensen · 2014-02-14T03:35:28Z

Work on this is picked up in #869

akarnokd added 2 commits February 12, 2014 11:17

Proposed solution to the time gap, using unbounded buffering.

dc4ee52

Added bounded buffering capability to SubscribeOn

dade7e1

Check child unsubscription status more eagerly.

5209ab1

benjchristensen reviewed Feb 13, 2014
View reviewed changes

benjchristensen mentioned this pull request Feb 13, 2014

subscribeOn + groupBy #869

Merged

benjchristensen closed this Feb 14, 2014

akarnokd deleted the BlockingBufferUntilExperiment branch May 6, 2014 13:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Blocking buffer until experiment #864

Blocking buffer until experiment #864

akarnokd commented Feb 12, 2014

cloudbees-pull-request-builder commented Feb 12, 2014

cloudbees-pull-request-builder commented Feb 12, 2014

akarnokd commented Feb 12, 2014

benjchristensen commented Feb 12, 2014

akarnokd commented Feb 12, 2014

benjchristensen commented Feb 12, 2014

benjchristensen commented Feb 12, 2014

benjchristensen Feb 13, 2014

benjchristensen Feb 13, 2014

akarnokd Feb 13, 2014

benjchristensen commented Feb 13, 2014

benjchristensen commented Feb 14, 2014

Blocking buffer until experiment #864

Blocking buffer until experiment #864

Conversation

akarnokd commented Feb 12, 2014

cloudbees-pull-request-builder commented Feb 12, 2014

cloudbees-pull-request-builder commented Feb 12, 2014

akarnokd commented Feb 12, 2014

benjchristensen commented Feb 12, 2014

akarnokd commented Feb 12, 2014

benjchristensen commented Feb 12, 2014

benjchristensen commented Feb 12, 2014

benjchristensen Feb 13, 2014

Choose a reason for hiding this comment

benjchristensen Feb 13, 2014

Choose a reason for hiding this comment

akarnokd Feb 13, 2014

Choose a reason for hiding this comment

benjchristensen commented Feb 13, 2014

benjchristensen commented Feb 14, 2014