-repeat doesn't stop after being disposed of #94

joshaber · 2012-11-01T17:04:35Z

Noted in #93:

RACReplaySubject *subject = [RACReplaySubject subject];
[subject sendNext:@(1)];
[subject sendCompleted];
[[[subject repeat] take:1] subscribeNext:^(id x) { ... }];

Loops indefinitely.

joshaber · 2012-11-04T01:56:46Z

Ugh. So this is a pretty annoying one. Since the repeat and replay happen on the same thread, it gets stuck replaying and repeating before -repeat's ever returned a disposable for -take: to dispose of.

It might be a bit more obvious in this example:

RACSubscribable *subscribable = [RACSubscribable createSubscribable:^ RACDisposable * (id<RACSubscriber> subscriber) {
    [subscriber sendNext:@1];
    [subscriber sendCompleted];
    return nil;
}];

__block RACDisposable *disposable = [[subscribable repeat] subscribeNext:^(id _) {
    NSLog(@"disposable: %@", disposable);
    [disposable dispose];
}];

Disposable will always be nil so the repeat can't be stopped. It gets stuck in the infinite loop within the original subscribe to -repeat before the disposable's ever returned. Bleh.

We had a similar problem with the old implementation of generators. I'm not really sure how to solve this generally.

I think Rx solves this by saying, you can never know what scheduler results might be delivered on, unless you manually specify it. That way they can split the repeat and replay so that the disposable's returned before the repeat actually happens.

jspahrsummers · 2012-11-04T02:30:50Z

Maybe we could add a subscription variant that gives the disposable to the block? Seems like that might be generally useful anyways.

joshaber · 2012-11-04T04:22:27Z

If I understand you right, the tricky thing is that we don't have the important disposable until after the subscribable's didSubscribe block has returned... and didSubscribe never returns in cases like these.

Coneko · 2012-11-05T17:08:03Z

Generally speaking, are subscribers supposed to be able to dispose their own subscription?

joshaber · 2012-11-05T17:18:08Z

Absolutely, they really have to to be able to end potentially unending subscribables.

Coneko · 2012-11-05T17:39:15Z

What about timeliness? Is it enough if they're only able to dispose the subscription eventually and not immediately?

The general pattern used for implementing a lot of the methods on <RACSubscribable>, as in calling +createSubscribable, immediately sending values to the subscriber and only then returning the disposable, doesn't allow subscribers to dispose of the subscription until after all the initial values have been received. Especially noticeable on RACReplaySubject since it sends the whole replay before returning the disposable.

I was wondering if that behaviour was to be considered a bug too.

joshaber · 2012-11-05T18:06:53Z

Yes, I think that's a good summary of the heart of this bug.

Coneko · 2012-11-05T18:42:52Z

Oh ok, because I fixed -repeat by resubscribing asynchronously instead of synchronously, which avoids the buffer overflow, and gives the subscriber a chance to dispose of the subscription between a repeat and the next, guaranteeing the subscribable will stop sending stuff eventually, but of course it doesn't do anything to fix the underlying problem of being unable to unsubscribe during each repeat.

So it's just a band-aid, not a real fix. (https://gist.github.com/4019491)

joshaber · 2012-11-05T18:49:38Z

I think a generalized version of that is the best fix. But dispatch_get_current_queue is going away/already gone so we can't use it. +[NSOperationQueue currentQueue] doesn't seem to make strong enough guarantees for us.

Which takes us back to the Rx position of saying you can't depend on the queue it's in unless specified.

jspahrsummers · 2012-11-05T18:51:25Z

It's not generally safe to asynchronously dispatch to the current GCD queue, because it could be a temporary user-created queue that's going away. The block will still get executed (because GCD makes such a guarantee), but could result in crazy unpredictable behavior to the user.

Coneko · 2012-11-05T19:52:24Z

Right, GCD does say queued blocks retain the queue they're queued on.

You could implement a policy of "if you call RAC methods from a RACScheduler, stuff gets delivered on the calling scheduler, otherwise stuff gets delivered on the main thread".
At least then chaining -deliverOn: and -subscribeOn: will still work like it does now.

jspahrsummers · 2012-11-05T19:54:49Z

The "calling scheduler" is a nebulous concept, though. For an operation queue, that's +currentQueue. For a GCD queue, that's dispatch_get_current_queue. Those aren't interchangeable, since an operation queue may be backed by any number of GCD queues, and they're not required to stay the same.

joshaber · 2012-11-05T20:12:12Z

Exactly, that's why it's so awkward. We can still make guarantees about -deliverOn: and -subscribeOn:, but they become much more necessary.

Coneko · 2012-11-05T21:34:38Z

What I mean by "calling scheduler" is something like +currentQueue does. +currentQueue only returns a valid NSOperationQueue if you call it from code that was queued on a NSOperationQueue. Likewise, a +currentScheduler method would return a RACScheduler only if the code it was called from was scheduled on a RACScheduler in the first place.

The implementation could mix +currentQueue with associated objects and dispatch_queue_set_specific/dispatch_get_specific to get the current RACScheduler whether it used one or the other as backing. (Not sure how to solve the problem of clearing the reference to the RACScheduler without potentially getting blocks scheduled while it's in -dealloc)

The user would lose the ability to create new schedulers with arbitrary scheduling backing because of this.

Since all the scheduling of subscribables goes through RACScheduler anyway, that would help keep the scheduling predictable.

I'm not against explicitly specifying the scheduler on which to deliver, I just wouldn't want to have to specify it multiple times in a chain because each link of the chain can potentially reschedule on a different one.
At least, that's how I understand it would turn out if subscribers didn't give any guarantees about that.

jspahrsummers · 2012-11-25T21:55:32Z

@Coneko I think there are still problems with that idea, because it'd be possible to get into a case where one thread or queue is technically associated with multiple schedulers.

This is mostly an issue with concurrent GCD queues. For example, an operation queue RACScheduler might use a global GCD queue for execution (as an implementation detail). Blocks scheduled on the +backgroundScheduler shouldn't see the operation queue scheduler, and vice versa, but there's no way to ensure this with dispatch_set_queue_specific.

Coneko · 2012-11-25T22:15:44Z

Yes, the general idea isn't very robust, so the implementation has a lot of gotchas, but in your example the RACScheduler should never use a global GCD queue directly, instead it should use a private GCD queue, and set the target queue for it to the global GCD queue. That way dispatch_set_queue_specific can be called on a meaningful target.

I agree it's it's not perfect, and as I mentioned before it makes implementing custom RACSchedulers very error-prone, so it'll end up not being something a user of the framework is expected to be able to do.
It definitely brings more problems than it solves as long as the only real problem with the current implementation is with certain instances of -repeat.

jspahrsummers · 2012-11-25T22:17:07Z

That's a good point about target queues. I'm curious if some mix of deferred scheduling, operation queue scheduling, etc. could still result in an incorrect currentScheduler, though.

Coneko · 2012-11-26T01:13:32Z

I'd be backing my idea a lot more if unit testing these threading shenanigans reliably were possible.

jspahrsummers · 2012-11-28T02:09:22Z

The implementation could mix +currentQueue with associated objects and dispatch_queue_set_specific/dispatch_get_specific to get the current RACScheduler whether it used one or the other as backing. (Not sure how to solve the problem of clearing the reference to the RACScheduler without potentially getting blocks scheduled while it's in -dealloc)

I found out today, while working on #138, that dispatch_get_specific only reads from the current queue and its target queues. If you dispatch_sync from one targetless queue to another targetless queue, you won't be able to read specific data from the former in a block running on the latter.

This makes it just as broken as dispatch_get_current_queue, and kind of hamstrings anything we could do with RACScheduler.

Coneko · 2012-11-28T08:34:35Z

I understood dispatch_get_specific to work that way, but I didn't think it was a problem. It still falls under "only works if called from code running on a RACScheduler" clause right? After all dispatch_sync doesn't call the code from the scheduler's queue, it locks the queue and calls the code from another queue. You wouldn't expect it to work if you implemented something like that yourself.
It does mean you can't use it to implement RACScheduler, but not that it's broken from a caller's perspective.

jspahrsummers · 2012-11-28T20:33:43Z

I think that means we'll end up in a lot of cases where +[RACScheduler currentScheduler] is nil. In particular, we won't have a reliable +deferredScheduler.

jspahrsummers · 2012-11-30T21:22:14Z

I think the sanest way to resolve this would be to change the subscription API a bit. For example, if +createSignal: were changed to accept a subscriber and a block or pointer that would tell it if it were disposed, you could implement a pattern like the following:

return [RACSignal createSignal:^(id<RACSubscriber> subscriber, BOOL *stop) {
    while (YES) {
        if (*stop) break;
        [subscriber sendNext:RACUnit.defaultUnit];
    }

    [subscriber sendCompleted];
    return nil;
}];

Coneko · 2012-11-30T21:29:26Z

I think that's a very good idea. Regardless of how the thing with the schedulers evolves, having to do all that dispatching/scheduling just to implement even something as simple as +return: properly is unwieldy.

joshaber · 2012-11-30T21:50:48Z

@jspahrsummers I'm not sure how that'd solve the problem. You'd get stuck in the loop before you'd get access to the address of stop.

jspahrsummers · 2012-11-30T21:54:29Z

With an API like that available, you could also implement stuff like:

- (void)subscribeNext:(void (^)(id value, BOOL *stop))next error:(void (^)(NSError *))error completed:(void (^)(void))completed;

I could take a pass at it. I'm pretty sure it would solve this issue.

joshaber · 2012-11-30T22:02:19Z

But now we'd have two different ways of stopping a signal: with a disposable or with *stop = YES. Also, this problem is solved by the new scheduler work. I'm not sure what this would give us besides an uglier API.

jspahrsummers · 2012-11-30T22:05:53Z

This API is uglier, but the new scheduler work feels like a hack to work around the timing of how disposables are created. Why not just solve when they're created/made available?

The idea of setting a BOOL *stop could also be encapsulated in a disposable:

- (RACDisposable *)subscribeNext:(void (^)(id, RACDisposable *))next;

Coneko · 2012-11-30T22:24:24Z

I don't think adding new parameters to the subscription methods is right. Subscriptions should work correctly regardless of how they're disposed of, you can't give the user two ways of disposing them, and then have them behave differently.

Rather, change the +createSignal: method to take two blocks. One that returns a disposable, and one that creates the subscription. Call the one that returns the disposable first, return the disposable, then... oops.

I guess that would go back to the scheduler fix anyway.

Still, it would hide the scheduling complexity in the internal implementation, leaving more elegant APIs both for the user that creates the signal and the user that consumes it.

joshaber · 2012-11-30T22:27:03Z

I don't see how any of these ideas would work.

You can't create/return a disposable without subscribing, and as soon as you subscribe you're going down the rabbit hole of infinite subscriptions. The re-subscribe has to be deferred. I don't see any other way.

jspahrsummers · 2012-11-30T22:30:21Z

Maybe it was a mistake to bring up concrete APIs before really explaining my thought process.

The key I'm trying to communicate is this: you can return a disposable without subscribing, as long as:

Infinite signals can watch its status.
Subscribers can access it before the subscription invocation actually completes.

That's what the BOOL *stop argument was encapsulating, but it can be done in other ways too.

Coneko · 2012-11-30T22:35:53Z

What I meant before was in fact returning the disposable immediately, but subscribing deferred to ensure the caller received the disposable in the meanwhile and was able to dispose it if needed.

@jspahrsummers : I agree it would work, I just think the API would be really ugly if all the methods that currently return RACDisposable * would have to accept a RACDisposable ** argument instead.

joshaber · 2012-11-30T22:37:08Z

Again, this seems to just make the API uglier for a problem that can be solved other ways.

jspahrsummers · 2012-11-30T22:38:47Z

Alright, well, I'm willing to give deferred subscription a shot. I just think it's going to be really surprising sometimes; for example, what happens to RACAbleWithStart and code dependent upon those values arriving immediately?

joshaber · 2012-11-30T22:39:57Z

@jspahrsummers That's exactly what +subscriptionScheduler is solving. Since that (should) be on the main queue, it'd start immediately like it always has.

jspahrsummers · 2012-11-30T22:41:00Z

👍

In order to solve issue #94, it's necessary for the +iterativeScheduler to enqueue the block, return full control to the currently-scheduled block, and then let that block complete. Otherwise, we may still not get the disposable we need.

thenikso · 2012-12-07T22:01:10Z

👍

Fix CocoaPods spec

joshaber mentioned this issue Nov 1, 2012

Replay subject with asMaybes and take #93

Closed

Coneko mentioned this issue Nov 5, 2012

Subject to support producer-consumer pattern. #101

Closed

Coneko mentioned this issue Nov 9, 2012

<RACStream> monad, RACSequence #92

Merged

jspahrsummers mentioned this issue Nov 26, 2012

Move more methods into <RACStream> #135

Merged

jspahrsummers added a commit that referenced this issue Nov 26, 2012

Workaround for #94 specifically for the case of -bind:

d24b995

joshaber mentioned this issue Nov 26, 2012

Concurrent RACSchedulers can result in delivery race conditions #136

Closed

jspahrsummers mentioned this issue Nov 28, 2012

Fix RACSubscriber, RACReplaySubject thread safety; remove RACAsyncSubject #147

Merged

joshaber mentioned this issue Nov 29, 2012

Scheduler improvements #150

Closed

ghost assigned jspahrsummers Dec 2, 2012

jspahrsummers mentioned this issue Dec 5, 2012

Remove problematic schedulers, add -scheduleRecursiveBlock: #169

Merged

joshaber closed this as completed Dec 7, 2012

Coneko mentioned this issue Nov 7, 2013

+create: and a compound disposable on <RACSubscriber> #917

Merged

8 tasks

jifang mentioned this issue Feb 4, 2016

Deadlock of NSLock on same(?) thread #2707

Closed

andersio pushed a commit that referenced this issue Sep 22, 2016

Merge pull request #94 from dmcrodrigues/dr/fix-cocoapods-spec

31ea996

Fix CocoaPods spec

-repeat doesn't stop after being disposed of #94

-repeat doesn't stop after being disposed of #94

Comments

joshaber commented Nov 1, 2012

joshaber commented Nov 4, 2012

jspahrsummers commented Nov 4, 2012

joshaber commented Nov 4, 2012

Coneko commented Nov 5, 2012

joshaber commented Nov 5, 2012

Coneko commented Nov 5, 2012

joshaber commented Nov 5, 2012

Coneko commented Nov 5, 2012

joshaber commented Nov 5, 2012

jspahrsummers commented Nov 5, 2012

Coneko commented Nov 5, 2012

jspahrsummers commented Nov 5, 2012

joshaber commented Nov 5, 2012

Coneko commented Nov 5, 2012

jspahrsummers commented Nov 25, 2012

Coneko commented Nov 25, 2012

jspahrsummers commented Nov 25, 2012

Coneko commented Nov 26, 2012

jspahrsummers commented Nov 28, 2012

Coneko commented Nov 28, 2012

jspahrsummers commented Nov 28, 2012

jspahrsummers commented Nov 30, 2012

Coneko commented Nov 30, 2012

joshaber commented Nov 30, 2012

jspahrsummers commented Nov 30, 2012

joshaber commented Nov 30, 2012

jspahrsummers commented Nov 30, 2012

Coneko commented Nov 30, 2012

joshaber commented Nov 30, 2012

jspahrsummers commented Nov 30, 2012

Coneko commented Nov 30, 2012

joshaber commented Nov 30, 2012

jspahrsummers commented Nov 30, 2012

joshaber commented Nov 30, 2012

jspahrsummers commented Nov 30, 2012

thenikso commented Dec 7, 2012