
Reactive Streams impl #1282

Merged
merged 30 commits into helidon-io:master on Feb 24, 2020
Conversation

danielkec
Contributor

@danielkec danielkec commented Jan 9, 2020

Reactive Streams Operators

Reactive Streams implementation based on existing Helidon Common Reactive Library compliant with
Reactive streams for JVM.

Part of the implementation of /issues/1206

@danielkec danielkec self-assigned this Jan 9, 2020
@danielkec danielkec added this to the 2.0.0 milestone Jan 9, 2020
@danielkec danielkec added the enhancement New feature or request label Jan 9, 2020
@danielkec
Contributor Author

[image]

@danielkec
Contributor Author

Rebased on @tomas-langer 's native image changes in the master, sorry for force-push

@olotenko olotenko left a comment

Why is this declared R if onNext expects it to be X? (I know, this change is just adapting old to new, but still)

@olotenko olotenko left a comment

Atomic properties of these are not used. It's better to declare them volatile. (But I am not even convinced they have to be volatile - the onSubscribe/onNext/onComplete/onError protocol is single-threaded)

@olotenko olotenko left a comment

Hybrid* may be better expressed as an interface that implements both, with mutually-recursive default methods, then from* static methods construct concrete implementations that override either half of those mutually-recursive methods. This way you don't need to always check for what type of processor is set.
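olotenko's suggestion can be sketched with two toy mirrored APIs standing in for the real java.util.concurrent.Flow and org.reactivestreams interfaces. All names below (OldApi, NewApi, Hybrid, fromOld, fromNew) are hypothetical illustrations, not Helidon's actual Hybrid* types:

```java
// Two mirrored single-method APIs standing in for the real dual interfaces.
interface OldApi<T> {
    void onNext(T item);
}

interface NewApi<T> {
    void next(T item);
}

// The hybrid extends both, with mutually-recursive default methods: each
// half delegates to the other, so a concrete implementation only needs to
// override one of them - no runtime check for which kind of processor is set.
interface Hybrid<T> extends OldApi<T>, NewApi<T> {
    @Override
    default void onNext(T item) {
        next(item);
    }

    @Override
    default void next(T item) {
        onNext(item);
    }

    // from* factories build concrete instances that override exactly one
    // half of the recursion, which breaks the mutual cycle.
    static <T> Hybrid<T> fromOld(OldApi<T> delegate) {
        return new Hybrid<T>() {
            @Override
            public void onNext(T item) {
                delegate.onNext(item);
            }
        };
    }

    static <T> Hybrid<T> fromNew(NewApi<T> delegate) {
        return new Hybrid<T>() {
            @Override
            public void next(T item) {
                delegate.next(item);
            }
        };
    }
}

public class HybridDemo {
    public static void main(String[] args) {
        StringBuilder seen = new StringBuilder();
        // Wrap an OldApi; callers may still use either entry point.
        Hybrid<String> hybrid = Hybrid.fromOld(item -> seen.append(item).append(';'));
        hybrid.next("a");    // routed through the default method to onNext
        hybrid.onNext("b");  // direct
        System.out.println(seen); // a;b;
    }
}
```

Because each from* factory overrides one half of the mutual recursion, the other entry point always routes into the overridden half, so the wrapped delegate never needs an instanceof check.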

@danielkec
Contributor Author

@olotenko

Why is this declared R if onNext expects it to be X? (I know, this change is just adapting old to new, but still)

I can't see which file the comment is pointing to, but I guess this is about FlatMapProcessor; it's a fix of the previous PR #1260, and the comment is about its different/inner subscriber:

    public InnerSubscriber<? super X> executeMapper(U item) {
        ...
    }

    private class InnerSubscriber<R> implements Flow.Subscriber<R> {
        ...
        public void onNext(R o) {
            Objects.requireNonNull(o);
            MultiFlatMapProcessor.this.subscriber.onNext((X) o);
            ...

@olotenko

Right, that's the place. Note that X is still available in onNext of the InnerSubscriber<R>, so that onNext knows what X is and expects the item to be of type X. So it seems more suitable to declare it as implementing Flow.Subscriber<X> instead of introducing a new type parameter R.

Mind you, I am not blocking the merge, just suggesting what may be a good idea to review as a future improvement.
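The suggested signature change, sketched on a toy outer class (FlatMapSketch and its downstream list are illustrative stand-ins, not the actual MultiFlatMapProcessor): the inner subscriber implements Flow.Subscriber<X> against the enclosing type parameter, which removes the unchecked (X) cast.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Objects;
import java.util.concurrent.Flow;

public class FlatMapSketch<X> {
    final List<X> downstream = new ArrayList<>();

    // The inner class uses the enclosing X directly instead of a fresh R,
    // so onNext needs no unchecked (X) cast.
    private class InnerSubscriber implements Flow.Subscriber<X> {
        @Override
        public void onSubscribe(Flow.Subscription subscription) {
            subscription.request(Long.MAX_VALUE);
        }

        @Override
        public void onNext(X item) {
            Objects.requireNonNull(item);
            downstream.add(item); // previously: subscriber.onNext((X) o)
        }

        @Override
        public void onError(Throwable throwable) {
        }

        @Override
        public void onComplete() {
        }
    }

    public static void main(String[] args) {
        FlatMapSketch<String> outer = new FlatMapSketch<>();
        Flow.Subscriber<String> inner = outer.new InnerSubscriber();
        inner.onNext("hello");
        System.out.println(outer.downstream); // [hello]
    }
}
```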

@danielkec
Contributor Author

@olotenko Thanks a lot, Hybrids with default methods are a great idea, why didn't I think of it! Also thanks for the generics in the flatMap.
You are right about the over-use of Atomic*, but I wouldn't get rid of the volatiles in the processors so fast. It's true that by the spec some of the signals must be executed serially, but reactive streams can incorporate third-party publishers/subscribers/processors into the stream, and those can be implemented in "various" ways. It seems better to be defensive in this case.

@danielkec
Contributor Author

Apologies for the force push, I needed to rebase on the shrinkwrap upgrade in master.

@danielkec
Contributor Author

This is not what has been discussed by email.

This is a nonsensical implementation of that rule in that spec.

It's just a quick fix to pass the TCK tests so we are able to move forward, not the final solution.

danielkec added a commit to danielkec/helidon that referenced this pull request Feb 3, 2020
Signed-off-by: Daniel Kec <daniel.kec@oracle.com>
@danielkec
Contributor Author

The implementation is not what has been discussed.

Please, split into smaller commits: one for changes to BaseProcessor, others for other changes.

60739a8#diff-ca92013a99c748b99a9e08660b9dea17R99 - racy. This should have been called only when this.subscription has been set.

It is:

    private void tryOnSubscribe() {
        if (Objects.nonNull(subscription) && subscriber.tryOnSubscribe(this)) {
            if (done) {
                tryComplete();
            }
        }
    }

@tomas-langer
Member

Hi Alex, is there any blocking issue that would prevent us from merging this pull request, or can we create a follow up issue (or issues) to fix some of your comments?
We need to move forward with the MP messaging work that depends on these changes.

@akarnokd
Collaborator

Why did you implement operators as Flow.Processors? Implementing one implies that you either want to multicast events to any number of downstream consumers or have to ensure there is at most one downstream consumer that can subscribe.

@danielkec
Contributor Author

danielkec commented Feb 19, 2020

Why did you implement operators as Flow.Processors? Implementing one implies that you either want to multicast events to any number of downstream consumers or have to ensure there is at most one downstream consumer that can subscribe.

Hi @akarnokd, sure, a second subscribe signals onError with an IllegalStateException to the second subscriber, which is initialized with an empty subscription.

EDIT: wow thanks a lot, totally missed that!
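The guard described above, sketched as a minimal self-contained publisher (class and field names are hypothetical, not Helidon's BaseProcessor): the second subscriber receives an empty, no-op subscription followed by onError with an IllegalStateException.

```java
import java.util.concurrent.Flow;
import java.util.concurrent.atomic.AtomicBoolean;

public class SingleSubscriberPublisher<T> implements Flow.Publisher<T> {
    private final AtomicBoolean subscribed = new AtomicBoolean();

    // Empty subscription: both signals are no-ops.
    private static final Flow.Subscription EMPTY = new Flow.Subscription() {
        @Override public void request(long n) { }
        @Override public void cancel() { }
    };

    @Override
    public void subscribe(Flow.Subscriber<? super T> subscriber) {
        if (subscribed.compareAndSet(false, true)) {
            subscriber.onSubscribe(EMPTY); // the real code wires an actual subscription here
        } else {
            // Late subscriber: per rule 1.9/2.12 spirit, onSubscribe first,
            // then reject with onError.
            subscriber.onSubscribe(EMPTY);
            subscriber.onError(new IllegalStateException("Only one subscriber allowed"));
        }
    }

    public static void main(String[] args) {
        SingleSubscriberPublisher<String> publisher = new SingleSubscriberPublisher<>();
        StringBuilder log = new StringBuilder();
        Flow.Subscriber<String> sub = new Flow.Subscriber<>() {
            @Override public void onSubscribe(Flow.Subscription s) { log.append("sub;"); }
            @Override public void onNext(String item) { }
            @Override public void onError(Throwable t) { log.append(t.getClass().getSimpleName()); }
            @Override public void onComplete() { }
        };
        publisher.subscribe(sub);
        publisher.subscribe(sub); // rejected
        System.out.println(log); // sub;sub;IllegalStateException
    }
}
```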

@akarnokd
Collaborator

Yes, I saw those.

Still, I don't see why the operators should be implemented as Flow.Processors. Do you intend to drive such chains via onNext calls imperatively somewhere?

With a processor chain, you trigger a subscribe() storm while you are still assembling maps and filters, which could trigger side effects, resource utilization or even failure before you are ready to consume the whole sequence with a Flow.Subscriber. This is similar to how composing CompletionStage operators can actually race with the values in each of those stages. In addition, you are not allowed to drive individual Flow.Processors in a chain externally, because that could violate the serial requirement of the onXXX calls.

You can simply have a chain of Flow.Publishers as operators and, if necessary, have the topmost source be a Flow.Processor for imperative item emission.

@akarnokd
Collaborator

If you need one example of why a chain of Flow.Processors doesn't work: repeating or retrying a chain is not possible, because the intermediate Flow.Processors will be in a terminal state. You'd have to recreate the entire chain from scratch with fresh Flow.Processors to allow a new run.

With Flow.Publishers, the operators hold only the blueprint of the chain, so a retry due to an error is straightforward and requires no manual chain recreation.
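The publisher-chain style akarnokd describes can be sketched as follows; MapPublisher and the demo source are illustrative, not Helidon's operators. The key property is that each subscribe() creates fresh per-subscriber state, so the same chain can be re-subscribed (e.g. by a retry) without rebuilding it:

```java
import java.util.concurrent.Flow;
import java.util.function.Function;

// Operators as Flow.Publishers: MapPublisher holds only the blueprint
// (upstream + mapper) and builds a fresh adapter per subscribe(), so the
// chain has no shared terminal state. Illustrative, not Helidon's code.
final class MapPublisher<T, R> implements Flow.Publisher<R> {
    private final Flow.Publisher<T> upstream;
    private final Function<? super T, ? extends R> mapper;

    MapPublisher(Flow.Publisher<T> upstream, Function<? super T, ? extends R> mapper) {
        this.upstream = upstream;
        this.mapper = mapper;
    }

    @Override
    public void subscribe(Flow.Subscriber<? super R> downstream) {
        upstream.subscribe(new Flow.Subscriber<T>() {
            @Override public void onSubscribe(Flow.Subscription s) { downstream.onSubscribe(s); }
            @Override public void onNext(T item) { downstream.onNext(mapper.apply(item)); }
            @Override public void onError(Throwable t) { downstream.onError(t); }
            @Override public void onComplete() { downstream.onComplete(); }
        });
    }
}

public class OperatorChainDemo {
    public static void main(String[] args) {
        // Trivial synchronous source emitting 1..3; it ignores backpressure
        // for brevity (a real source must honor request()).
        Flow.Publisher<Integer> source = sub -> {
            sub.onSubscribe(new Flow.Subscription() {
                @Override public void request(long n) { }
                @Override public void cancel() { }
            });
            for (int i = 1; i <= 3; i++) {
                sub.onNext(i);
            }
            sub.onComplete();
        };

        Flow.Publisher<Integer> doubled = new MapPublisher<>(source, i -> i * 2);

        StringBuilder out = new StringBuilder();
        Flow.Subscriber<Integer> sink = new Flow.Subscriber<>() {
            @Override public void onSubscribe(Flow.Subscription s) { s.request(Long.MAX_VALUE); }
            @Override public void onNext(Integer item) { out.append(item).append(' '); }
            @Override public void onError(Throwable t) { }
            @Override public void onComplete() { out.append("done"); }
        };

        doubled.subscribe(sink);  // first run
        doubled.subscribe(sink);  // re-subscribing works: blueprint, not state
        System.out.println(out);  // 2 4 6 done2 4 6 done
    }
}
```

A Flow.Processor in the middle could not be re-subscribed this way once terminated; the publisher chain rebuilds its per-run state on every subscribe().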

  * Fix second subscriber cancellation in the BaseProcessor

Signed-off-by: Daniel Kec <daniel.kec@oracle.com>
@tomas-langer tomas-langer left a comment

LGTM

@akarnokd
Collaborator

How is it not emitted serially?

@olotenko

olotenko commented Feb 24, 2020

request can arrive during onNext and can be concurrent with it. Emitting onError concurrently breaks the guarantee that all on* invocations are in a total order. Unguarded invocation of onNext/onError will violate the recursion limitations - the stack depth becomes unbounded.

@akarnokd
Collaborator

akarnokd commented Feb 24, 2020

Wrong. The atomic state transition of addRequest, which requires the transition from zero, ensures there is only one thread entering the emission section.
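The trick being described - only the caller that transitions the demand counter from zero enters the emission loop - can be sketched as follows (SerialEmitter is an illustrative toy, not the actual Helidon code):

```java
import java.util.concurrent.atomic.AtomicLong;

// The "transition from zero" trick: request() atomically adds demand, and
// only the caller that moves the counter off zero enters the emission loop,
// so emission stays serial even under concurrent request() calls.
public class SerialEmitter {
    private final AtomicLong requested = new AtomicLong();
    private int next = 1;                          // demo state: emits 1, 2, 3, ...
    final StringBuilder out = new StringBuilder();

    public void request(long n) {
        if (requested.getAndAdd(n) != 0) {
            return;                                // another thread is already emitting
        }
        long remaining = n;                        // we transitioned from zero: emit
        while (remaining != 0) {
            out.append(next++).append(' ');        // emit one item
            remaining = requested.addAndGet(-1);   // consume one unit of demand,
        }                                          // picking up any concurrent requests
    }

    public static void main(String[] args) {
        SerialEmitter emitter = new SerialEmitter();
        emitter.request(3);
        System.out.println(emitter.out.toString().trim()); // 1 2 3
    }
}
```

A concurrent request() that finds the counter non-zero simply adds its demand and returns; the thread already in the loop observes the extra demand via addAndGet and keeps emitting, so no two threads emit at once.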

@olotenko

olotenko commented Feb 24, 2020

Well, it's more subtle than that, because it also interacts with canceled, but OK, yes, it is serial then. The alternative that was in review used the same trick, just more obviously: olotenko@ea05dd2#diff-75074b35969a7ab9a07495bf20596b1fR102

@akarnokd
Collaborator

Yes, the TCK expects a clear failure for non-positive request amounts which means the Publisher should not fail or complete on its own or get cancelled before that. This is one of the pain points of the TCK.

@olotenko

olotenko commented Feb 24, 2020

That requirement ("should not fail or complete on its own or get cancelled before that") is non-enforceable, and the code you submitted does not guarantee that either.

E.g. cancel followed synchronously by request(-1): request wins and will produce onError, whereas the spec requires a cancelled Subscription to behave as a no-op. I treat this as not a bug, because there is no requirement for cancel and request to be observed in a total order.

But the part where you say "should not complete or error before that" is just not enforceable. If the Publisher has passed the branch where it determined !hasNext, then a request(-1) will not produce onError. I treat this as not a bug, just the nature of concurrent systems. But this means you can't enforce that claim.

All the try-catch blocks can, and possibly should, be fused into one: the only difference in behaviour is that if onNext throws, it will produce onError, whereas currently it won't - but this is not wrong; and if it doesn't throw, then fusing all the try-catch blocks does no harm. The intermediate checks for canceled are unnecessary for the above reasons. The spec allows cancel and request(...) to be observed "eventually"; there is no requirement to observe them immediately. So if it races against other on*, there is no requirement to prefer issuing the error produced by a bad value in request(...). After that you get pretty much the loop body that was in review: olotenko@ea05dd2#diff-75074b35969a7ab9a07495bf20596b1fR107-R133 - with a subtle difference in how we determine mutual exclusion of who executes the on*.

@akarnokd
Collaborator

Eg cancel followed synchronously by request(-1) - request wins, and will produce onError. Whereas the spec requires a cancelled Subscription to behave as no-op.

Yes, and such bad request amounts should not go unnoticed, because they mean there is a bug somewhere in the chain. Since there is currently no established approach in the module to not lose exceptions, this is the best that can be done while other operators are being rewritten. We are discussing the possibility of a global error consumer on Slack.

All the try-catch can, and possibly should, be fused into one. The checks for canceled are unnecessary for the above reasons.

With a general Iterator, hasNext and next could take an arbitrary time to return; if the sequence gets canceled due to a timeout, the code would enter the other method and wait even more. This setup bails out eagerly.

@olotenko

olotenko commented Feb 24, 2020

Yes and such bad request amounts should not go unnoticed

Yes, but you can't guarantee that after cancel has been fired (the spec requires no-op behaviour, and the same is true of all implementations: after canceled has been observed, negative request values will not be notified via onError), or after the Publisher has committed to fire onError/onComplete. In other cases it will be noticed. The time to execute hasNext/next is immaterial for correctness. Given these premises, there is no proof that eager notification is any better than lazy notification. On the contrary, there are good reasons to optimize the happy path at the expense of handling error cases somewhat slower.

@olotenko

A "global error consumer" is an "unhandled exception handler" and should be at the root of any thread.
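For reference, the JVM already offers this hook at the thread root; a minimal sketch using the standard uncaught-exception handler:

```java
// The JVM's standard hook for a per-thread "global error consumer":
// an uncaught-exception handler installed at the root of the thread.
public class UncaughtDemo {
    public static void main(String[] args) throws InterruptedException {
        StringBuilder seen = new StringBuilder();
        Thread worker = new Thread(() -> {
            throw new IllegalStateException("bad request amount");
        });
        // The handler sees any throwable that escapes the thread's run().
        worker.setUncaughtExceptionHandler((thread, ex) -> seen.append(ex.getMessage()));
        worker.start();
        worker.join();
        System.out.println(seen); // bad request amount
    }
}
```

Thread.setDefaultUncaughtExceptionHandler sets the same hook globally, which is the caveat raised below: it only helps if you control the threads touching the operators.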

@akarnokd
Collaborator

The reactive foundation of Helidon is currently incomplete and is not yet prepared for all corner cases within and outside the spec and the TCK. This implementation passes the TCK and enables the development of more operators.

The time to execute hasNext / next is immaterial for correctness.

They pose practical considerations.

"global error consumer" is a "unhandled exception handler" and should be at the root of any thread

If you control all the threads that may come into contact with the reactive operators.

@olotenko

olotenko commented Feb 24, 2020

They pose practical considerations.

:) what can be more practical than fencing off problematic implementations instead of punishing everyone? E.g. an EagerCancellationIterablePublisher, or a TimedIterableWrapper to be used with Iterables that are problematic.

3f61017#diff-6e58085854ee1341e169599585b70a21R117-R119 - you've already spent CPU cycles getting the value, so you may just as well let the Subscriber have it.

@akarnokd
Collaborator

I welcome you to benchmark implementations with and without those volatile reads.

@olotenko

olotenko commented Feb 24, 2020

:) simpler linear code with fewer things to think about, rather. And it's not just a volatile read here, either.

I get a 15% difference with an iterator that just produces a range of integers. Can you propose a test where it matters or does not matter?

@danielkec danielkec merged commit f7f1486 into helidon-io:master Feb 24, 2020
@danielkec
Contributor Author

Continuation of the alternative cancellation strategy moved to /issues/1441

danielkec added a commit that referenced this pull request Mar 16, 2020
Signed-off-by: Daniel Kec <daniel.kec@oracle.com>