
BlacklistingChannel probation and per-endpoint info #349

Merged 6 commits into develop on Feb 18, 2020

Conversation

iamdanfox
Contributor

@iamdanfox iamdanfox commented Feb 17, 2020

Before this PR

  • Blacklisting was all-or-nothing (so one endpoint could stop traffic to an entire host)
  • After blacklisting, we'd send a deluge of requests to the newly unblacklisted host, which might still be broken, resulting in lots of failures.
  • Also, 500s wouldn't result in blacklisting.

(It's way easier to review this when I have the :simulation graphs to demonstrate what's broken)

After this PR

==COMMIT_MSG==
BlacklistingChannel keeps track of per-endpoint information and conservatively ramps up.
==COMMIT_MSG==
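To illustrate the "conservatively ramps up" idea, here is a minimal sketch (all names and thresholds are hypothetical illustrations, not the PR's actual implementation): after the blacklist window expires, the endpoint enters a probation phase where only a limited number of requests are let through until enough of them succeed.

```java
import java.time.Duration;

/** Hypothetical sketch of blacklist-then-probation state transitions. */
final class ProbationSketch {
    enum State { HEALTHY, BLACKLISTED, PROBATION }

    private final Duration blacklistDuration;
    private final int probationPermits;

    private State state = State.HEALTHY;
    private long blacklistedUntilNanos;
    private int remainingProbationPermits;

    ProbationSketch(Duration blacklistDuration, int probationPermits) {
        this.blacklistDuration = blacklistDuration;
        this.probationPermits = probationPermits;
    }

    /** Record a failure: blacklist the endpoint for the configured duration. */
    void onFailure(long nowNanos) {
        state = State.BLACKLISTED;
        blacklistedUntilNanos = nowNanos + blacklistDuration.toNanos();
    }

    /** Record a success: probation permits are worked off until fully healthy. */
    void onSuccess() {
        if (state == State.PROBATION && --remainingProbationPermits <= 0) {
            state = State.HEALTHY;
        }
    }

    /** Decide whether a request may be sent right now. */
    boolean tryAcquire(long nowNanos) {
        if (state == State.BLACKLISTED) {
            if (nowNanos < blacklistedUntilNanos) {
                return false; // still blacklisted: no traffic at all
            }
            // window expired: ramp up conservatively instead of sending a deluge
            state = State.PROBATION;
            remainingProbationPermits = probationPermits;
        }
        return true;
    }

    State state() {
        return state;
    }

    public static void main(String[] args) {
        ProbationSketch sketch = new ProbationSketch(Duration.ofSeconds(10), 2);
        sketch.onFailure(0L);
        System.out.println(sketch.tryAcquire(1L)); // false: inside the window
        System.out.println(sketch.tryAcquire(Duration.ofSeconds(11).toNanos())); // true: probation begins
        System.out.println(sketch.state()); // PROBATION
        sketch.onSuccess();
        sketch.onSuccess();
        System.out.println(sketch.state()); // HEALTHY after enough successes
    }
}
```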

Possible downsides?

@changelog-app

changelog-app bot commented Feb 17, 2020

Generate changelog in changelog/@unreleased

Type

  • Feature
  • Improvement
  • Fix
  • Break
  • Deprecation
  • Manual task
  • Migration

Description

BlacklistingChannel keeps track of per-endpoint information and conservatively ramps up.

Check the box to generate changelog(s)

  • Generate changelog entry

this.duration = duration;
this.ticker = ticker;
this.perEndpointBlacklistState =
Caffeine.newBuilder().maximumSize(1000).ticker(ticker).build();
Contributor

per-endpoint state feels a bit odd here. At the channel level I think we'd want to track health of the channel as a whole. I wonder if we could invert the relationship between channel and Endpoint to allow endpoints to be wrapped similarly to conjure-undertow?

Contributor Author

Is this something that you want to nail down before moving forwards with this PR? (We've already got a Cache<Endpoint, Limiter> in our concurrency limiters...)

This BlacklistingChannel is a key ingredient for an alternative node selection strategy I've been working on, but I'd really like to land this PR, as it's not super usable as-is.
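For readers following along, the Cache<Endpoint, Limiter> shape mentioned above amounts to lazily materializing mutable state per key. A dependency-free sketch of that pattern with ConcurrentHashMap (Caffeine's Cache adds bounded size via maximumSize and a pluggable Ticker on top; the key and field names here are hypothetical):

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

/** Dependency-free sketch of lazily-created per-endpoint state. */
final class PerEndpointState {

    /** Hypothetical mutable state; the real state classes live in this PR. */
    static final class EndpointState {
        volatile long blacklistedUntilNanos;
    }

    // Keyed by a hypothetical endpoint name; dialogue keys on its Endpoint type.
    private final ConcurrentMap<String, EndpointState> states = new ConcurrentHashMap<>();

    /** Mirrors Cache.get(key, loader): create on first access, reuse afterwards. */
    EndpointState stateFor(String endpoint) {
        return states.computeIfAbsent(endpoint, key -> new EndpointState());
    }

    public static void main(String[] args) {
        PerEndpointState tracker = new PerEndpointState();
        System.out.println(tracker.stateFor("getFoo") == tracker.stateFor("getFoo")); // true: reused
        System.out.println(tracker.stateFor("getFoo") == tracker.stateFor("putBar")); // false: distinct
    }
}
```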

}

// I wish java had union types
interface BlacklistState {}
Contributor

derive4j is your friend here

Contributor Author

So I actually really don't want to have the extra faff and tooling setup of derive4j - can you forgive these two purely implementation-detail classes?
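For context, the union-via-marker-interface pattern under discussion looks roughly like this (the variant classes and fields here are hypothetical illustrations, not the PR's actual code):

```java
/** Marker interface standing in for a union (sum) type. */
interface BlacklistState {}

/** Variant 1: the endpoint is blacklisted until a deadline. */
final class BlacklistedUntil implements BlacklistState {
    final long untilNanos;

    BlacklistedUntil(long untilNanos) {
        this.untilNanos = untilNanos;
    }
}

/** Variant 2: the endpoint is on probation with limited permits. */
final class Probation implements BlacklistState {
    final int remainingPermits;

    Probation(int remainingPermits) {
        this.remainingPermits = remainingPermits;
    }
}

final class UnionDemo {
    /** Branching by instanceof: Java 8's closest equivalent to pattern matching. */
    static String describe(BlacklistState state) {
        if (state instanceof BlacklistedUntil) {
            return "blacklisted until " + ((BlacklistedUntil) state).untilNanos;
        } else if (state instanceof Probation) {
            return "probation with " + ((Probation) state).remainingPermits + " permits";
        }
        throw new IllegalStateException("unknown state: " + state);
    }

    public static void main(String[] args) {
        System.out.println(describe(new BlacklistedUntil(42L)));
        System.out.println(describe(new Probation(3)));
    }
}
```

derive4j, which the reviewer suggests, would generate the variants plus an exhaustive match method so the compiler flags unhandled cases; sealed interfaces in later Java versions make this pattern first-class.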

@ferozco
Contributor

ferozco commented Feb 18, 2020

👍

@iamdanfox
Contributor Author

FWIW Channels#create doesn't currently use BlacklistingChannel - it was a piece of floating dead code... but maybe not for long ;)

@bulldozer-bot bulldozer-bot bot merged commit 96ebe45 into develop Feb 18, 2020
@bulldozer-bot bulldozer-bot bot deleted the dfox/better-blacklisting branch February 18, 2020 23:11
@carterkozak
Contributor

carterkozak commented Feb 18, 2020 via email

@iamdanfox
Contributor Author

Ah right, fair enough. The concurrency limiter's behaviour only changes when a request goes in or comes back (which triggers the queued work to be scheduled), whereas blacklisting is time-based: we could have a spike of requests that all get queued, and if no further requests come in, they'd just sit on the queue forever.
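A sketch of the fix that distinction would require (hypothetical code, not from this PR): because unblacklisting is triggered by the clock rather than by a request event, anything queued must schedule its own wake-up to drain, or it sits there until the next request happens to arrive.

```java
import java.util.ArrayDeque;
import java.util.Queue;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

/** Hypothetical sketch: time-based unblacklisting must schedule its own drain. */
final class TimedDrainSketch {
    private final Queue<Runnable> queued = new ArrayDeque<>();
    private final ScheduledExecutorService scheduler =
            Executors.newSingleThreadScheduledExecutor();

    /**
     * Queue a request while blacklisted, and schedule a drain at expiry. Without
     * the scheduled call, nothing would ever run these requests if no further
     * traffic arrives -- the failure mode described in the comment above.
     */
    synchronized void enqueue(Runnable request, long delayMillis) {
        queued.add(request);
        scheduler.schedule(this::drain, delayMillis, TimeUnit.MILLISECONDS);
    }

    private synchronized void drain() {
        Runnable next;
        while ((next = queued.poll()) != null) {
            next.run();
        }
    }

    void shutdown() {
        scheduler.shutdown();
    }

    public static void main(String[] args) throws Exception {
        TimedDrainSketch sketch = new TimedDrainSketch();
        java.util.concurrent.CountDownLatch latch = new java.util.concurrent.CountDownLatch(1);
        sketch.enqueue(latch::countDown, 10);
        // The queued request runs once the scheduled drain fires, with no
        // follow-up request needed to kick it.
        System.out.println(latch.await(1, TimeUnit.SECONDS));
        sketch.shutdown();
    }
}
```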
