Design client backoff protocol #100

ckarlof · 2013-07-31T22:42:35Z

Handle misbehaving clients or periods of high load. A 20x response with a header or a 503 with or without a header would be appropriate.

ckarlof · 2013-07-31T22:44:52Z

Here what Sync 2.0 does: http://docs.services.mozilla.com/storage/apis-2.0.html#response-headers

rfk · 2013-07-31T22:59:33Z

Based on an IRC conversation with warner, I strawman-propose that we do a blend of the sync2.0 style backoff and a proof-of-work scheme. We can start with a polite request for the client to back off:

200 OK
Backof:  <time to back off, in seconds>

If things get really hairy, we send out a 503:

503 Service Unavailable
Retry-After:  <time to wait before re-trying, in seconds>
PoW-Required:  <proof-of-work protocol parameters>

The client at this point has two options. It can just wait and try again later, or it can do a hashcash-style proof-of-work thing and re-submit its request:

PUT /whatever HTTP/1.1
Host:  blah blah
PoW:  <proof-of-work hash>

The client is expected to submit a fresh proof-of-work with each new request, until the retry-after time has expired.

@warner does this adequately capture the gist of our conversation? Thoughts?

ckarlof · 2013-07-31T23:30:41Z

Details on PoW protocol here:
https://wiki.mozilla.org/Identity/AttachedServices/KeyServerProtocol#Proof-Of-Work

On Wed, Jul 31, 2013 at 3:59 PM, Ryan Kelly notifications@github.comwrote:

Based on an IRC conversation with warner, I strawman-propose that we do a
blend of the sync2.0 style backoff and a proof-of-work scheme. We can start
with a polite request for the client to back off:

200 OK
Backof: <time to back off, in seconds>

If things get really hairy, we send out a 503:

503 Service Unavailable
Retry-After: <time to wait before re-trying, in seconds>
PoW-Required:

The client at this point has two options. It can just wait and try again
later, or it can do a hashcash-style proof-of-work thing and re-submit its
request:

PUT /whatever HTTP/1.1
Host: blah blah
PoW:

The client is expected to submit a fresh proof-of-work with each new
request, until the retry-after time has expired.

@warner https://github.com/warner does this adequately capture the gist
of our conversation? Thoughts?

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/100#issuecomment-21902561
.

ckarlof · 2013-07-31T23:32:46Z

Client side support for PoW needs to be baked in from the start.

warner · 2013-08-26T21:25:41Z

Yeah, that mostly matches what I remember.

One thing to clarify for the docs: the client's "options" (retry-after and PoW) aren't really equivalent. We can't distinguish one client from another, so there's no way for us to tell that a client has been politely/patiently waiting (and then accept their request without the PoW).

If the DoS attack has stopped by the time they retry (and we're no longer requiring PoWs), then the retry-after might happen to work. But that state might last for hours or days. So only a really lazy client should just do retry-after without the proof-of-work, and they should be prepared to not connect for long periods of time.

How exactly would 503+Retry-After fit in? I guess if we're busy enough to emit 200+Backoff, and find that's not enough, the next stage is to start rejecting requests randomly, and 503+Retry-After tells them "it's ok, it's not your fault, please come back eventually". At that point, most good clients should already have been honoring the Backoff=x header from their last successful request. So either that delay is not sufficient, or there are clients who aren't honoring it (who might go away if we require PoW).

normal operations:
- POST -> 200 OK
somewhat busy:
- POST -> 200 OK, Backoff=x
- wait x
- POST -> 200 OK, Backoff=x
really busy:
- POST -> 503, Retry-After=x (probabilistically)
- wait x
- POST -> 200 or 503 (probabilistically)
really really busy:
- POST -> 503, PoW-Required=params (always)
- compute PoW
- POST (with PoW) -> 200 OK, Backoff=x
- wait x
- POST -> 503, PoW-Required=params
- compute PoW
- POST (with PoW) -> 200 OK, Backoff=x

(The time between the fetch of the PoW parameters and the submission of the completed PoW should be as short as possible)

So I guess I'm wondering if we should report 503+Retry-After, or 503+PoW-Required, but never both.

rfk · 2013-08-26T22:30:53Z

Good points. One small nit: clients might arrive in the middle of a DoS and never have seen a Backoff header before being hit with a 503.

What I was going for with Retry-After was basically "we estimate it will be at least this long until we switch off PoW", which might let the client make a more intelligent choice between waiting versus doing the work. It's not a promise that your request will succeed if you wait that long - more a guideline than an actual rule.

Happy to make these two headers exclusive if it will simplify things for the client.

rfk · 2013-10-25T00:53:14Z

/cc @telliott for perspective on proof-of-work idea

telliott · 2013-10-25T04:37:06Z

I like the general idea of proof-of-work for clients hitting us too often, but 503 isn't really a good match, since it's a server-side-problem status code, and this is a client problem. 403 is probably the appropriate status here and is explicit that this is a client-fixable issue.

rfk · 2013-10-25T04:39:43Z

RFC6585 also defines a "429 Too Many Requests" status which is appropriate here.

dannycoates · 2013-10-28T18:25:44Z

PoW might be useful for both kinds of load but I'm not sure I like penalizing clients (computationally) in the 503 high server load case. It seems nicer to return a RETRY-AFTER header and trust the client to respect it.

For the 429 case where individual clients are too chatty I like PoW.

ckarlof · 2013-10-28T18:40:17Z

We should distinguish between these two cases.

ckarlof · 2013-11-19T03:25:21Z

Basic backoff design in #323.

Fixes #100

fixes #100

rfk mentioned this issue Oct 14, 2013

Proof of work #217

Closed

ghost assigned rfk Oct 16, 2013

rfk mentioned this issue Oct 18, 2013

Design and implement "createToken" for limiting creation of new accounts #227

Closed

zaach mentioned this issue Oct 22, 2013

New api to detect if an account exists #236

Closed

ckarlof closed this as completed Nov 19, 2013

vladikoff added a commit that referenced this issue Feb 17, 2017

feat(reminder): add verification reminder email templates and strings

8763bac

Fixes #100

rfk pushed a commit that referenced this issue Oct 24, 2018

don't use multipleStatements in mysql

d71d988

fixes #100

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Design client backoff protocol #100

Design client backoff protocol #100

ckarlof commented Jul 31, 2013

ckarlof commented Jul 31, 2013

rfk commented Jul 31, 2013

ckarlof commented Jul 31, 2013

ckarlof commented Jul 31, 2013

warner commented Aug 26, 2013

rfk commented Aug 26, 2013

rfk commented Oct 25, 2013

telliott commented Oct 25, 2013

rfk commented Oct 25, 2013

dannycoates commented Oct 28, 2013

ckarlof commented Oct 28, 2013

ckarlof commented Nov 19, 2013

Design client backoff protocol #100

Design client backoff protocol #100

Comments

ckarlof commented Jul 31, 2013

ckarlof commented Jul 31, 2013

rfk commented Jul 31, 2013

ckarlof commented Jul 31, 2013

ckarlof commented Jul 31, 2013

warner commented Aug 26, 2013

rfk commented Aug 26, 2013

rfk commented Oct 25, 2013

telliott commented Oct 25, 2013

rfk commented Oct 25, 2013

dannycoates commented Oct 28, 2013

ckarlof commented Oct 28, 2013

ckarlof commented Nov 19, 2013