Dynamic semaphore initialization from ticket quota #98

Closed
dalehamel opened this issue Feb 8, 2017 · 4 comments · Fixed by #120

Comments

dalehamel (Member) commented Feb 8, 2017

The gist of what is proposed here is to allow us to eliminate the assumption that there are a fixed number of workers (resource consumers) on a particular host. In a more dynamic scheduling environment (think: Kubernetes), we cannot be certain of the number of resource consumers on a given host.

This is problematic, because under the current model we would have a ticket quota of a fixed size for a fixed number of workers. For illustration:

  • Assume we have a resource that permits 5 tickets (T) -> T = 5
  • Assume we have 10 workers (W) -> W = 10
  • In this case, only half of the workers may access the resource at a time -> Q = T / W = 0.5

So, since W is no longer static, we need T to react to it in order to preserve Q at 0.5.
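To make the relationship concrete, here is a tiny worked example in plain Ruby (nothing Semian-specific; the QUOTA constant and the loop are purely illustrative):

```ruby
# Holding the quota Q fixed at 0.5, the ticket count T must track W.
QUOTA = 0.5

[10, 4, 20].each do |workers|
  tickets = (workers * QUOTA).floor
  puts "W=#{workers} -> T=#{tickets} (Q=#{tickets.to_f / workers})"
end
# Output:
# W=10 -> T=5 (Q=0.5)
# W=4 -> T=2 (Q=0.5)
# W=20 -> T=10 (Q=0.5)
```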

The proposed implementation is as follows:

  • Have a new semaphore, based on the maximum number of tickets, that tracks the tickets per worker (it needs to be unique per process or thread - probably the parent pid or pid_threadid). This would be the "quota semaphore", or "worker quota semaphore", tracking the number of worker tickets that have been issued. As new, unique workers are added, we decrement this value; as they are removed, we increment it.
  • The difference between the quota semaphore and the configured global maximum is the number of workers participating in the quota. This allows us to keep track of W.
  • As we update the quota semaphore, we can dynamically update the number of available RPC tickets (T) in order to maintain the desired quota (Q). (A sketch of this flow follows the list.)
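A rough, in-memory sketch of this bookkeeping: in the real implementation these counters would live in the resource's SysV semaphore set (with SEM_UNDO reversing a registration when a worker dies), but here they are plain Ruby integers so the sketch runs on its own, and none of the names are Semian's actual API.

```ruby
# Illustrative only: the quota semaphore starts at a configured maximum and is
# decremented once per registered worker, so W is the difference between the two.
class QuotaBookkeeping
  MAX_WORKERS = 1024 # stand-in for the configured global maximum

  attr_reader :tickets

  def initialize(quota:)
    @quota = quota
    @quota_semaphore = MAX_WORKERS # decremented once per registered worker
    @tickets = 0
  end

  def register_worker
    @quota_semaphore -= 1 # a real worker would do a SysV semop here
    update_ticket_count
  end

  def unregister_worker
    @quota_semaphore += 1 # SEM_UNDO would effectively do this on worker death
    update_ticket_count
  end

  # W = configured maximum - current value of the quota semaphore
  def worker_count
    MAX_WORKERS - @quota_semaphore
  end

  private

  # T = Q * W (the rounding strategy is discussed later in the thread)
  def update_ticket_count
    @tickets = (@quota * worker_count).floor
  end
end

bookkeeping = QuotaBookkeeping.new(quota: 0.5)
10.times { bookkeeping.register_worker }
bookkeeping.worker_count # => 10
bookkeeping.tickets      # => 5
```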

A nuance to this is:

When workers unregister themselves (they're killed, or they stop and SEM_UNDO does its thing), the worker count needs to be adjusted by something. We can cache the worker count in a semaphore in the semaphore set for the resource. On #acquire, if it differs from the live count, we call update_ticket_count. For this reason it seems better to do this at #acquire time rather than #register time.
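One way to picture that check, with plain instance variables standing in for the cached worker-count semaphore and the live count (none of these names are Semian's API; it is a sketch of the idea only):

```ruby
# Sketch of the #acquire-time refresh: compare the cached worker count with
# the live one and recompute the ticket count only when they differ.
class AcquireSketch
  attr_reader :tickets

  def initialize(quota:)
    @quota = quota
    @cached_worker_count = 0 # stand-in for the cache semaphore in the set
    @tickets = 0
  end

  def acquire(live_worker_count)
    if live_worker_count != @cached_worker_count
      update_ticket_count(live_worker_count)
      @cached_worker_count = live_worker_count # write back to the cache
    end
    # ...then proceed with the normal ticket wait/decrement...
    @tickets
  end

  private

  def update_ticket_count(workers)
    @tickets = (@quota * workers).floor # thread-safe in the real implementation
  end
end

sketch = AcquireSketch.new(quota: 0.5)
sketch.acquire(10) # => 5, counts differ, ticket count refreshed
sketch.acquire(10) # => 5, cache matches, nothing recomputed
```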

dalehamel (Member, Author) commented:

Had a chat with @sirupsen and I think that the approach we've settled on boils down to:

Have a new semaphore based on the maximum number of tickets that tracks the tickets per worker (it needs to be unique per process or thread - probably the parent pid or pid_threadid).

The difference between this and the configured global maximum is the number of tickets currently available for that resource.

Based on updates to this value, we can dynamically update the number of available RPC tickets as some fraction (the quota) of the worker count. If the floor of that result differs from the previous floor, do a thread-safe update of the ticket count.
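A minimal sketch of that rule, using a Mutex as a stand-in for whatever synchronization the real update would need (names are illustrative, not Semian's API):

```ruby
# Only update the ticket count when the floor of quota * workers changes.
class TicketUpdater
  attr_reader :tickets

  def initialize(quota:)
    @quota = quota
    @lock = Mutex.new
    @tickets = nil
  end

  # Called whenever the tracked worker count (W) changes.
  def workers_changed(worker_count)
    desired = (@quota * worker_count).floor
    return @tickets if desired == @tickets # floor unchanged, nothing to do

    @lock.synchronize { @tickets = desired } # thread-safe ticket update
  end
end

updater = TicketUpdater.new(quota: 0.5)
updater.workers_changed(10) # => 5
updater.workers_changed(11) # => 5 (floor(5.5) is still 5, no update)
updater.workers_changed(12) # => 6 (ticket count adjusted)
```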

sirupsen (Contributor) commented Feb 8, 2017

@csfrancis @casperisfine any comments on this approach? Namely the newest comment from Dale

dalehamel (Member, Author) commented:

@csfrancis asked me:

in k8s are sysv semaphores shared across the entire physical host? that seems strange

and this brings up a point of clarification: the reason any of this is necessary is that we have to use hostIPC for logging. Because we are logging to SysV MQ, we are forced into using the host IPC namespace.

sirupsen (Contributor) commented Feb 8, 2017

K, @csfrancis and I had a short conversation and he's onboard with the solution of dynamically adjusting ticket counts. Some excellent points Scott made:

  • We should call the configuration option quota, not tickets, to avoid confusing the two. ArgumentError should be raised if both are set (sketched after this list).
  • When workers unregister themselves (they're killed, or they stop and SEM_UNDO does its thing), the worker count needs to be adjusted by something. We can cache the worker count in a semaphore in the semaphore set for the resource. On #acquire, if it differs from the live count, we call update_ticket_count. For this reason it seems better to do this at #acquire time rather than #register time.
  • A problem with the alternative approach of Semian["#{Process.ppid}_#{Thread.id}_#{resource_name}"] that Scott pointed out is that you'll basically have to GC it. By default, the limit on the number of semaphore sets is 32,000 on Linux. We have about 100 resources at Shopify. If we run, say, 10 pods per host, it'll take 32,000 / (100 * 10) = 32 deploys before we exhaust this space.
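The first point could look roughly like this at the configuration boundary (a sketch only; the real option handling would live in Semian's resource registration code, and the range check on quota is an assumption, not something decided in this thread):

```ruby
# Sketch of the proposed option validation: accept either a static ticket
# count or a quota fraction, and raise ArgumentError if both are given.
def validate_ticket_options(tickets: nil, quota: nil)
  if tickets && quota
    raise ArgumentError, "cannot set both `tickets` and `quota`"
  end
  if quota && !(quota > 0 && quota <= 1.0)
    # Assumed sanity check, not specified in the thread.
    raise ArgumentError, "`quota` must be in (0, 1]"
  end
  { tickets: tickets, quota: quota }
end

validate_ticket_options(quota: 0.5)             # => { tickets: nil, quota: 0.5 }
validate_ticket_options(tickets: 5, quota: 0.5) # raises ArgumentError
```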
