Fixed size ThreadPool scheduler #20808

maierlars · 2024-04-03T13:07:43Z

Scope & Purpose

This PR implements a new ThreadPoolScheduler with a fixed number of threads, and a number of different thread pools. The ThreadPoolScheduler creates a separate thread pool for each priority lane.

We have the following thread pools:

SimpleThreadPool - this is a very simple thread pool with a std::queue, mutex + condition variable
LockFreeThreadPool - this one is similar to the SimpleThreadPool but replaces the std::queue with a boost::lockfree::queue
WorkStealingThreadPool - this is a first version of a work stealing pool where instead of a globally shared queue, each thread has a local queue

This PR also implements a number of synthetic benchmarks as tests (these are disabled by default so that they do not run during regular builds). These benchmarks showed that the WorkStealingThreadPool scales significantly better than the other two thread pools, and also much better than the current SupervisedScheduler.
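The per-thread queue idea behind a work-stealing pool can be sketched as below. This is only an illustration under simplifying assumptions and not the WorkStealingThreadPool from this PR: the name `WorkStealingPoolSketch` is hypothetical, each local queue is a mutex-protected `std::deque`, jobs are distributed round-robin, and a single shared counter is used to put idle threads to sleep. It also assumes at least one thread and jobs that do not throw.

```cpp
#include <atomic>
#include <condition_variable>
#include <cstddef>
#include <deque>
#include <functional>
#include <memory>
#include <mutex>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical sketch: one local queue per worker, idle workers steal.
class WorkStealingPoolSketch {
 public:
  using Job = std::function<void()>;

  explicit WorkStealingPoolSketch(std::size_t numThreads) {  // needs >= 1
    for (std::size_t i = 0; i < numThreads; ++i) {
      queues_.push_back(std::make_unique<LocalQueue>());
    }
    for (std::size_t i = 0; i < numThreads; ++i) {
      threads_.emplace_back([this, i] { run(i); });
    }
  }

  // Drains all queued jobs, then joins the workers.
  ~WorkStealingPoolSketch() {
    {
      std::lock_guard<std::mutex> guard(sleepMutex_);
      stopping_ = true;
    }
    wakeup_.notify_all();
    for (auto& t : threads_) t.join();
  }

  void push(Job job) {
    // Round-robin placement; the job is enqueued before it is counted.
    std::size_t target = next_.fetch_add(1) % queues_.size();
    {
      std::lock_guard<std::mutex> guard(queues_[target]->mutex);
      queues_[target]->jobs.push_back(std::move(job));
    }
    {
      std::lock_guard<std::mutex> guard(sleepMutex_);
      ++pending_;
    }
    wakeup_.notify_one();
  }

 private:
  struct LocalQueue {
    std::mutex mutex;
    std::deque<Job> jobs;
  };

  // Try the own queue first (newest job), then steal the oldest job
  // from the other queues.
  bool take(std::size_t self, Job& out) {
    for (std::size_t k = 0; k < queues_.size(); ++k) {
      LocalQueue& q = *queues_[(self + k) % queues_.size()];
      std::lock_guard<std::mutex> guard(q.mutex);
      if (q.jobs.empty()) continue;
      if (k == 0) {
        out = std::move(q.jobs.back());
        q.jobs.pop_back();
      } else {
        out = std::move(q.jobs.front());
        q.jobs.pop_front();
      }
      return true;
    }
    return false;
  }

  void run(std::size_t self) {
    for (;;) {
      {
        std::unique_lock<std::mutex> lock(sleepMutex_);
        wakeup_.wait(lock, [this] { return pending_ > 0 || stopping_; });
        if (pending_ == 0) return;  // stopping and fully drained
        --pending_;                 // claim exactly one queued job
      }
      Job job;
      // A claimed job is guaranteed to sit in some queue; rescan until found.
      while (!take(self, job)) std::this_thread::yield();
      job();
    }
  }

  std::vector<std::unique_ptr<LocalQueue>> queues_;
  std::atomic<std::size_t> next_{0};
  std::mutex sleepMutex_;
  std::condition_variable wakeup_;
  std::size_t pending_ = 0;
  bool stopping_ = false;
  std::vector<std::thread> threads_;  // declared last: starts after the rest
};
```

The shared counter keeps the sketch short but reintroduces a global lock on the wake-up path, so it should not be read as an explanation of the benchmark results reported above; it only shows the local-queue-plus-stealing structure.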

goedderz

Soo much simpler 😍

arangod/Scheduler/SimpleThreadPool.cpp

jsteemann

LGTM

js/client/modules/@arangodb/testutils/san-file-handler.js

arangod/Scheduler/ThreadPoolScheduler.h

arangod/Scheduler/ThreadPoolScheduler.cpp

jsteemann · 2024-06-07T11:10:47Z

arangod/Scheduler/ThreadPoolScheduler.cpp

+
+// queue fill grad tracking is currently not implemented for this scheduler
+double ThreadPoolScheduler::approximateQueueFillGrade() const { return 0; }
+double ThreadPoolScheduler::unavailabilityQueueFillGrade() const { return 0; }


Note that the queue fill grade may be checked by a load balancer periodically to determine if the instance is still in good shape.
Consider this calling code, which determines the instance's availability for further requests, from a load balancer's perspective:

    // if the scheduler's queue is more than x% full, render
    // the server unavailable
    double unavailabilityFillGrade = scheduler->unavailabilityQueueFillGrade();
    if (unavailabilityFillGrade > 0.0) {
      double fillGrade = scheduler->approximateQueueFillGrade();
      if (fillGrade >= unavailabilityFillGrade) {
        // oops, queue is relatively full
        available = false;
      }
    }

While this code still works if the unavailability fill grade is 0, the check is then skipped entirely, so more requests will now be sent to the instance even if its queue is relatively full.
If this is an intentional change, fine. If not, is it much work to implement the fill grade methods in the new scheduler, or is there a reason why we shouldn't do this?

These functions are defined in the Scheduler interface, but they currently only make sense for the SupervisedScheduler (a leaky abstraction). We will have to discuss if and how to add this to the new scheduler.
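To make the concern concrete, here is a small sketch of the availability check quoted above, together with one possible way a fill grade could be derived from a queue length. Both helper names (`isAvailable`, `fillGradeFromQueue`) and the notion of a configured maximum queue length are assumptions for illustration only; the thread does not say how, or whether, the new scheduler will compute this.

```cpp
#include <cstddef>

// Hypothetical: fill grade as the fraction of an assumed maximum queue
// length, clamped to [0, 1].
double fillGradeFromQueue(std::size_t queued, std::size_t maxQueued) {
  if (maxQueued == 0) return 0.0;
  double grade = static_cast<double>(queued) / static_cast<double>(maxQueued);
  return grade > 1.0 ? 1.0 : grade;
}

// Mirrors the quoted calling code: an unavailability threshold of 0
// disables the check, so the instance always reports itself as available.
bool isAvailable(double fillGrade, double unavailabilityFillGrade) {
  if (unavailabilityFillGrade > 0.0 && fillGrade >= unavailabilityFillGrade) {
    return false;  // queue is relatively full
  }
  return true;
}
```

With both methods returning 0, as in the diff above, `isAvailable` is true regardless of the real queue length, which is the behavior change being asked about.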

jsteemann

LGTM

maierlars self-assigned this Apr 3, 2024

cla-bot bot added the cla-signed label Apr 3, 2024

maierlars force-pushed the feature/thread-pool-scheduler branch from f9a3e4e to 35d0162 on April 3, 2024 13:15

maierlars added 3 commits April 4, 2024 12:23

ThreadPool + tests.

353c06a

Added more tests.

7b059ae

Experiment with new scheduler.

d2994c5

maierlars force-pushed the feature/thread-pool-scheduler branch from 35d0162 to d2994c5 on April 4, 2024 10:23

maierlars added 11 commits April 4, 2024 14:11

Increase number of threads.

85bdc63

Sort out min-max-confusion.

8d3b74a

Fixing statistics and queue times.

5d44080

Implement toVelocyPack to fix dump and restore tests.

6ad55c7

Fix thread names in scheduler.

b84c4ef

Move scheduler metrics in to a shared struct.

7c8b4dc

Fix compilation. Set metrics value.

6ee7eac

Remove metrics.

2648b4b

Merge remote-tracking branch 'origin/devel' into feature/thread-pool-scheduler

137ead3

Temporarily disable logging.

5e4028a

Merge branch 'devel' into feature/thread-pool-scheduler

b185fed

maierlars marked this pull request as ready for review April 6, 2024 09:09

maierlars added 3 commits April 8, 2024 09:44

Merge remote-tracking branch 'origin/devel' into feature/thread-pool-scheduler

70e467f

Fixing lane assignment.

949882d

Swallow exceptions in thread pool.

ac40adc

maierlars marked this pull request as draft April 8, 2024 12:12

goedderz reviewed Apr 8, 2024

arangod/Scheduler/SimpleThreadPool.cpp (outdated, resolved)

arangod/Scheduler/SimpleThreadPool.cpp (outdated, resolved)

maierlars and others added 6 commits April 9, 2024 15:18

Add queue length metrics.

5c08e33

Refactor test

0d52975

Rename ThreadPool to SimpleThreadPool

78c8c3c

Adapt tests and add performance comparison tests

6637afb

Extend tests and revert now unneeded changes

5ac752b

Add LockfreeThreadPool

57636c3

mpoeter added 2 commits May 23, 2024 14:18

Address review comments

655f185

Address more review comments

3e97de4

jsteemann approved these changes May 24, 2024

mpoeter added 12 commits May 27, 2024 16:26

Add some debug output

d367707

Merge branch 'devel' into feature/thread-pool-scheduler

e5f8e27

Introduce alias to make it easier to switch pool implementation in ThreadPoolScheduler

e75e384

Shutdown thread pools in scheduler shutdown

f3cd3da

Fix debug output

ffcfff2

Merge branch 'devel' into feature/thread-pool-scheduler

398be7c

Default to SupervisedScheduler

933e248

Change scheduler parameter to "threadpools"

d8f7f76

Use shorter name for maintenance threads

f43b53e

The longer name exceeded the limit so setting the thread name failed.

Use ThreadPoolScheduler as default

4d6b467

Merge branch 'devel' into feature/thread-pool-scheduler

3039601

Default to SupervisedScheduler

959b559