[Runtime] [ThreadPool] Make SpscTaskQueue::Pop(..) spin_count configurable #3577

ajtulloch · 2019-07-18T20:08:40Z

In cases where we have multiple models or threadpools active, spinning around
sched_yield() may not be desirable, as it prevents the OS from effectively
scheduling other threads.

Thus, allow users to conditionally disable this behaviour (via an environment
variable TVM_THREAD_POOL_SPIN_COUNT, similar to existing environment flags for
the thread pool such as TVM_BIND_THREADS, etc).

This substantially improves tail latencies in some of our multi-tenant
workloads in practice.
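For readers unfamiliar with the pattern, here is a minimal sketch of a spin-then-block pop whose spin budget comes from the environment. It is not the code in this PR: `ParseSpinCount`, `GetSpinCount`, `ToyQueue`, and the fallback value `kAssumedDefaultSpinCount` are hypothetical names and values chosen for illustration. Only the variable name `TVM_THREAD_POOL_SPIN_COUNT` comes from the description above.

```cpp
#include <atomic>
#include <cctype>
#include <condition_variable>
#include <cstdint>
#include <cstdlib>
#include <mutex>
#include <thread>

// Assumed fallback for illustration only; the real default may differ.
constexpr uint32_t kAssumedDefaultSpinCount = 300000;

// Hypothetical helper: turn an environment-variable value into a spin count.
// Anything that is not a plain non-negative decimal number yields `fallback`.
inline uint32_t ParseSpinCount(const char* value, uint32_t fallback) {
  if (value == nullptr || !std::isdigit(static_cast<unsigned char>(*value))) {
    return fallback;
  }
  char* end = nullptr;
  unsigned long parsed = std::strtoul(value, &end, 10);
  if (*end != '\0' || parsed > UINT32_MAX) return fallback;
  return static_cast<uint32_t>(parsed);
}

// Hypothetical helper: read the spin count from the environment.
inline uint32_t GetSpinCount() {
  return ParseSpinCount(std::getenv("TVM_THREAD_POOL_SPIN_COUNT"),
                        kAssumedDefaultSpinCount);
}

// Toy single-slot queue (not SpscTaskQueue) showing the spin-then-block idea.
class ToyQueue {
 public:
  void Push(int v) {
    {
      std::lock_guard<std::mutex> lock(mutex_);
      value_ = v;
      ready_.store(true, std::memory_order_release);
    }
    cv_.notify_one();
  }

  // Yields up to `spin_count` times while the slot is empty, then falls back
  // to a blocking wait. With spin_count == 0 the yield loop is skipped and
  // the thread blocks immediately, leaving the CPU to other threads.
  int Pop(uint32_t spin_count, uint32_t* spins_used = nullptr) {
    uint32_t i = 0;
    for (; i < spin_count && !ready_.load(std::memory_order_acquire); ++i) {
      std::this_thread::yield();
    }
    if (spins_used != nullptr) *spins_used = i;
    std::unique_lock<std::mutex> lock(mutex_);
    cv_.wait(lock, [this] { return ready_.load(std::memory_order_acquire); });
    ready_.store(false, std::memory_order_relaxed);
    return value_;
  }

 private:
  std::mutex mutex_;
  std::condition_variable cv_;
  std::atomic<bool> ready_{false};
  int value_ = 0;
};
```

A worker would call something like `queue.Pop(GetSpinCount())`. Setting the variable to 0 trades a slightly slower wake-up for not competing with other thread pools for CPU time.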

Unit tests have been added - on my laptop, running:

TVM_THREAD_POOL_SPIN_COUNT=0 ./build/threading_backend_test;
TVM_THREAD_POOL_SPIN_COUNT=1 ./build/threading_backend_test;
./build/threading_backend_test;

gives https://gist.github.com/ajtulloch/1805ca6cbaa27f5d442d23f9d0021ce6 (i.e.
97ms -> <1ms after this change)

ajtulloch · 2019-07-18T20:10:54Z

cc @yidawang, @eqy, @nhynes.

tqchen · 2019-07-18T22:48:09Z

Thanks @ajtulloch, please fix the CI error (it was due to a compiler warning).

yidawang

LGTM. Thanks @ajtulloch

src/runtime/thread_pool.cc

tqchen · 2019-07-22T16:58:14Z

ping @ajtulloch

ajtulloch · 2019-07-22T21:35:01Z

Sorry for the delay, will fix asap and update.

tqchen · 2019-07-23T03:36:30Z

Thanks @ajtulloch @yidawang @u99127!

yidawang approved these changes Jul 18, 2019

u99127 reviewed Jul 19, 2019

src/runtime/thread_pool.cc (review thread resolved)

ajtulloch force-pushed the threadpool-spin-fix branch 2 times, most recently from 9c53c96 to a5d84b4 Compare July 19, 2019 17:19

tqchen added the status: need update label Jul 21, 2019

ajtulloch force-pushed the threadpool-spin-fix branch from a5d84b4 to eab0cc2 Compare July 22, 2019 21:57

tqchen merged commit 9b1c2e0 into apache:master Jul 23, 2019

tqchen added the status: accepted label and removed the status: need update label Jul 23, 2019

tqchen mentioned this pull request Nov 8, 2019

[RELEASE][DRAFT] TVM v0.6 Release candidate #4259

Closed
