
Adding flags to scheduler allowing to control thread stealing and idle back-off #3768

Merged: 8 commits merged into master on Apr 8, 2019

Conversation

@hkaiser (Member) commented Apr 3, 2019

This adds a couple of things:

  • added a mark_end_of_scheduling hook to the execution parameters, allowing a callback to be invoked after the scheduling phase of the parallel algorithm
  • added invocation of the mark_begin/mark_end scheduling parameters to the sequential for_each and for_loop
  • added flags to the scheduling loop that allow controlling thread stealing and idle back-off (effective only for schedulers that support it)
  • changed the implementation of static_priority_scheduler to fully rely on local_priority_scheduler, except that it starts off with stealing disabled
  • made the fast_idle_mode scheduler flag more useful: it now speeds up idle back-off in case of unsuccessful thread stealing (no work available through thread stealing)
  • set HPX_WITH_THREAD_MANAGER_IDLE_BACKOFF=ON by default (it can now be disabled at runtime)
  • added add_scheduler_mode/remove_scheduler_mode/add_remove_scheduler_mode APIs, allowing the scheduler mode to be controlled dynamically
  • flyby: removed unused arguments from get_next_thread, added an argument controlling enable_stealing for wait_or_add_new
  • modified the for_each_scaling test to not measure data initialization; also added the command line options --disable_stealing and --fast_idle_mode
  • duplicated the scheduler mode across all cores (with cache alignment) to reduce false sharing

This relies on #3745 being merged first; it subsumes #3745.

Fixes #3744
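The add_scheduler_mode/remove_scheduler_mode/add_remove_scheduler_mode APIs mentioned above amount to atomically setting and clearing bits in a scheduler-mode bitmask. The following is only an illustrative sketch of those semantics, not the actual HPX implementation; the flag names and values are assumptions modeled on the PR description:

```cpp
#include <atomic>
#include <cstdint>

// Hypothetical mirror of a scheduler-mode bitmask; values are illustrative.
enum scheduler_mode : std::uint32_t
{
    nothing_special = 0x0,
    enable_stealing = 0x1,
    enable_idle_backoff = 0x2,
    fast_idle_mode = 0x4
};

struct scheduler
{
    std::atomic<std::uint32_t> mode_{nothing_special};

    // set the given flag(s) without disturbing the others
    void add_scheduler_mode(scheduler_mode m)
    {
        mode_.fetch_or(m, std::memory_order_release);
    }

    // clear the given flag(s)
    void remove_scheduler_mode(scheduler_mode m)
    {
        mode_.fetch_and(~std::uint32_t(m), std::memory_order_release);
    }

    // set one set of flags and clear another in a single atomic update
    void add_remove_scheduler_mode(
        scheduler_mode to_add, scheduler_mode to_remove)
    {
        std::uint32_t expected = mode_.load(std::memory_order_relaxed);
        std::uint32_t desired;
        do
        {
            desired = (expected | to_add) & ~std::uint32_t(to_remove);
        } while (!mode_.compare_exchange_weak(expected, desired));
    }

    bool has_scheduler_mode(scheduler_mode m) const
    {
        return (mode_.load(std::memory_order_acquire) & m) != 0;
    }
};
```

The combined add_remove variant uses a compare-exchange loop so that concurrent readers never observe a state where one half of the update has been applied but not the other.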

hkaiser added some commits Mar 17, 2019

Introduced cache_aligned_data and cache_line_data helper structures
- reduce false sharing in local priority scheduler and thread_queue
- reduce false sharing in latch and condition_variable
- disable stealing counters by default

More false sharing reductions
- disable exponential idle backoff by default
- simplify wait_or_add_new functionality

Fixing local_latch test to conform to preconditions required by latch::reset
- flyby: add warning suppression for MSVC

Making wait-count for exponential backoff core-specific
- flyby: don't use alignment with NVCC
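The cache_aligned_data helper named in the first commit pads per-core data out to a full cache line so that neighboring cores' counters never share a line. This is a minimal sketch of the idea only, not HPX's actual implementation; the 64-byte line size is an assumption (it is platform dependent):

```cpp
#include <cstddef>

// Assumed cache line size; real code should query the platform
// (e.g. std::hardware_destructive_interference_size in C++17).
constexpr std::size_t cache_line_size = 64;

// Illustrative stand-in for HPX's cache_aligned_data: alignas forces both
// the alignment and (via padding) the size up to a full cache line, so two
// adjacent elements in an array can never share a line.
template <typename T>
struct alignas(cache_line_size) cache_aligned_data
{
    T data_{};
};

static_assert(alignof(cache_aligned_data<int>) == cache_line_size,
    "element must start on a cache line boundary");
static_assert(sizeof(cache_aligned_data<int>) == cache_line_size,
    "element must occupy a full cache line");
```

An array of cache_aligned_data<counter>, indexed by core number, is the pattern the PR uses to duplicate the scheduler mode across cores.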
```diff
@@ -822,10 +846,11 @@ namespace hpx { namespace threads { namespace detail
             // if nothing else has to be done either wait or terminate
             else
             {
-                ++idle_loop_count;
+                --idle_loop_count;
```

@msimberg (Contributor) commented Apr 4, 2019

Does this need to be reversed? Just thinking about naming. Right now it's more like idle_loop_countdown ;) and it's different from the other counts that actually grow.

@hkaiser (Author, Member) commented Apr 4, 2019

I reversed this as it helped me not to carry around the end value to the point where it needed to be reset.
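The point of the reversal is that a countdown resets by reassigning the starting budget, so the terminal value (zero) never has to be carried to the reset site. A small sketch with hypothetical names, not the actual scheduler code:

```cpp
#include <cstdint>

// Illustrative countdown-style idle counter (names hypothetical): start
// from a configurable budget; reset() only needs the budget, never the
// "end value", which is always zero by construction.
struct idle_backoff
{
    std::int64_t max_idle_loop_count;
    std::int64_t idle_loop_count;

    explicit idle_backoff(std::int64_t budget)
      : max_idle_loop_count(budget)
      , idle_loop_count(budget)
    {
    }

    // returns true while the core should keep spinning before backing off
    bool tick()
    {
        return --idle_loop_count > 0;
    }

    // called e.g. after work was found or successfully stolen
    void reset()
    {
        idle_loop_count = max_idle_loop_count;
    }
};
```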

```diff
@@ -247,7 +247,7 @@ namespace hpx { namespace threads { namespace policies
         num_pending_misses += high_priority_queues_[num_thread].data_->
             get_num_pending_misses(reset);
     }
-    if (num_thread == 0)
+    if (num_thread == num_queues_-1)
```

@msimberg (Contributor) commented Apr 4, 2019

Does this make a difference?

@hkaiser (Author, Member) commented Apr 4, 2019

There was some inconsistency in that we were accessing the low-priority queue from different cores: sometimes it was core zero, sometimes the last one. I made it consistent to reduce false sharing on that queue.

@hkaiser hkaiser force-pushed the control_thread_stealing branch 2 times, most recently from 044e0e9 to b1b9e7a Apr 4, 2019

@hkaiser hkaiser marked this pull request as ready for review Apr 5, 2019

@hkaiser (Member, Author) commented Apr 5, 2019

@msimberg, @biddisco: this is ready to go from my end, please review.

Adding flags to scheduler allowing to control thread stealing and idle back-off
@hkaiser hkaiser force-pushed the control_thread_stealing branch 2 times, most recently from ba55402 to 01b43d2 Apr 6, 2019

Moved scheduler_base implementation to source file
- flyby: remove obsolete periodic_maintenance

@hkaiser hkaiser force-pushed the control_thread_stealing branch from 01b43d2 to 89b45d3 Apr 6, 2019

@hkaiser hkaiser merged commit 825c3ec into master Apr 8, 2019

11 of 16 checks passed

Failed:
- pycicle daint-clang-3.8-boost-1.58.0-c++11-Release: Test errors 628
- pycicle daint-gcc-7.3.0-boost-1.68.0-c++17-Release: Build errors 11
- pycicle daint-gcc-7.3.0-boost-1.68.0-c++17-Release: Test errors 1
- pycicle daint-gcc-7.3.0-cuda-9.2.148_3.19-6.0.7.1_2.1__g3d9acc8-boost-1.68.0-c++11-Release: Build errors 48
- pycicle daint-gcc-7.3.0-cuda-9.2.148_3.19-6.0.7.1_2.1__g3d9acc8-boost-1.68.0-c++11-Release: Test errors 8

Passed:
- build-and-test: Workflow build-and-test
- pycicle daint-clang-3.8-boost-1.58.0-c++11-Release: Build errors 0, Config errors 0
- pycicle daint-clang-7.0-boost-1.68.0-c++17-nonetworking-Debug: Build errors 0, Config errors 0, Test errors 0
- pycicle daint-gcc-4.9.3-boost-1.58.0-c++11-Debug: Build errors 0, Config errors 0, Test errors 0
- pycicle daint-gcc-7.3.0-boost-1.68.0-c++17-Release: Config errors 0
- pycicle daint-gcc-7.3.0-cuda-9.2.148_3.19-6.0.7.1_2.1__g3d9acc8-boost-1.68.0-c++11-Release: Config errors 0

@hkaiser hkaiser deleted the control_thread_stealing branch Apr 8, 2019

@msimberg (Contributor) commented Apr 10, 2019

What was the reason the shared_priority_queue_scheduler was timing out?

@hkaiser (Member, Author) commented Apr 10, 2019

@msimberg the wait_or_add_new was not properly setting its return value, forcing the scheduler loop to wait forever for it to signal its termination status; see for instance here: https://github.com/STEllAR-GROUP/hpx/pull/3768/files#diff-c730f66e3d702c09217b1bbcc6b570f2R1317.
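The failure mode described here can be sketched in a few lines: the scheduling loop's exit condition is the boolean reported by wait_or_add_new, so a scheduler that never sets it to true leaves the loop waiting for termination indefinitely. Everything below is a hypothetical skeleton with invented names, not HPX's actual scheduling loop:

```cpp
#include <functional>

// Hypothetical skeleton of the dependency described above. The callback
// stands in for a scheduler's wait_or_add_new; returning true means
// "nothing left to schedule, termination may proceed". The spin count is
// bounded here only so the sketch itself cannot hang; the real loop waits
// indefinitely, which is the observed timeout.
bool scheduling_loop(
    std::function<bool(bool /*enable_stealing*/)> wait_or_add_new,
    bool enable_stealing)
{
    for (int spins = 0; spins < 1000; ++spins)
    {
        if (wait_or_add_new(enable_stealing))
            return true;    // termination status was signaled; exit cleanly
    }
    return false;    // models the hang: the flag was never set
}
```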
