
[V4] Busy pool (dev version)#91

Merged
ConorWilliams merged 32 commits into modules from
v4-busy-pool
Apr 9, 2026
Conversation

@ConorWilliams
Owner

@ConorWilliams ConorWilliams commented Apr 6, 2026

A stripped-back busy pool, just for dev (missing the lock-free submission queue).

Summary by CodeRabbit

Release Notes

  • New Features

    • Added busy_thread_pool scheduler for multi-threaded task execution with configurable worker threads.
    • Added new context accessor methods for internal state access.
    • Added steal_overflow_error exception, raised when the steal-count threshold is exceeded.
    • Added new execute() overload supporting work stealing operations.
  • Improvements

    • Enhanced error output to include thread ID for better debugging.
  • Breaking Changes

    • Removed allocator_type typedef from geometric_stack.

@coderabbitai

coderabbitai bot commented Apr 6, 2026

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 7b9717f2-a871-4023-a4ca-78f53c3f7e65

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.



@ConorWilliams ConorWilliams changed the base branch from main to modules April 6, 2026 15:30
@ConorWilliams ConorWilliams force-pushed the v4-busy-pool branch 2 times, most recently from e8ad614 to 61d87f8 on April 6, 2026 15:43

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 1

🧹 Nitpick comments (4)
src/batteries/contexts.cxx (1)

27-28: Consider adding [[nodiscard]] for API consistency.

The new get_underlying() accessors return references that are typically used immediately. For consistency with other accessors in the codebase (e.g., adapt_deque::thief() and adapt_deque::empty() added in this PR), consider marking these [[nodiscard]].

♻️ Proposed fix
+  [[nodiscard]]
   constexpr auto get_underlying() noexcept -> Adaptor<context_type> & { return m_container; }

Apply similarly to both derived_poly_context and mono_context.

Also applies to: 61-62

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/batteries/contexts.cxx` around lines 27 - 28, Add the [[nodiscard]]
attribute to the get_underlying() accessors to enforce return-value use and
match existing API style; specifically update the Adaptor<context_type> &
get_underlying() noexcept methods in both derived_poly_context and mono_context
so they are declared as [[nodiscard]] constexpr auto get_underlying() noexcept
-> Adaptor<context_type>&, and apply the same change to the corresponding
const/non-const overloads if present.
test/src/schedule.cpp (1)

71-76: Consider moving STATIC_REQUIRE outside the loop.

The STATIC_REQUIRE(lf::scheduler<TestType>) is a compile-time check that doesn't vary with thr. Placing it before the loop would make the intent clearer and avoid redundant runtime assertions.

 TEMPLATE_TEST_CASE("Busy schedule", "[schedule]", mono_busy_thread_pool, poly_busy_thread_pool) {
+  STATIC_REQUIRE(lf::scheduler<TestType>);
   for (std::size_t thr = 1; thr < 4; ++thr) {
     TestType scheduler{thr};
-    STATIC_REQUIRE(lf::scheduler<TestType>);
     simple_tests(scheduler);
   }
 }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/src/schedule.cpp` around lines 71 - 76, Move the compile-time assertion
STATIC_REQUIRE(lf::scheduler<TestType>) out of the runtime loop in the
TEMPLATE_TEST_CASE: the check depends only on TestType, not thr, so place
STATIC_REQUIRE(lf::scheduler<TestType>) immediately after the TEMPLATE_TEST_CASE
declaration (before the for loop) and keep the for loop creating TestType
scheduler{thr} and calling simple_tests(scheduler) as-is.
src/schedulers/busy.cxx (2)

106-108: After successful steal, continue iterates the inner for-loop.

The continue statement continues the steal-attempt for loop rather than breaking out to the while loop. This means after a successful steal-and-execute, the worker will try more steals (up to k_steal_attempts total) before re-checking posted tasks.

If the intent is to prioritize posted tasks after each execution, consider using a labeled break or restructuring:

       if (auto [err, result] = m_contexts[victim].get_underlying().thief().steal()) {
         execute(static_cast<context_type &>(ctx), result);
-        continue;
+        break; // Re-check posted tasks after successful execution
       }

If the current behavior is intentional (batch steals before checking posted), a brief comment would clarify this design choice.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/schedulers/busy.cxx` around lines 106 - 108, After a successful
steal-and-execute the current code uses "continue" which keeps iterating the
inner for-loop (the steal attempts) instead of returning to the outer while loop
to re-check posted tasks; change the control flow so a successful steal breaks
out of the steal-attempt for-loop (replace the "continue" after
execute(static_cast<context_type &>(ctx), result); with a break) so posted tasks
are prioritized after each execution, or alternatively add a clear comment at
the steal site (m_contexts[victim].get_underlying().thief().steal()) documenting
that repeated steals per iteration are intentional if you want to keep current
batching behavior.

77-78: Unsigned underflow when n == 1 creates wasteful distribution.

When n == 1, the expression n - 2 underflows to SIZE_MAX, creating dist(0, SIZE_MAX). While the if (n > 1) guard on line 94 prevents usage, the distribution is still constructed with a huge range unnecessarily.

Consider guarding the construction or using a conditional initialization:

-    std::uniform_int_distribution<std::size_t> dist(0, n - 2);
+    std::uniform_int_distribution<std::size_t> dist(0, n > 1 ? n - 2 : 0);

Or move the distribution inside the if (n > 1) block if you want to avoid the object entirely for single-threaded pools.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/schedulers/busy.cxx` around lines 77 - 78, The uniform distribution dist
is constructed with upper bound n - 2 which underflows when n == 1; avoid
creating dist with a huge range by only constructing
std::uniform_int_distribution<std::size_t> dist(0, n - 2) when n > 1 (move its
construction inside the existing if (n > 1) block) or use a conditional
initialization that selects a no-op/degenerate distribution for n <= 1; adjust
code near std::default_random_engine rng(safe_cast<unsigned>(id + 1)) to ensure
rng remains available while dist is only created when needed.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/core/execute.cxx`:
- Around line 55-77: The non-atomic access to frame->steals in execute(Context&,
steal_handle<Context>) introduces a data race; change the steals field in
frame_type<checkpoint_t<Context>> to use the same pattern as joins/exception_bit
(apply ATOMIC_ALIGN and store a std::atomic or use atomic_ref wrapper) and
update all accesses: in execute() replace the raw increment/check with atomic
operations (load, compare to k_u16_max, fetch_add) and in
join_awaitable::await_ready()/await_resume() use atomic loads; ensure the
overflow check uses an atomic-safe compare (e.g., load then conditional
fetch_add or fetch_add with an overflow guard) so all reads/writes of steals are
synchronized.


ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 87fa02ae-cc3d-4662-a943-aadeea267804

📥 Commits

Reviewing files that changed from the base of the PR and between c02b337 and ea3cd93.

📒 Files selected for processing (12)
  • CMakeLists.txt
  • benchmark/src/libfork_benchmark/fib/libfork.cpp
  • src/batteries/adaptor_stack.cxx
  • src/batteries/adaptors.cxx
  • src/batteries/contexts.cxx
  • src/batteries/geometric_stack.cxx
  • src/core/execute.cxx
  • src/core/promise.cxx
  • src/exception.cpp
  • src/schedulers/busy.cxx
  • src/schedulers/schedulers.cxx
  • test/src/schedule.cpp

@ConorWilliams ConorWilliams merged commit 40d22b2 into modules Apr 9, 2026
7 of 10 checks passed
