fix(1365): CPU bound instances don't spread on all CPU cores #1376

JT117 · 2024-01-23T08:58:48Z

Feature or Problem

Fix CPU-bound workloads so that they spread across all CPU cores.
As I understand it:
The change for the actor wraps the wasmtime module clone in a spawn, so the clone can happen on any core.
The change on the RPC bus handles each NATS event in a separate spawn, so the work spreads across all cores.
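
To illustrate the idea behind both changes, here is a minimal sketch that is not taken from the PR. It uses plain `std::thread` as a stand-in for tokio tasks so it runs without dependencies. The names `handle_event`, `handle_inline`, and `handle_spawned` are hypothetical. It contrasts handling every event inline on the caller with handing each event to its own unit of work, which the scheduler is then free to place on any core.

```rust
use std::collections::HashSet;
use std::thread::{self, ThreadId};

/// Hypothetical stand-in for a CPU-bound invocation.
fn handle_event(n: u64) -> u64 {
    (0..n).fold(0u64, |acc, x| acc.wrapping_add(x.wrapping_mul(x)))
}

/// Handles every event inline: all work stays on the calling thread,
/// so at most one core is busy at a time.
fn handle_inline(events: &[u64]) -> (Vec<u64>, HashSet<ThreadId>) {
    let mut threads = HashSet::new();
    let results: Vec<u64> = events
        .iter()
        .map(|&n| {
            threads.insert(thread::current().id());
            handle_event(n)
        })
        .collect();
    (results, threads)
}

/// Hands each event to its own thread (an analogy for one spawn per event),
/// so the OS scheduler can run them on different cores.
fn handle_spawned(events: &[u64]) -> (Vec<u64>, HashSet<ThreadId>) {
    let handles: Vec<_> = events
        .iter()
        .map(|&n| thread::spawn(move || (thread::current().id(), handle_event(n))))
        .collect();

    let mut threads = HashSet::new();
    let mut results = Vec::new();
    for handle in handles {
        let (id, result) = handle.join().expect("worker thread panicked");
        threads.insert(id);
        results.push(result);
    }
    (results, threads)
}
```

Calling both functions with the same slice of events returns the same results, but `handle_inline` reports a single thread ID while `handle_spawned` reports one per event. In the host itself the spawn would go through the tokio runtime rather than OS threads.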

Related Issues

#1365

Release Information

Instances will be able to use 100% of the CPU (all cores).

Testing

none

Unit Test(s)

none

Acceptance or Integration

none

Manual Verification

1/ Set the wasmCloud RPC timeout high enough to actually wait for the response:
export WASMCLOUD_RPC_TIMEOUT_MS=10000
2/ Start wasmCloud 0.81.0:
wash up
3/ Deploy the wadm file:
wash app deploy hello-world/wadm.yaml
4/ Benchmark with any kind of app and observe the CPU core loads.

…ats event. Also encapsulate the .clone on the wasmtime module. After these two modifications the workload spreads across all cores of the CPU. Relates to issue wasmCloud#1365 Signed-off-by: Julien Teruel <julien.teruel@gmail.com>

Signed-off-by: Julien Teruel <julien.teruel@gmail.com>

thomastaylor312

Just a few comments I noticed while looking at this. If we do choose to go down this route, I'd really prefer to have comments explaining why this works. I suspect it is because spawn forces the future to possibly run on a different worker queue, but as far as I could tell, await should do this too. We're using the multithread tokio runtime, and according to the docs, if a task is on the queue (which is what I assume happens when it yields with await) it should have the possibility of being stolen by another local queue.

~~Can you remind me, what operating system does this problem happen on and what are its specs?~~ Edit: found in issue

crates/host/src/wasmbus/mod.rs

crates/runtime/src/actor/module/mod.rs

thomastaylor312 · 2024-01-25T22:10:09Z

I still don't quite understand why this is happening because for_each_concurrent uses FuturesUnordered which as far as I can tell should yield properly and cause tokio to spread the work between work queues. However, I think that we might as well just get the fix in so I'd be ok with merging this once marked ready for review and comments are addressed!
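
One possible reading, offered as an assumption rather than a confirmed diagnosis for this PR: `for_each_concurrent` drives all of its futures from inside the single task that awaits it, and tokio's work stealing moves whole tasks between workers, not individual futures inside a task. The dependency-free sketch below is not the `futures` or tokio implementation. `YieldOnce`, `NoopWaker`, and `poll_all_in_one_task` are hypothetical names. It shows that futures multiplexed from one place are all polled on whichever thread is running that place, even though each of them yields.

```rust
use std::future::Future;
use std::pin::Pin;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};
use std::thread::{self, ThreadId};

/// Waker that does nothing; enough for a busy-polling demonstration.
struct NoopWaker;

impl Wake for NoopWaker {
    fn wake(self: Arc<Self>) {}
}

/// Future that returns Pending once, then completes with the ID of the
/// thread that polled it to completion.
struct YieldOnce {
    yielded: bool,
}

impl Future for YieldOnce {
    type Output = ThreadId;

    fn poll(mut self: Pin<&mut Self>, _cx: &mut Context<'_>) -> Poll<ThreadId> {
        if self.yielded {
            Poll::Ready(thread::current().id())
        } else {
            self.yielded = true;
            Poll::Pending
        }
    }
}

/// Polls `count` futures round-robin from a single loop, the way one task
/// multiplexing several futures would. Yielding interleaves the futures,
/// but it never moves any of them to another thread.
fn poll_all_in_one_task(count: usize) -> Vec<ThreadId> {
    let waker = Waker::from(Arc::new(NoopWaker));
    let mut cx = Context::from_waker(&waker);

    let mut pending: Vec<Pin<Box<YieldOnce>>> = (0..count)
        .map(|_| Box::pin(YieldOnce { yielded: false }))
        .collect();
    let mut finished = Vec::new();

    while !pending.is_empty() {
        pending.retain_mut(|fut| match fut.as_mut().poll(&mut cx) {
            Poll::Ready(id) => {
                finished.push(id);
                false // completed, drop it from the pending set
            }
            Poll::Pending => true, // keep polling on the next pass
        });
    }
    finished
}
```

Every ID returned by `poll_all_in_one_task` equals the caller's own thread ID. Under that reading, a separate spawn per event gives the runtime independent tasks that it can actually distribute across workers.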

crates/runtime/src/actor/module/mod.rs

thomastaylor312 · 2024-02-07T17:52:37Z

@JT117 Even though we aren't quite sure why this works, it does work. Would you mind addressing the comments, rebasing, and marking as ready for review? Then we can get this in to the next patch release

thomastaylor312

Thank you so much for adding the comments as well!

JT117 added 2 commits January 23, 2024 09:48

fix(1365): fmt

a355969

Signed-off-by: Julien Teruel <julien.teruel@gmail.com>

JT117 marked this pull request as ready for review January 24, 2024 10:04

JT117 requested a review from a team as a code owner January 24, 2024 10:04

JT117 marked this pull request as draft January 24, 2024 10:05

thomastaylor312 reviewed Jan 24, 2024

crates/host/src/wasmbus/mod.rs

crates/host/src/wasmbus/mod.rs Outdated

crates/runtime/src/actor/module/mod.rs Outdated

thomastaylor312 reviewed Jan 25, 2024

crates/runtime/src/actor/module/mod.rs Outdated

JT117 added 4 commits February 12, 2024 14:59

fix(1365): Add comments, remove useless future::ready

1fe2677

Signed-off-by: Julien Teruel <julien.teruel@gmail.com>

Merge branch 'wasmCloud:main' into main

b4396bc

Merge remote-tracking branch 'origin/main'

65d8329

fix(1365): fix clippy warning, added ; for consistency, return direct…

5e89f50

…ly the instance instead of wrapping the instance's components in a future Signed-off-by: Julien Teruel <julien.teruel@gmail.com>

JT117 marked this pull request as ready for review February 14, 2024 08:20

thomastaylor312 approved these changes Feb 15, 2024

thomastaylor312 enabled auto-merge (rebase) February 15, 2024 00:49

thomastaylor312 merged commit c6fa704 into wasmCloud:main Feb 15, 2024
25 checks passed

thomastaylor312 mentioned this pull request Feb 28, 2024

feat(runtime,host): Updates to request throughput #1598

Merged

thomastaylor312 mentioned this pull request Mar 25, 2024

[BUG] CPU bound instances don't spread on all CPU cores #1365

Closed

9 tasks