feat: EXC-1754 Evict sandboxes based on their priorities #1967

dragoljub-djuric · 2024-10-10T14:35:32Z

When a canister is executed, its Sandbox will most likely not be evicted from the cache, since we keep Sandboxes using LRU logic. At the same time, the Scheduler decreases the canister's priority, so it is less likely to be executed again soon. As a result, our caching of Sandbox processes is almost the worst possible.

Solution:
Propagate Scheduler priorities to the place where we evict Sandbox processes, and evict the Sandboxes with the lowest priority first. This change should decrease the number of cache misses.
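To illustrate the idea, here is a minimal sketch of priority-based eviction. It is not the code from this PR: the `Candidate` type, its fields, and the `pick_evictions` function are hypothetical, and the tie-break on least recently used is an assumption made for the example.

```rust
/// Hypothetical eviction candidate; the field names are illustrative only.
#[derive(Clone, Debug, PartialEq, Eq)]
struct Candidate {
    id: u64,
    /// Stand-in for the scheduler priority taken from the last snapshot.
    priority: i64,
    /// Logical timestamp of the last use (larger means more recent).
    last_used: u64,
}

/// Returns the candidates to evict so that at most `max_active` remain.
/// The lowest priority is evicted first; ties fall back to least recently used.
fn pick_evictions(mut candidates: Vec<Candidate>, max_active: usize) -> Vec<Candidate> {
    // Number of sandboxes above the allowed limit (zero if under the limit).
    let excess = candidates.len().saturating_sub(max_active);
    // Ascending order: lowest priority first, then oldest use first.
    candidates.sort_by_key(|c| (c.priority, c.last_used));
    // Keep only the first `excess` entries; these are the ones to evict.
    candidates.truncate(excess);
    candidates
}
```

With pure LRU, a canister that has just run is kept even though the Scheduler has just lowered its priority. In this sketch the same canister becomes the first eviction candidate.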

Link to the follow-up that minimizes the number of reads of the scheduler priorities.

Note:
The scheduler priorities used are from the previous round, because the snapshots are saved only at the end of the round and apply_scheduling_strategy() runs before the canisters are executed in the round. That should not influence the results much.

It remains to explore whether there is an easy way to move apply_scheduling_strategy() to after all canisters are executed in the round. In that case, the priorities taken from the last snapshot would be exactly the priorities for the current round.

…esses-works

adambratschikaye

Thanks! Just left some minor comments.

adambratschikaye · 2024-10-15T08:18:46Z

rs/canister_sandbox/src/replica_controller/sandbox_process_eviction.rs

+    let mut evicted = candidates;

-    for candidate in candidates.into_iter() {
+    for candidate in remaining_candidates.into_iter() {


I guess now we could be iterating through all the canisters in more cases (if we don't hit evict_at_most). I guess that's fine since the list is short and this only runs every 10 seconds or so - agree?

alexandru-uta · 2024-10-15

I think we might need to change the 10s logic to something smaller, but I'm not sure yet. Let's keep this under discussion.

adambratschikaye · 2024-10-15T08:21:28Z

rs/canister_sandbox/src/replica_controller/sandbox_process_eviction.rs

    use std::time::{Duration, Instant};

    use ic_test_utilities_types::ids::canister_test_id;
+    use ic_types::AccumulatedPriority;



Could we add some tests that actually depend on the new logic?

adambratschikaye · 2024-10-15T08:33:06Z

rs/canister_sandbox/src/replica_controller/sandboxed_execution_controller.rs

    min_active_sandboxes: usize,
    max_active_sandboxes: usize,
    max_sandbox_idle_time: Duration,
+    state_reader: Arc<dyn StateReader<State = ReplicatedState>>,


I think this argument could just be a reference instead of an owned Arc. Then you won't need to Arc::clone each time you call it.
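A small sketch of the difference, using a hypothetical `PriorityReader` trait in place of `StateReader` (the trait, `FakeReader`, and both functions are made up for illustration and are not part of the codebase):

```rust
use std::sync::Arc;

/// Hypothetical stand-in for the state reader trait.
trait PriorityReader {
    fn latest_height(&self) -> u64;
}

struct FakeReader(u64);

impl PriorityReader for FakeReader {
    fn latest_height(&self) -> u64 {
        self.0
    }
}

/// Takes ownership of an `Arc`, so each call site needs an `Arc::clone`.
fn evict_with_owned(reader: Arc<dyn PriorityReader>) -> u64 {
    reader.latest_height()
}

/// Borrows the reader, so the caller passes `&*arc` with no clone.
fn evict_with_borrowed(reader: &dyn PriorityReader) -> u64 {
    reader.latest_height()
}
```

A caller that holds `reader: Arc<dyn PriorityReader>` would call `evict_with_borrowed(&*reader)`, which leaves the reference count untouched.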

adambratschikaye · 2024-10-15T08:35:18Z

rs/interfaces/state_manager/mocks/src/lib.rs


        fn get_latest_state(&self) -> Labeled<Arc<ReplicatedState>>;

+


accidental newline?

This change applies charges for each fully executed canister. The total amount of points charged is evenly distributed across canisters, but it is not included in the compute capacity used to calculate long/new execution cores.


.

a80be23

github-actions bot added the feat label Oct 10, 2024

dragoljub-djuric added 9 commits October 14, 2024 10:27

.

d7d5a7c

.

f9ecc70

.

258b147

.

ef7d757

.

6b43608

.

8bf2550

.

ae627ed

Fix Cargo files.

51b0005

.

f9d590b

dragoljub-djuric mentioned this pull request Oct 14, 2024

feat: EXC-1754 Optimized number of reads of snapshot in evic_sandbox_processes #2028

Closed

Merge branch 'master' into EXC-1754-change-the-way-evict-sandbox-proc…

ab903e7

…esses-works

adambratschikaye approved these changes Oct 15, 2024


dragoljub-djuric changed the base branch from master to dimitris/scheduler-changes-sandbox-count October 15, 2024 13:46

dragoljub-djuric changed the base branch from dimitris/scheduler-changes-sandbox-count to master October 15, 2024 13:54

dragoljub-djuric changed the base branch from master to dimitris/scheduler-changes-sandbox-count October 15, 2024 13:54

dragoljub-djuric changed the base branch from dimitris/scheduler-changes-sandbox-count to master October 15, 2024 13:55

dragoljub-djuric changed the base branch from master to dimitris/scheduler-changes-sandbox-count October 15, 2024 13:56

dragoljub-djuric changed the base branch from dimitris/scheduler-changes-sandbox-count to master October 15, 2024 13:56

berestovskyy and others added 10 commits October 15, 2024 14:01

feat: Increase max sandbox count

6f26834

feat: EXC-1751: Charge idle canisters for full execution (#1806)

e507149

The idle canisters in front of the round schedule should be marked as fully executed, as they were scheduled first in the round. This helps to rotate the round schedule faster.

Fix the eviction policy.

673c0f0

.

bf5ed3b

Revert "feat: EXC-1751: Charge idle canisters for full execution (#1806…

f67ab10

…)" This reverts commit e507149.

Revert "feat: Increase max sandbox count"

494d50c

This reverts commit 6f26834.

Revert "feat: EXC-1735: Charge canisters for full execution (#1782)"

bbd3d68

This reverts commit a753775.

feat: Increase max sandbox count

3a2ff2b

dragoljub-djuric added 4 commits October 28, 2024 14:50

.

52c99e9

Merge branch 'master' into EXC-1754-change-the-way-evict-sandbox-proc…

03befc3

…esses-works

.

359a1b8

.

935e5ad

dragoljub-djuric changed the base branch from master to Add_new_dashboards_to_testnet October 30, 2024 14:16

dragoljub-djuric added 3 commits October 31, 2024 14:55

.

81e4ea5

.

b66cc12

.

576c753

Base automatically changed from Add_new_dashboards_to_testnet to master October 31, 2024 15:58

Merge branch 'master' into EXC-1754-change-the-way-evict-sandbox-proc…

b072308

…esses-works

berestovskyy changed the title ~~feat: EXC-1754 Change the way evict_sandbox_processes works~~ feat: EXC-1754 Evict sandboxes based on their priorities Oct 31, 2024

berestovskyy added 2 commits November 1, 2024 10:02

chore: EXC-1749: Consolidate scheduling logic

3dbea6b

No functional changes, just moving functions in one place.

chore: EXC-1754: Apply priority credit at the end of the round

9142bae

This change is functionally equivalent but necessary for the upcoming priority-based eviction.

dragoljub-djuric changed the base branch from master to andriy/exc-1754-apply-priority-at-the-end November 1, 2024 10:03

Merge branch 'andriy/exc-1754-apply-priority-at-the-end' into EXC-175…

6e0db40

…4-change-the-way-evict-sandbox-processes-works

berestovskyy force-pushed the andriy/exc-1754-apply-priority-at-the-end branch from 9142bae to ab95547 Compare November 1, 2024 18:47

Base automatically changed from andriy/exc-1754-apply-priority-at-the-end to master November 6, 2024 18:25

fix: EXC-1787: Fix scheduler AP divergence

4457883

dragoljub-djuric changed the base branch from master to andriy/exc-1787-scheduler-divergence-debug November 12, 2024 09:16

berestovskyy added 3 commits November 12, 2024 16:46

Reproduce scheduler unfairness for short executions

ee46cc9

Try out Alin's suggestion

212ba5c

Revert "Try out Alin's suggestion"

48303be

This reverts commit 212ba5c.

dragoljub-djuric changed the base branch from andriy/exc-1787-scheduler-divergence-debug to master November 12, 2024 21:01

dragoljub-djuric added 3 commits November 12, 2024 21:07

Revert "chore: EXC-1754: Apply priority credit at the end of the round"

8a6acc9

This reverts commit 9142bae.

Revert "chore: EXC-1749: Consolidate scheduling logic"

cb801e7

This reverts commit 3dbea6b.

Merge branch 'master' into EXC-1754-change-the-way-evict-sandbox-proc…

92aca25

…esses-works

dragoljub-djuric changed the base branch from master to andriy/exc-1787-scheduler-divergence-debug November 12, 2024 21:13

Merge branch 'andriy/exc-1787-scheduler-divergence-debug' into EXC-17…

8eb2330

…54-change-the-way-evict-sandbox-processes-works

dragoljub-djuric changed the base branch from andriy/exc-1787-scheduler-divergence-debug to master November 12, 2024 21:16

dragoljub-djuric closed this Nov 12, 2024
