Change scheduler to only store runnable tasks on run queue #1042
Conversation
There are no guarantees about the value of the interrupt flag when context switching. If the context switch is voluntary, i.e. a thread called `schedule`, interrupts will most likely be enabled, whereas if a thread is preempted, interrupts will be disabled. But this means that if a preempted thread A switches to a thread B that voluntarily yielded, thread B will return from the call to `schedule` with interrupts disabled. The AArch64 code also needs to be modified but I'll leave that to @NathanRoyer. Signed-off-by: Klimenty Tsoutsman <klim@tsoutsman.com>
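Purely for illustration, here is a minimal sketch of the interleaving described above; the helper names (`interrupts_enabled`, `schedule`) are placeholders, not the exact Theseus APIs:

```rust
// Placeholders standing in for the real interrupt-flag and scheduler APIs.
fn interrupts_enabled() -> bool { true }
fn schedule() { /* switch to another task; eventually this call returns */ }

fn thread_b() {
    // Thread B yields voluntarily; interrupts are enabled at this point.
    assert!(interrupts_enabled());
    schedule();
    // Later, thread A is preempted (the interrupt handler runs with interrupts
    // disabled) and the scheduler switches back to thread B right here.
    // Unless the interrupt flag is saved and restored across the context
    // switch, B resumes with interrupts disabled even though it yielded with
    // them enabled.
    assert!(interrupts_enabled()); // can fail in the real kernel without such handling
}
```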
…to idle-task-in-cpu-local
The new test is significantly more robust than the old one. As of right now, the test isn't particularly useful because we don't have task migration, but theseus-os#1042 adds implicit task migration when unblocking a task. Hence, the test has a focus on blocking and unblocking tasks. Signed-off-by: Klimenty Tsoutsman <klim@tsoutsman.com>
…n-queue Signed-off-by: Klimenty Tsoutsman <klim@tsoutsman.com>
When spawning a pinned task, `spawn` didn't previously set `inner.pinned_cpu`. This created problems in theseus-os#1042 because the scheduler didn't know that tasks were pinned and freely migrated them across cores. Signed-off-by: Klimenty Tsoutsman <klim@tsoutsman.com>
…ueue Signed-off-by: Klimenty Tsoutsman <klim@tsoutsman.com>
* When spawning a pinned task, `spawn` didn't previously set `inner.pinned_cpu` for the newly-created `Task`.
* This is not currently a problem because the scheduler doesn't perform task migration across CPUs, but when that gets enabled (in #1042), it would cause the pinning choice to be ignored by the scheduler.

Signed-off-by: Klimenty Tsoutsman <klim@tsoutsman.com> cae8ca8
* The new `test_scheduler` is significantly more robust than the old one. Currently, the test isn't particularly useful because we don't have task migration enabled, but #1042 will add implicit task migration when unblocking a task.
* Hence, the test currently focuses on blocking/unblocking tasks.
* Add a function to iterate over all initialized CPUs.

Signed-off-by: Klimenty Tsoutsman <klim@tsoutsman.com>
All PR dependencies have been merged in. 👍
left a few questions. I'm a bit concerned the overhead associated with some changes in this PR is actually counterproductive for overall performance, but most of it is solvable.
```rust
let ExposedTask { task: mut new_task } = exposed;
```
first line seems good, second line seems wrong (extra whitespace for no reason?)
kernel/spawn/src/lib.rs
Outdated
```rust
} else {
    task::scheduler::add_task(task_ref.clone());
}
if !self.idle & !self.blocked {
```
i think you meant &&
```rust
pub fn block(&self) -> Result<RunState, RunState> {
    use RunState::{Blocked, Runnable};

    let run_state = self.0.task.runstate();
```
Unless I'm missing something, I believe this (and the same logic in `unblock()`) simply obtains a copy of the task's current `RunState` and then modifies that copy; it doesn't actually modify the `RunState` of the task itself.
I think a good way to avoid this is to split up the functions (see the sketch below):
- In `task_struct::Task::block()`, only the runstate is modified. Basically, this function as it was.
  - The functions in `task_struct::Task` may need to be renamed.
- In `task::TaskRef::block()`, it calls `task_struct::Task::block()` and then modifies the runqueue accordingly. `unblock()` is similar.
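Roughly like this, where every type and signature is a simplified stand-in for illustration rather than the real Theseus code:

```rust
use core::sync::atomic::{AtomicBool, Ordering};

#[derive(Clone, Copy, PartialEq, Eq, Debug)]
enum RunState { Runnable, Blocked }

// Simplified stand-in for `task_struct::Task`: the runstate is just a bool here.
struct Task {
    runnable: AtomicBool, // true = Runnable, false = Blocked
}

impl Task {
    /// `task_struct::Task::block()`: only flips the runstate, as before.
    fn block(&self) -> Result<RunState, RunState> {
        match self.runnable.compare_exchange(true, false, Ordering::AcqRel, Ordering::Acquire) {
            Ok(_) => Ok(RunState::Runnable),
            Err(_) => Err(RunState::Blocked),
        }
    }
}

// Simplified stand-in for `task::TaskRef`.
struct TaskRef {
    task: Task,
}

impl TaskRef {
    /// `task::TaskRef::block()`: flips the runstate, then updates the runqueue.
    fn block(&self) -> Result<RunState, RunState> {
        let prev = self.task.block()?;
        // Here the runqueue would be updated (or the blocked task could be
        // dropped lazily by the scheduler policy, as discussed below).
        Ok(prev)
    }
}
```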
```rust
    Ok(Runnable)
} else if run_state.compare_exchange(Blocked, Blocked).is_ok() {
    // warn!("Blocked an already blocked task: {:?}", self);
    Ok(Blocked)
```
It looks like blocked tasks are not proactively removed from the runqueue they're on; rather, removal is done lazily by each scheduler policy.
This is fine, as it's much faster, but it should be clearly documented somewhere (i.e., in the docs for the `Scheduler` trait's `next()` function) because it's a non-symmetric pair of operations. Otherwise, a scheduler policy implementor wouldn't know that it's okay to simply remove a task from the runqueue when it comes across one that is blocked.
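To make that documentation concrete, the `next()` docs could describe behavior like the following sketch (the types here are made-up stand-ins; the real `Scheduler` trait isn't shown):

```rust
use std::collections::VecDeque;

// Illustrative stand-ins for the real Theseus types.
#[derive(Clone)]
struct TaskRef {
    runnable: bool,
}

struct RoundRobinQueue {
    queue: VecDeque<TaskRef>,
}

impl RoundRobinQueue {
    // What a policy's `next()` does under lazy removal: a blocked task found
    // on the run queue was blocked *after* being enqueued, so it is dropped
    // here instead of having been removed eagerly by `block()`.
    fn next(&mut self) -> Option<TaskRef> {
        while let Some(task) = self.queue.pop_front() {
            if task.runnable {
                // Rotate the chosen task to the back, then run it.
                self.queue.push_back(task.clone());
                return Some(task);
            }
            // Blocked: silently drop it; `unblock()` re-adds it later.
        }
        None
    }
}
```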
```rust
let locked = SCHEDULERS.lock();

let mut min_busyness = usize::MAX;
let mut least_busy_index = None;

for (i, (_, scheduler)) in locked.iter().enumerate() {
    let busyness = scheduler.lock().busyness();
    if busyness < min_busyness {
        least_busy_index = Some(i);
        min_busyness = busyness;
    }
}
```
This function is incredibly slow. In general, I don't think we should ever really access `SCHEDULERS` except when initializing or changing scheduler policies, though we can think about that more later (and how to improve this, e.g., via a separate estimated utilization metric or something that avoids the need to iterate over all schedulers).

It's likely that using this function every time a task is unblocked is going to absolutely destroy performance... right? Also I imagine it will cause lots and lots of task migrations, which is generally something you want to keep to a minimum unless a given CPU core is at max utilization for a "long" time (which we currently don't track a metric for).
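One direction that would avoid the iteration entirely, sketched purely as an assumption (none of these names exist in the PR): keep a per-CPU busyness estimate in a flat array of atomics that each scheduler updates, so picking the least-busy CPU never touches the `SCHEDULERS` lock.

```rust
use core::sync::atomic::{AtomicUsize, Ordering};

const MAX_CPUS: usize = 64; // illustrative upper bound
const ZERO: AtomicUsize = AtomicUsize::new(0);

/// Hypothetical per-CPU busyness estimates, bumped by each scheduler as tasks
/// are added and removed; readers never take the `SCHEDULERS` lock.
static BUSYNESS: [AtomicUsize; MAX_CPUS] = [ZERO; MAX_CPUS];

/// Pick the least-busy CPU among the first `num_cpus` entries.
fn least_busy_cpu(num_cpus: usize) -> Option<usize> {
    (0..num_cpus.min(MAX_CPUS)).min_by_key(|&i| BUSYNESS[i].load(Ordering::Relaxed))
}
```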
```rust
while let Some(task) = self.out_of_tokens.pop() {
    self.have_tokens.push_back(task);
```
nit: Is there a way to do this in one batch rather than item-by-item? I guess not when we're using `VecDeque` for one list and `Vec` for the other...?
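For what it's worth, it can at least be written as a single call, assuming the fields are a `Vec` and a `VecDeque` as the snippet suggests; note that `extend` still moves elements one at a time internally, and it preserves the `Vec`'s order, whereas popping from the end reverses it. A small standalone sketch with assumed field names and element type:

```rust
use std::collections::VecDeque;

struct TokenQueues {
    // Field names and element type are assumptions based on the snippet above.
    have_tokens: VecDeque<u64>,
    out_of_tokens: Vec<u64>,
}

impl TokenQueues {
    fn refill(&mut self) {
        // One call instead of the pop/push loop; this preserves the Vec's
        // order, whereas popping from the end reverses it.
        self.have_tokens.extend(self.out_of_tokens.drain(..));
    }
}
```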
```rust
// This check prevents an interleaving where `TaskRef::unblock` wouldn't add
// the task back onto the run queue. `TaskRef::unblock` sets the run state and
// then checks `is_on_run_queue` so we have to do the inverse.
if unlikely(task.task.is_runnable()) {
    if let Some(task) = self.add_epoch_task(task) {
        return Some(task);
    }
}
```
So, IIUC, the crux of the issue surrounding this `is_on_run_queue()` atomic boolean is that we need to do two things in one "atomic" step:
- change the task's runstate to `Runnable`
- add the task back to a runqueue

Since we cannot do that, you're using a separate atomic bool to indicate whether a task is on a runqueue. I understand this, but I don't love this design. It's complex without bringing any other benefits, and kind of a messy design choice, as it makes both current and future scheduler policies harder to write.

Ideally this shouldn't be a necessary step. Can we avoid this complexity via an alternative design that's a bit cleaner and doesn't burden the `Scheduler` policy implementor with additional concerns?

Side note: if we actually wanted to store some piece of state inside a task about whether that task is on a runqueue, we might as well store a reference to (or the ID of) the actual runqueue that it's currently on. This would at least make it faster to add the task back once it gets unblocked, which addresses my other comment as a plus.
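To make that side note concrete, here is a rough sketch in which the task records the ID of the runqueue (CPU) it currently sits on; all names and the encoding are assumptions for illustration:

```rust
use core::sync::atomic::{AtomicU32, Ordering};

/// Sentinel meaning "not on any runqueue" (illustrative encoding).
const NO_RUNQUEUE: u32 = u32::MAX;

struct TaskInner {
    /// The CPU whose runqueue currently holds this task, or `NO_RUNQUEUE`.
    on_runqueue: AtomicU32,
}

impl TaskInner {
    /// Record which CPU's runqueue this task was just added to.
    fn set_runqueue(&self, cpu: u32) {
        self.on_runqueue.store(cpu, Ordering::Release);
    }

    /// Clear the record when the task is removed from a runqueue.
    fn clear_runqueue(&self) {
        self.on_runqueue.store(NO_RUNQUEUE, Ordering::Release);
    }

    /// The CPU whose runqueue holds this task, if any. `unblock()` could use
    /// this both as the "is it already on a runqueue?" check and as the queue
    /// to put the task straight back onto.
    fn current_runqueue(&self) -> Option<u32> {
        match self.on_runqueue.load(Ordering::Acquire) {
            NO_RUNQUEUE => None,
            cpu => Some(cpu),
        }
    }
}
```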
Note that this inherently adds task migration, since an unblocked task can be added to any core's run queue.
Depends on #1035, #1044, #1045.