
[ktest] Implement real parallel ktest and measure ktest time #834

Closed
wants to merge 1 commit from the parallel_ktest branch

Conversation

junyang-zh
Collaborator

This is all you want.

Fix #696.

Also, ktest time is measured in milliseconds to serve as a performance hint.

@junyang-zh junyang-zh force-pushed the parallel_ktest branch 6 times, most recently from 42b0f7f to 2bbab26 Compare May 11, 2024 14:09
Contributor

@tatetian tatetian left a comment


I like the clever trick of introducing KtestDependencies so that ktest can work regardless of the runtime environment.

My big question is whether this PR is solving a problem that users really care about. I think users want to enable benchmarks via cargo osdk bench.

#[cfg(ktest)]
mod tests {
    use super::*;
    use ktest::Bencher;

    #[kbench]
    fn bench_add_two(b: &mut Bencher) {
        b.iter(|| add_two(2));
    }
}

I would prefer not measuring the elapsed time of individual tests:

    let start_millis = (deps.monotonic_millis)();
    let test_result = test.run(&deps.catch_unwind);
    let duration_millis = (deps.monotonic_millis)() - start_millis;

A test usually performs an operation repeatedly. Showing the total time of a test is not very helpful.

framework/libs/ktest/src/runner.rs (outdated review thread, resolved)
@junyang-zh
Collaborator Author

My big question is whether this PR is solving a problem that users really care about. I think users want to enable benchmarks via cargo osdk bench.

No, this PR doesn't go that far.

Measuring test time is also useful for preventing performance regressions, as some tests contain iterations.

And implementing the kbench feature requires more effort: a dedicated library for black-boxing values, deciding the number of iterations to run, and so on. I have not investigated this topic much.
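
For illustration, here is a rough sketch of what such a Bencher might need; the type, its `iters` field, and the adaptive-iteration idea are assumptions for this sketch, not part of this PR.

    use core::hint::black_box;

    /// Hypothetical bencher: the runner would choose `iters` adaptively,
    /// e.g. growing it until a run is long enough to measure reliably.
    pub struct Bencher {
        iters: u64,
    }

    impl Bencher {
        pub fn iter<T>(&mut self, mut body: impl FnMut() -> T) {
            for _ in 0..self.iters {
                // `black_box` keeps the compiler from optimizing the
                // benchmarked call away.
                black_box(body());
            }
        }
    }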

@tatetian
Contributor

tatetian commented May 14, 2024

Measuring test time is also useful for preventing performance regressions, as some tests contain iterations.

Tests are usually built in debug mode, so the resulting execution time cannot represent the true performance. Performance regressions should be detected with dedicated benchmarks.

The overarching principle in designing OSDK is to make cargo osdk <action> behave the same way as cargo <action>. This gives users the least surprise. cargo test does not show the execution time of individual tests, and neither should cargo osdk test.

In the same spirit, I think cargo osdk test should measure and show the total time spent running all tests. This is what cargo test does.

test result: ok. 1 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
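
For illustration, a minimal sketch of reporting only the total run time, reusing the PR's `KtestDependencies` fields; `run_all_tests` and `Summary` are assumed helper names, not part of the PR.

    let start_millis = (deps.monotonic_millis_fn)();
    let summary: Summary = run_all_tests(&deps);
    let elapsed = (deps.monotonic_millis_fn)() - start_millis;
    // Print a cargo-test-like summary line with the total elapsed time only.
    (deps.print_fn)(format_args!(
        "test result: {}. {} passed; {} failed; finished in {}.{:02}s\n",
        if summary.failed == 0 { "ok" } else { "FAILED" },
        summary.passed,
        summary.failed,
        elapsed / 1000,
        (elapsed % 1000) / 10,
    ));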

@junyang-zh
Collaborator Author

Measuring test time is also useful for preventing performance regressions, as some tests contain iterations.

Tests are usually built in debug mode, so the resulting execution time cannot represent the true performance. Performance regressions should be detected with dedicated benchmarks.

Ok then. I may remove it in this PR.

@junyang-zh junyang-zh force-pushed the parallel_ktest branch 2 times, most recently from 9f5800b to 136f28b Compare May 14, 2024 07:40
@junyang-zh
Collaborator Author

In the same spirit, I think cargo osdk test should measure and show the total time spent running all tests. This is what cargo test does.

test result: ok. 1 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s

The total time is reported now, although the measurement is really primitive and is inaccurate if tasks can be preempted. They can't be preempted right now so it is OK currently.

@tatetian
Contributor

The measurement is really primitive and is inaccurate if tasks can be preempted.

Preemption is not really a concern. The reported time is intended to give the user a sense of how long it takes to run these tests. The running time is affected by many factors, one of which is preemption. This factor has been faithfully reflected by the reported time. So we are good.


// Wait for all spawned tests.
while FINISHED.load(Ordering::Relaxed) < crate_.nr_tot_tests() {
(deps.yield_fn)()
Contributor

@tatetian tatetian May 14, 2024


This is inefficient.

How about replacing yield_fn with join_fn?

    /// Spawn a new task, returning the task ID.
    pub spawn_fn: fn(fn() -> ()) -> u32,
    /// Join the task of a given ID.
    pub join_fn: fn(u32),
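
For illustration, a rough sketch of how the runner loop could then wait on the spawned tests; `test_entries` is an assumed helper yielding the `fn() -> ()` entry of each test.

    let mut task_ids = alloc::vec::Vec::new();
    for entry in test_entries() {
        // `entry` is the `fn() -> ()` that runs one test.
        task_ids.push((deps.spawn_fn)(entry));
    }
    // Block on each spawned test instead of polling a completion counter.
    for id in task_ids {
        (deps.join_fn)(id);
    }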

Collaborator Author


Not planned. We don't have a join() method in the frame yet. This is indeed inefficient for FIFO scheduling...

framework/libs/ktest/src/runner.rs (outdated review thread, resolved)
}
}

spawn_ktest(deps, test);
Contributor


Spawning too many tasks is not an ideal configuration. A more sensible configuration is to have as many tasks as the number of CPUs. cargo test even allows the user to specify the concurrency at run time (e.g., cargo test -- --test-threads=2). I think the ktest framework should take such an argument.
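
For illustration, a rough sketch of a fixed worker pool under the PR's dependency-injection style; `next_test`, `run_one_test`, and the worker-count parameter are assumptions, and `--test-threads` would map onto `nr_workers`.

    use core::sync::atomic::{AtomicUsize, Ordering};

    static FINISHED: AtomicUsize = AtomicUsize::new(0);

    fn run_with_workers(deps: &KtestDependencies, nr_workers: usize, nr_tests: usize) {
        // Spawn only as many worker tasks as requested (e.g., the number of CPUs).
        for _ in 0..nr_workers {
            (deps.spawn_fn)(worker_entry);
        }
        // Wait until every test has been run by some worker.
        while FINISHED.load(Ordering::Relaxed) < nr_tests {
            (deps.yield_fn)();
        }
    }

    fn worker_entry() {
        // `next_test` pops one test from a shared, lock-protected queue.
        while let Some(test) = next_test() {
            run_one_test(test);
            FINISHED.fetch_add(1, Ordering::Relaxed);
        }
    }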

Collaborator Author


Currently, spawning means creating and running the task immediately, and no preemption is allowed. So on a single CPU the execution order is main -> test1 -> main -> test2 -> main -> test3 ...

It is not a big problem currently, I guess. But supporting test workers would indeed be better.

@junyang-zh
Collaborator Author

The OSDK failure seems to be a known issue, not related to this PR.

@tatetian
Contributor

tatetian commented Jun 8, 2024

I think it is time to revisit the design decision of making ktest relatively independent from aster-frame.

Code like this

/// A set of functions needed to perform ktests.
#[derive(Clone)]
pub struct KtestDependencies {
    /// The corresponding utility of `std::panic::catch_unwind`.
    pub catch_unwind_fn: CatchUnwindFn,
    /// The print function to print the test prompts for the user.
    pub print_fn: fn(core::fmt::Arguments),
    /// The function returning monotonic milliseconds to measure the time.
    pub monotonic_millis_fn: fn() -> u64,
    /// The function to spawn a test.
    pub spawn_fn: fn(fn() -> ()) -> (),
    /// Yield the current task.
    /// The main task may busy-loop waiting for all spawned tasks to finish.
    /// Yielding helps performance if there are few cores in the system.
    pub yield_fn: fn(),
}

and this

/// Define a spinlocked static mutable variable.
/// Example:
/// ```ignore
/// spinlock! {
///     pub static GLOBAL_COUNT: usize = 0;
/// }
/// ```
/// To access the variable, use [`lock_and`].

have clearly shown how much effort we have to make to keep ktest independent from aster-frame.

I don't expect the ktest crate to have any other users than aster-frame itself and crates that are based on aster-frame. Giving up the independence of ktest can simplify its implementation significantly.

Contributor

@tatetian tatetian left a comment


This PR aims to introduce two improvements:

  1. Timekeeping;
  2. Parallel execution.

It has done a good job of achieving the first goal, but not the second one. This can be seen from code like this

            // FIXME: This spawns every ktest as a task, which may be inefficient.
            // We need to spawn runners that take tasks and run them.
            spawn_ktest(deps, test, spawned);
            spawned += 1;

and this

    // Wait for all spawned tests.
    while FINISHED.load(Ordering::Relaxed) < spawned {
        (deps.yield_fn)()

If we want to run tests in parallel, we must do it the right way. Since Asterinas does not have multiprocessor support for now, doing it the "wrong" way does not bring any true benefit. In fact, doing it the "wrong" way is even harmful: having a runnable thread that keeps yielding will make it impossible to benchmark using the ktest infrastructure, as our task scheduler is not smart enough to guarantee that the benchmark task gets most of the CPU time.

So I suggest (1) accomplishing the two goals in separate PRs, and (2) running parallel tests in the right way.

@junyang-zh junyang-zh mentioned this pull request Jun 26, 2024
@junyang-zh junyang-zh added the S-stale Questions that no one has followed up on for a long time label Jun 26, 2024
@junyang-zh
Collaborator Author

Um, we may rethink ktest; this PR can serve as a reference for implementing #975. Closing.

@junyang-zh junyang-zh closed this Jun 26, 2024
Labels
S-stale Questions that no one has followed up on for a long time
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Supporting running kernel mode unit test in kernel threads
2 participants