[DNM] Enable SMP on RISC-V #11354

nategraff-sifive · 2018-11-13T23:23:30Z

Do not merge. Opening a PR to seek feedback and comments.

A series of changes to enable symmetric multiprocessing on riscv32. I based my work partly on the way that SMP was enabled for Xtensa, including the use of the new _arch_switch API in place of swap. Thanks @andyross for the ground work to make this possible in Zephyr.

There are few areas that I'm concerned about:

Scheduler Changes

During development I found that I needed to make a number of changes to the common scheduler code. Most of these revolved around the fact that SMP removes the currently running threads from the ready queue, and in several places the scheduler made the incorrect assumption that threads were in the ready queue when they were not. This usually manifests in the ready queue being corrupted (threads getting dropped) and/or the scheduler handing back a bad thread pointer and making the CPU jump to nowhere.

Odds are I've made some mistakes, or at least not found the extent of the problems. Especially looking for review here.

Testing in QEMU

I've created a sifive-multicore target board for testing purposes. It's basically a clone of HiFive 1 with the SMP flags turned on. QEMU 3.0.0 doesn't have an embedded multicore RISC-V target, so in order to try this out you'll need to patch QEMU. It's a very simple one-line change. In hw/riscv/sifive_e.c, just change line 203 to

    mc->max_cpus = <some number greater than 1>;

I haven't gotten sanitycheck passing on sifive-multicore yet either. Still tracking down bugs.

Saved Registers During Interrupts

The multicore interrupt handler I created saves both callee and caller-saved registers onto the stack immediately, as opposed to the single-core handler which only saves callee-saved registers when it knows that a reschedule is about to occur. I didn't see a particularly good way to optimize this, but I wanted to point it out and see what people thought.

Add a mutex to make sure only one philosopher tries to write state to the console at a time.

Requires a patch to QEMU 3.0.0 which allows the sifive_e target to simulate with multiple cores via the `-smp` option

Allocating the start flag atomic on the stack creates a race condition. In true multicore environments you might be unlikely to hit it, but when debugging in QEMU I frequently found that the stack space got stomped on before the other cores had a chance to read the flag.

For non-SMP targets, makes sure than harts other than hart 0 are permanently parked. For SMP targets, permanently parks all harts >= CONFIG_MP_NUM_CPUS and starts up secondary harts in SMP mode.

_arch_irq_lock disables the interrupt enable bits without locking the global SMP spinlock

Rewrite ISR for SMP and use the switch API instead of the swap API The switch API hands a stack frame and not a thread context, so save the callee-saved registers on the stack frame as well. It might be helpful to retarget the switch handle to be the entire thread context to save stack space again.

The scheduler didn't properly handle SMP in a number of different cases, such as removing the current thread from the queue and determining when a thread was able to be run. The general issues tend to be that running threads in SMP mode are not present in the run queue, and the scheduler often assumes the wrong thing about whether a thread is currently in a queue or not. Adding presence checks fixes a number of these issues.

codecov-io · 2018-11-14T00:08:18Z

Codecov Report

Merging #11354 into master will increase coverage by <.01%.
The diff coverage is 71.42%.

@@            Coverage Diff             @@
##           master   #11354      +/-   ##
==========================================
+ Coverage   48.37%   48.37%   +<.01%     
==========================================
  Files         265      265              
  Lines       42188    42196       +8     
  Branches    10137    10143       +6     
==========================================
+ Hits        20408    20412       +4     
  Misses      17703    17703              
- Partials     4077     4081       +4

Impacted Files	Coverage Δ
kernel/sched.c	`91.36% <71.42%> (-1.13%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 39b2a09...29c5ab7. Read the comment docs.

The 250745e OT stack upmerge pulled upstream commit 079852b67e9b ("[uptime] enforce `UPTIME` feature for MTD and FTD builds (zephyrproject-rtos#11354)") which made `OPENTHREAD_CONFIG_UPTIME_ENABLE` mandatory for MTD builds. Update the module configuration accordingly to fix a build failure with CONFIG_OPENTHREAD_MTD=y. Signed-off-by: Aurelien Jarno <aurelien@aurel32.net>

nategraff-sifive added 13 commits November 13, 2018 14:15

samples: Add console access mutex for philosophers

99d5e1b

Add a mutex to make sure only one philosopher tries to write state to the console at a time.

riscv32: Make RISC-V machine timer multicore-aware

74354f5

boards: riscv: Add multicore target for SMP tests

391852b

Requires a patch to QEMU 3.0.0 which allows the sifive_e target to simulate with multiple cores via the `-smp` option

riscv32: Move _thread_entry_wrapper out of swap.S

deda4ef

riscv32: Add custom atomic operations for RISC-V

a2035db

riscv32: smp: Add _arch_curr_cpu

0fa53a8

riscv32: Fatal error handler shows mhartid

33ad6c8

riscv32: smp: Add _arch_start_cpu

d1480df

riscv32: smp: Create multicore startup vector

01636b8

For non-SMP targets, makes sure than harts other than hart 0 are permanently parked. For SMP targets, permanently parks all harts >= CONFIG_MP_NUM_CPUS and starts up secondary harts in SMP mode.

riscv32: soc_interrupt_init uses _arch_irq_lock

8226c2e

_arch_irq_lock disables the interrupt enable bits without locking the global SMP spinlock

nategraff-sifive added area: RISCV RISCV Architecture (32-bit & 64-bit) area: SMP Symmetric multiprocessing labels Nov 13, 2018

nategraff-sifive requested a review from andyross November 13, 2018 23:23

nategraff-sifive requested review from andrewboie, kgugala and pgielda as code owners November 13, 2018 23:23

nategraff-sifive added the DNM This PR should not be merged (Do Not Merge) label Nov 13, 2018

nategraff-sifive closed this Jul 15, 2019

aurel32 mentioned this pull request Nov 27, 2025

openthread: Kconfig: fix MTD build by enabling OPENTHREAD_UPTIME #100184

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[DNM] Enable SMP on RISC-V #11354

[DNM] Enable SMP on RISC-V #11354

Uh oh!

nategraff-sifive commented Nov 13, 2018

Uh oh!

codecov-io commented Nov 14, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[DNM] Enable SMP on RISC-V #11354

[DNM] Enable SMP on RISC-V #11354

Uh oh!

Conversation

nategraff-sifive commented Nov 13, 2018

Scheduler Changes

Testing in QEMU

Saved Registers During Interrupts

Uh oh!

codecov-io commented Nov 14, 2018

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants