# MINOTAUR: A Timing-Predictable Open Source RISC-V Core Featuring Speculative Execution

Alban Gruin Thomas Carle Hugues Cassé Christine Rochange Pascal Sainrat

IRIT — Université Toulouse 3 — CNRS

#### **Motivation**

In real-time systems, precise Worst-Case Execution Time (WCET) analysis is required to schedule tasks in such a way that they meet their deadlines. In multicore processors, interference between cores (e.g. on a shared memory bus) impacts the WCET, and must be taken into account.

The usual solution is the compositional approach: latencies to shared resources are analysed separately, then combined to the WCET of tasks. However, so-called "timing anomalies" make this approach unsafe. Timing anomalies are situations in which a worst-case local situation (e.g. a cache miss) does not result in the worst global situation (i.e. the WCET).



Figure 1. Example of a timing anomaly on the Ariane processor.

In this work, we modified Ariane, an open-source RISC-V processor, to improve its predictability, without sacrificing its performance too much. We called the result MINOTAuR (Mostly IN-Order Timing predictAble processoR) [1].

#### **Monotonicity**

To allow compositionality, one needs to guarantee that the processor is free from timing anomalies. One solution is to design a "monotonous" pipeline, in which an instruction cannot prevent an "older" instruction (more advanced in the pipeline) from gaining access to a shared resource.



Figure 2. Example of monotonicity violation: the instruction in MEM misses in its cache and has to wait that a miss in the IF stage is resolved.

# **Issues with the Ariane processor**

The Ariane processor is an open-source RISC-V core. It features parallel functional units, a scoreboard and a dynamic branch prediction system.



Figure 3. Overview of the Ariane processor. The parts highlighted in orange have been modified in MINOTAuR.

To make MINOTAuR monotonic, we prevented the instruction cache from making requests when a memory instruction is in the pipeline (from the fetch stage to the load/store unit), and when the processor is speculating to avoid changing the state of the cache, in case of a misprediction.

# **How RISC-V helps**

Thanks to the open, royalty-free nature of RISC-V, it is easy to find high-quality, free and open-source processors design, like the Ariane. This allows us to focus on our research (no need to design bespoke processors), appropriate existing components (software, toolchains, operating systems), and make our changes open, so that the community can immediately benefit from them.

### Formal model

#### Is an instruction ready?

```
c.ready(i) \coloneqq \frac{(c.stg(i) \neq pre \land \neg c.pending(i, branch) \land pwrong(i))}{\lor (c.cnt(i) = 0 \land c.isnext(c.stg(i), i))} \\ \lor (c.cnt(i) = 0 \land c.isnext(c.stg(i), i)) \\ \land (c.stg(i) = \text{PC} \Rightarrow (ichit(i)) \\ \lor (\neg c.pending(i, branch) \land \neg c.pending(i, load) \land \neg c.pending(i, store) \\ \land \neg c.pending(i, atomic)))) \\ \land (c.stg(i) = \text{IS} \Rightarrow \\ ((opc(i) \in \{load, store, atomic\} \Rightarrow c.slot2(\text{LSU})) \\ \land (opc(i) \notin \{load, store, atomic\} \Rightarrow \neg c.pending(i, csr)) \\ \land (opc(i) \in \{mul, div\} \Rightarrow \neg c.pending(i, div)) \\ \land (\forall j < i.dep(i, j), c.stg(j) \sqsupseteq_S \text{CO}))) \\ \land (c.stg(i) = \text{LSU} \Rightarrow (opc(i) \in \{store, atomic\} \land \neg c.pending(i, atomic)) \\ \lor (opc(i) = load \land (\neg c.pending(i, store) \land \neg c.pending(i, atomic)))))
```

# Is a stage free?

```
c.free(s) :=
s \in \{ALU, MUL_1, CSR, MUL_2, CO, post\}
\forall (s \in \{IF, IS, LSU, SU\} \land c.slot(s))
\forall (s \in \{PC, ID, DIV, LU, ST\} \land ((\neg \exists j. c.stg(j) = s))
\forall (\exists j. c.stg(j) = s \land c.ready(j) \land c.free(c.nstg(j)))))
\forall (\exists i.c.stg(i) = s \land pwrong(i) \land \neg c.pending(i, branch))
```

# Results

We synthesized MINOTAuR on a Xilinx Zynq Z7-20 FPGA using Vivado, and compared its performance with Ariane on the TACLe benchmark suite and on CoreMark.

|                      | Ariane      | MINOTAuR    |
|----------------------|-------------|-------------|
| CoreMark score       | 110.77      | 105.72      |
| Total cycles (TACLe) | 599,720,712 | 604,672,290 |
| Average              |             | 5.67 %      |
| Weighted mean        |             | 0.83 %      |

Table 1. Performance comparison of Ariane and MINOTAuR.

We successfully modified an existing processor to make it free from timing anomalies, while reconciling performance and timing-predictability: while the MINOTAuR core is slower on memory-intensive programs, its performance remains comparable to Ariane in other benchmarks.

# References

Alban Gruin, Thomas Carle, Hugues Cassé, and Christine Rochange. Speculative execution and timing predictability in an open source RISC-V core. In 2021 IEEE Real-Time Systems Symposium (RTSS), pages 393–404. IEEE, 2021.



