# Chisel 3.0 Tutorial (Beta)

Jonathan Bachrach, Krste Asanović, John Wawrzynek EECS Department, UC Berkeley {jrb|krste|johnw}@eecs.berkeley.edu

January 12, 2017

#### 1 Introduction

This document is a tutorial introduction to Chisel (Constructing Hardware In a Scala Embedded Language). Chisel is a hardware construction language embedded in the high-level programming language Scala. At some point we will provide a proper reference manual, in addition to more tutorial examples. In the meantime, this document along with a lot of trial and error should set you on your way to using Chisel. Chisel is really only a set of special class definitions, predefined objects, and usage conventions within Scala, so when you write a Chisel program you are actually writing a Scala program. However, for the tutorial we don't presume that you understand how to program in Scala. We will point out necessary Scala features through the Chisel examples we give, and significant hardware designs can be completed using only the material contained herein. But as you gain experience and want to make your code simpler or more reusable, you will find it important to leverage the underlying power of the Scala language. We recommend you consult one of the excellent Scala books to become more expert in Scala programming.

Chisel is still in its infancy and you are likely to encounter some implementation bugs, and perhaps even a few conceptual design problems. However, we are actively fixing and improving the language, and are open to bug reports and suggestions. Even in its early state, we hope Chisel will help designers be more productive in building designs that are easy to reuse and maintain.

Through the tutorial, we format commentary on our design choices as in this paragraph. You should be able to skip the commentary sections and still fully understand how to use Chisel, but we hope you'll find them interesting.

We were motivated to develop a new hardware language by years of struggle with existing hardware description languages in our research projects and hardware design courses. Verilog and VHDL were

developed as hardware simulation languages, and only later did they become a basis for hardware synthesis. Much of the semantics of these languages are not appropriate for hardware synthesis and, in fact, many constructs are simply not synthesizable. Other constructs are non-intuitive in how they map to hardware implementations, or their use can accidently lead to highly inefficient hardware structures. While it is possible to use a subset of these languages and yield acceptable results, they nonetheless present a cluttered and confusing specification model, particularly in an instructional setting.

However, our strongest motivation for developing a new hardware language is our desire to change the way that electronic system design takes place. We believe that it is important to not only teach students how to design circuits, but also to teach them how to design circuit generators—programs that automatically generate designs from a high-level set of design parameters and constraints. Through circuit generators, we hope to leverage the hard work of design experts and raise the level of design abstraction for everyone. To express flexible and scalable circuit construction, circuit generators must employ sophisticated programming techniques to make decisions concerning how to best customize their output circuits according to high-level parameter values and constraints. While Verilog and VHDL include some primitive constructs for programmatic circuit generation, they lack the powerful facilities present in modern programming languages, such as object-oriented programming, type inference, support for functional programming, and reflection.

Instead of building a new hardware design language from scratch, we chose to embed hardware construction primitives within an existing language. We picked Scala not only because it includes the programming features we feel are important for building circuit generators, but because it was specifically developed as a base for domain-specific languages.

## 2 Hardware expressible in Chisel

This version of Chisel only supports binary logic, and does not support tri-state signals.

We focus on binary logic designs as they constitute the vast majority of designs in practice. We omit support for tri-state logic in the current Chisel language as this is in any case poorly supported by industry flows, and difficult to use reliably outside of controlled hard macros.

# 3 Datatypes in Chisel

Chisel datatypes are used to specify the type of values held in state elements or flowing on wires. While hardware designs ultimately operate on vectors of binary digits, other more abstract representations for values allow clearer specifications and help the tools generate more optimal circuits. In Chisel, a raw collection of bits is represented by the Bits type. Signed and unsigned integers are considered subsets of fixed-point numbers and are represented by types SInt and UInt respectively. Signed fixed-point numbers, including integers, are represented using two's-complement format. Boolean values are represented as type Bool. Note that these types are distinct from Scala's builtin types such as Int or Boolean. Additionally, Chisel defines Bundles for making collections of values with named fields (similar to structs in other languages), and Vecs for indexable collections of values. Bundles and Vecs will be covered later.

Constant or literal values are expressed using Scala integers or strings passed to constructors for the types:

```
1.U    // decimal 1-bit lit from Scala Int.
"ha".U    // hexadecimal 4-bit lit from string.
"o12".U    // octal 4-bit lit from string.
"b1010".U    // binary 4-bit lit from string.

5.S    // signed decimal 4-bit lit from Scala Int.
-8.S    // negative decimal 4-bit lit from Scala Int.
5.U    // unsigned decimal 3-bit lit from Scala Int.
true.B    // Bool lits from Scala lits.
false.B
```

By default, the Chisel compiler will size each constant to the minimum number of bits required to hold the constant, including a sign bit for signed types. Bit widths can also be specified explicitly on literals, as shown below:

```
"ha".U(8.W) // hexadecimal 8-bit lit of type UInt
"o12".U(6.W) // octal 6-bit lit of type UInt
"b1010".U(12.W) // binary 12-bit lit of type UInt
```

```
5.S(7.W) // signed decimal 7-bit lit of type SInt 5.U(8.W) // unsigned decimal 8-bit lit of type UInt
```

For literals of type UInt, the value is zero-extended to the desired bit width. For literals of type SInt, the value is sign-extended to fill the desired bit width. If the given bit width is too small to hold the argument value, then a Chisel error is generated.

We are working on a more concise literal syntax for Chisel using symbolic prefix operators, but are stymied by the limitations of Scala operator overloading and have not yet settled on a syntax that is actually more readable than constructors taking strings.

We have also considered allowing Scala literals to be automatically converted to Chisel types, but this can cause type ambiguity and requires an additional import.

The SInt and UInt types will also later support an optional exponent field to allow Chisel to automatically produce optimized fixed-point arithmetic circuits.

#### 4 Combinational Circuits

A circuit is represented as a graph of nodes in Chisel. Each node is a hardware operator that has zero or more inputs and that drives one output. A literal, introduced above, is a degenerate kind of node that has no inputs and drives a constant value on its output. One way to create and wire together nodes is using textual expressions. For example, we can express a simple combinational logic circuit using the following expression:

```
(a & b) | (~c & d)
```

The syntax should look familiar, with & and | representing bitwise-AND and -OR respectively, and ~ representing bitwise-NOT. The names a through d represent named wires of some (unspecified) width.

Any simple expression can be converted directly into a circuit tree, with named wires at the leaves and operators forming the internal nodes. The final circuit output of the expression is taken from the operator at the root of the tree, in this example, the bitwise-OR.

Simple expressions can build circuits in the shape of trees, but to construct circuits in the shape of arbitrary directed acyclic graphs (DAGs), we need to describe fan-out. In Chisel, we do this by naming a wire that holds a subexpression that we can then reference multiple times in subsequent expressions. We name a wire in Chisel by declaring a variable. For example, consider the select expression, which is used twice in the following multiplexer description:

```
val sel = a | b
val out = (sel & in1) | (~sel & in0)
```

The keyword val is part of Scala, and is used to name variables that have values that won't change. It is used here to name the Chisel wire, sel, holding the output of the first bitwise-OR operator so that the output can be used multiple times in the second expression.

### 5 Builtin Operators

Chisel defines a set of hardware operators for the builtin types shown in Table 1.

#### 5.1 Bitwidth Inference

Users are required to set bitwidths of ports and registers, but otherwise, bit widths on wires are automatically inferred unless set manually by the user. The bit-width inference engine starts from the graph's input ports and calculates node output bit widths from their respective input bit widths according to the following set of rules:

#### operation bit width z = x + ywz = max(wx, wy)z = x - ywz = max(wx, wy)z = x & ywz = min(wx, wy)z = Mux(c, x, y) wz = max(wx, wy)wz = wx + wyz = w \* ywz = wx + maxNum(n) $z = x \ll n$ $z = x \gg n$ wz = wx - minNum(n)z = Cat(x, y)wz = wx + wyz = Fill(n. x)wz = wx \* maxNum(n)

where for instance wz is the bit width of wire z, and the & rule applies to all bitwise logical operations.

The bit-width inference process continues until no bit width changes. Except for right shifts by known constant amounts, the bit-width inference rules specify output bit widths that are never smaller than the input bit widths, and thus, output bit widths either grow or stay the same. Furthermore, the width of a register must be specified by the user either explicitly or from the bitwidth of the reset value or the *next* parameter. From these two requirements, we can show that the bit-width inference process will converge to a fixpoint.

Our choice of operator names was constrained by the Scala language. We have to use triple equals === for equality and =/= for inequality to allow the native Scala equals operator to remain usable.

We are also planning to add further operators that constrain bitwidth to the larger of the two inputs.

#### 6 Functional Abstraction

We can define functions to factor out a repeated piece of logic that we later reuse multiple times in a design. For example, we can wrap up our earlier example of a simple combinational logic block as follows:

```
def clb(a: UInt, b: UInt, c: UInt, d: UInt): UInt =
  (a & b) | (~c & d)
```

where clb is the function which takes a, b, c, d as arguments and returns a wire to the output of a boolean circuit. The def keyword is part of Scala and introduces a function definition, with each argument followed by a colon then its type, and the function return type given after the colon following the argument list. The equals (=) sign separates the function argument list from the function definition.

We can then use the block in another circuit as follows:

```
val out = clb(a,b,c,d)
```

We will later describe many powerful ways to use functions to construct hardware using Scala's functional programming support.

#### 7 Bundles and Vecs

Bundle and Vec are classes that allow the user to expand the set of Chisel datatypes with aggregates of other types.

Bundles group together several named fields of potentially different types into a coherent unit, much like a struct in C. Users define their own bundles by defining a class as a subclass of Bundle:

A Scala convention is to capitalize the name of new classes and we suggest you follow that convention in Chisel too. The W method converts a Scala Int to a Chisel Width, specifying the number of bits in the type.

Vecs create an indexable vector of elements, and are constructed as follows:

```
// Vector of 5 23-bit signed integers.
val myVec = Vec(5, SInt(23.W))

// Connect to one element of vector.
val reg3 = myVec(3)
```

| Example                                                             | Explanation                                            |  |  |  |
|---------------------------------------------------------------------|--------------------------------------------------------|--|--|--|
| Bitwise operators. Valid on SInt, UInt, Bool.                       |                                                        |  |  |  |
| val invertedX = ~x                                                  | Bitwise NOT                                            |  |  |  |
| <pre>val hiBits = x &amp; "h_ffff_0000".U</pre>                     | Bitwise AND                                            |  |  |  |
| val flagsOut = flagsIn   overflow                                   | Bitwise OR                                             |  |  |  |
| <pre>val flagsOut = flagsIn ^ toggle</pre>                          | Bitwise XOR                                            |  |  |  |
| Bitwise reductions. Valid on SInt and UInt. Returns Bool.           |                                                        |  |  |  |
| val allSet = andR(x)                                                | AND reduction                                          |  |  |  |
| <pre>val anySet = orR(x)</pre>                                      | OR reduction                                           |  |  |  |
| <pre>val parity = xorR(x)</pre>                                     | XOR reduction                                          |  |  |  |
| Equality comparison. Valid on SInt, UInt, and Bool. Returns Bool.   |                                                        |  |  |  |
| val equ = x === y                                                   | Equality                                               |  |  |  |
| val neq = $x = /= y$                                                | Inequality                                             |  |  |  |
| Shifts. Valid on SInt and UInt.                                     |                                                        |  |  |  |
| val twoToTheX = 1.S << x                                            | Logical left shift.                                    |  |  |  |
| val hiBits = x >> 16.U                                              | Right shift (logical on UInt and& arithmetic on SInt). |  |  |  |
| Bitfield manipulation. Valid on SInt, UInt, and Bool.               |                                                        |  |  |  |
| val xLSB = x(0)                                                     | Extract single bit, LSB has index 0.                   |  |  |  |
| val $xTopNibble = x(15,12)$                                         | Extract bit field from end to start bit position.      |  |  |  |
| <pre>val usDebt = Fill(3, "hA".U)</pre>                             | Replicate a bit string multiple times.                 |  |  |  |
| <pre>val float = Cat(sign,exponent,mantissa)</pre>                  | Concatenates bit fields, with first argument on left.  |  |  |  |
| Logical operations. Valid on Bools.                                 |                                                        |  |  |  |
| val sleep = !busy                                                   | Logical NOT                                            |  |  |  |
| val hit = tagMatch && valid                                         | Logical AND                                            |  |  |  |
| val stall = src1busy    src2busy                                    | Logical OR                                             |  |  |  |
| val out = Mux(sel, inTrue, inFalse)                                 | Two-input mux where sel is a Bool                      |  |  |  |
| Arithmetic operations. Valid on Nums: SInt and UInt.                |                                                        |  |  |  |
| val sum = a + b                                                     | Addition                                               |  |  |  |
| val diff = a - b                                                    | Subtraction                                            |  |  |  |
| val prod = a * b                                                    | Multiplication                                         |  |  |  |
| val div = a / b                                                     | Division                                               |  |  |  |
| val mod = a % b                                                     | Modulus                                                |  |  |  |
| Arithmetic comparisons. Valid on Nums: SInt and UInt. Returns Bool. |                                                        |  |  |  |
| val gt = a > b                                                      | Greater than                                           |  |  |  |
| val gte = a >= b                                                    | Greater than or equal                                  |  |  |  |
| val lt = a < b                                                      | Less than                                              |  |  |  |
| val lte = a <= b                                                    | Less than or equal                                     |  |  |  |

Table 1: Chisel operators on builtin data types.

(Note that we have to specify the type of the Vec elements inside the trailing curly brackets, as we have to pass the bitwidth parameter into the SInt constructor.)

The set of primitive classes (SInt, UInt, and Bool) plus the aggregate classes (Bundles and Vecs) all inherit from a common superclass, Data. Every object that ultimately inherits from Data can be represented as a bit vector in a hardware design.

Bundles and Vecs can be arbitrarily nested to build complex data structures:

```
class BigBundle extends Bundle {
  // Vector of 5 23-bit signed integers.
  val myVec = Vec(5, SInt(23.W))
  val flag = Bool()
  // Previously defined bundle.
  val f = new MyFloat()
}
```

Note that the builtin Chisel primitive and aggregate classes do not require the new when creating an instance, whereas new user datatypes will. A Scala apply constructor can be defined so that a user datatype also does not require new, as described in Section 14.

#### 8 Ports

Ports are used as interfaces to hardware components. A port is simply any Data object that has directions assigned to its members.

Chisel provides port constructors to allow a direction to be added (input or output) to an object at construction time. Simply wrap the object in an Input() or Output() function.

An example port declaration is as follows:

```
class Decoupled extends Bundle {
  val ready = Output(Bool())
  val data = Input(UInt(32.W))
  val valid = Input(Bool())
}
```

After defining Decoupled, it becomes a new type that can be used as needed for module interfaces or for named collections of wires.

The direction of an object can also be assigned at instantation time:

The methods asInput and asOutput force all modules of the data object to the requested direction.

By folding directions into the object declarations, Chisel is able to provide powerful wiring constructs described later.

#### 9 Modules

Chisel *modules* are very similar to Verilog *modules* in defining a hierarchical structure in the generated circuit. The hierarchical module namespace is accessible in downstream tools to aid in debugging and physical layout. A user-defined module is defined as a *class* which:

- inherits from Module,
- contains an interface wrapped in an IO() function and stored in a port field named io, and
- wires together subcircuits in its constructor.

As an example, consider defining your own twoinput multiplexer as a module:

```
class Mux2 extends Module {
  val io = IO(new Bundle{
    val sel = Input(UInt(1.W))
    val in0 = Input(UInt(1.W))
    val in1 = Input(UInt(1.W))
    val out = Output(UInt(1.W))
  })
  io.out := (io.sel & io.in1) | (~io.sel & io.in0)
}
```

The wiring interface to a module is a collection of ports in the form of a Bundle. The interface to the module is defined through a field named io. For Mux2, io is defined as a bundle with four fields, one for each multiplexer port.

The := assignment operator, used here in the body of the definition, is a special operator in Chisel that wires the input of left-hand side to the output of the right-hand side.

#### 9.1 Module Hierarchy

We can now construct circuit hierarchies, where we build larger modules out of smaller sub-modules. For example, we can build a 4-input multiplexer module in terms of the Mux2 module by wiring together three 2-input multiplexers:

```
class Mux4 extends Module {
  val io = IO(new Bundle {
    val in0 = Input(UInt(1.W))
    val in1 = Input(UInt(1.W))
  val in2 = Input(UInt(1.W))
  val in3 = Input(UInt(1.W))
  val sel = Input(UInt(2.W))
  val out = Output(UInt(1.W))
```

```
})
val m0 = Module(new Mux2())
m0.io.sel := io.sel(0)
m0.io.in0 := io.in0; m0.io.in1 := io.in1

val m1 = Module(new Mux2())
m1.io.sel := io.sel(0)
m1.io.in0 := io.in2; m1.io.in1 := io.in3

val m3 = Module(new Mux2())
m3.io.sel := io.sel(1)
m3.io.in0 := m0.io.out; m3.io.in1 := m1.io.out
io.out := m3.io.out
}
```

We again define the module interface as io and wire up the inputs and outputs. In this case, we create three Mux2 children modules, using the Module constructor function and the Scala new keyword to create a new object. We then wire them up to one another and to the ports of the Mux4 interface.

# 10 Running Examples

Now that we have defined modules, we will discuss how we actually run and test a circuit.

Testing is a crucial part of circuit design, and thus in Chisel we provide a mechanism for testing circuits by providing test vectors within Scala using tester method calls which binds a tester to a module and allows users to write tests using the given debug protocol. In particular, users utilize:

- poke to set input port and state values,
- step to execute the circuit one time unit,
- peek to read port and state values, and
- expect to compare peeked circuit values to expected arguments.

Chisel produces Firrtl intermediate representation (IR). Firrtl can be interpreted directly or can be translated into Verilog, which can then be used to generate a C++ simulator through verilator.

For example, in the following:

```
expect(c.io.out, (if (s == 1) i1 else i0))
     }
}
```

assignments for each input of Mux2 are set to the appropriate values using poke. For this particular example, we are testing the Mux2 by hardcoding the inputs to some known values and checking if the output corresponds to the known one. To do this, on each iteration we generate appropriate inputs to the module and tell the simulation to assign these values to the inputs of the device we are testing c, step the circuit 1 clock cycle, and test the expected value. Steps are necessary to update registers and the combinational logic driven by registers. For pure combinational paths, poke alone is sufficient to update all combinational paths connected to the poked input wire.

Finally, the following the tester is invoked by calling runPeekPokeTester:

```
def main(args: Array[String]): Unit = {
  runPeekPokeTester(() => new GCD()){
     (c,b) => new GCDTests(c,b)}
}
```

This will run the tests defined in GCDTests with the GCD module being simulated but the Firrtl interpreter. We can instead have the GCD module be simulated by a C++ simulator generated by Verilator by calling the following:

```
def main(args: Array[String]): Unit = {
  runPeekPokeTester(() => new GCD(), "verilator"){
    (c,b) => new GCDTests(c,b)}
}
```

#### 11 State Elements

The simplest form of state element supported by Chisel is a positive edge-triggered register, which can be instantiated as:

```
val reg = Reg(next = in)
```

This circuit has an output that is a copy of the input signal in delayed by one clock cycle. Note that we do not have to specify the type of Reg as it will be automatically inferred from its input when instantiated in this way. In the current version of Chisel, clock and reset are global signals that are implicity included where needed.

Using registers, we can quickly define a number of useful circuit constructs. For example, a rising-edge detector that takes a boolean signal in and outputs

true when the current value is true and the previous value is false is given by:

```
def risingedge(x: Bool) = x && !Reg(next = x)
```

Counters are an important sequential circuit. To construct an up-counter that counts up to a maximum value, max, then wraps around back to zero (i.e., modulo max+1), we write:

```
def counter(max: UInt) = {
  val x = Reg(init = 0.U(max.getWidth.W))
  x := Mux(x === max, 0.U, x + 1.U)
  x
}
```

The counter register is created in the counter function with a reset value of 0 (with width large enough to hold max), to which the register will be initialized when the global reset for the circuit is asserted. The := assignment to x in counter wires an update combinational circuit which increments the counter value unless it hits the max at which point it wraps back to zero. Note that when x appears on the right-hand side of an assignment, its output is referenced, whereas when on the left-hand side, its input is referenced.

Counters can be used to build a number of useful sequential circuits. For example, we can build a pulse generator by outputting true when a counter reaches zero:

```
// Produce pulse every n cycles.
def pulse(n: UInt) = counter(n - 1.U) === 0.U
```

A square-wave generator can then be toggled by the pulse train, toggling between true and false on each pulse:

```
// Flip internal state when input true.
def toggle(p: Bool) = {
  val x = Reg(init = false.B)
  x := Mux(p, !x, x)
  x
}

// Square wave of a given period.
def squareWave(period: UInt) = toggle(pulse(period/2))
```

#### 11.1 Forward Declarations

Purely combinational circuits cannot have cycles between nodes, and Chisel will report an error if such a cycle is detected. Because they do not have cycles, combinational circuits can always be constructed in a feed-forward manner, by adding new nodes whose inputs are derived from nodes that have already been defined. Sequential circuits naturally have feedback between nodes, and so it is sometimes necessary to reference an output wire before the producing node has been defined. Because Scala evaluates program statements sequentially, we allow data nodes to serve as a wire providing a declaration of a node that can be used immediately, but whose input will be set later. For example, in a simple CPU, we need to define the pcPlus4 and brTarget wires so they can be referenced before defined:

```
val pcPlus4 = UInt()
val brTarget = UInt()
val pcNext = Mux(io.ctrl.pcSel, brTarget, pcPlus4)
val pcReg = Reg(next = pcNext, init = 0.U(32.W))
pcPlus4 := pcReg + 4.U
...
brTarget := addOut
```

The wiring operator := is used to wire up the connection after pcReg and addOut are defined.

#### 11.2 Conditional Updates

In our previous examples using registers, we simply wired the combinational logic blocks to the inputs of the registers. When describing the operation of state elements, it is often useful to instead specify when updates to the registers will occur and to specify these updates spread across several separate statements. Chisel provides conditional update rules in the form of the when construct to support this style of sequential logic description. For example,

```
val r = Reg(init = 0.U(16.W))
when (cond) {
   r := r + 1.U
}
```

where register r is updated at the end of the current clock cycle only if cond is true. The argument to when is a predicate circuit expression that returns a Bool. The update block following when can only contain update statements using the assignment operator :=, simple expressions, and named wires defined with val.

In a sequence of conditional updates, the last conditional update whose condition is true takes priority. For example,

```
when (c1) { r := 1.U } when (c2) { r := 2.U }
```

leads to r being updated according to the following truth table:

| c1 | c2 | r |                             |
|----|----|---|-----------------------------|
| 0  | 0  | r | r unchanged                 |
| 0  | 1  | 2 |                             |
| 1  | 0  | 1 |                             |
| 1  | 1  | 2 | c2 takes precedence over c1 |



Figure 1: Equivalent hardware constructed for conditional updates. Each when statement adds another level of data mux and ORs the predicate into the enable chain. The compiler effectively adds the termination values to the end of the chain automatically.

Figure 1 shows how each conditional update can be viewed as inserting a mux before the input of a register to select either the update expression or the previous input according to the when predicate. In addition, the predicate is OR-ed into an enable signal that drives the load enable of the register. The compiler places initialization values at the beginning of the chain so that if no conditional updates fire in a clock cycle, the load enable of the register will be deasserted and the register value will not change.

Chisel provides some syntactic sugar for other common forms of conditional update. The unless construct is the same as when but negates its condition. In other words,

```
unless (c) { body }
is the same as
when (!c) { body }
```

The update block can target multiple registers, and there can be different overlapping subsets of registers present in different update blocks. Each register is only affected by conditions in which it appears. The same is possible for combinational circuits (update of a Wire). Note that all combinational circuits need a default value. For example:

```
r := 3.5; s := 3.5
when (c1) { r := 1.5; s := 1.5 }
when (c2) { r := 2.5 }
```

leads to r and s being updated according to the following truth table:

```
c1 c2 r s  0 \quad 0 \quad 3 \quad 3 \\ 0 \quad 1 \quad 2 \quad 3 \ // \ r \ updated \ in \ c2 \ block, \ s \ at \ top \ level. \\ 1 \quad 0 \quad 1 \quad 1 \\ 1 \quad 1 \quad 2 \quad 1
```

We are considering adding a different form of conditional update, where only a single update block will take effect. These atomic updates are similar to Bluespec guarded atomic actions.

Conditional update constructs can be nested and any given block is executed under the conjunction of all outer nesting conditions. For example,

```
when (a) { when (b) { body } }
is the same as:
when (a && b) { body }
```

Conditionals can be chained together using when, .elsewhen, .otherwise corresponding to if, else if and else in Scala. For example,

```
when (c1) { u1 }
.elsewhen (c2) { u2 }
.otherwise { ud }

is the same as:
when (c1) { u1 }
when (!c1 && c2) { u2 }
when (!(c1 || c2)) { ud }
```

We introduce the switch statement for conditional updates involving a series of comparisons against a common key. For example,

```
switch(idx) {
  is(v1) { u1 }
  is(v2) { u2 }
}
```

is equivalent to:

```
when (idx === v1) { u1 } .elsewhen (idx === v2) { u2 }
```

Chisel also allows a Wire, i.e., the output of some combinational logic, to be the target of conditional update statements to allow complex combinational logic expressions to be built incrementally. Chisel does not allow a combinational output to be incompletely specified and will report an error if an unconditional update is not encountered for a combinational output.

In Verilog, if a procedural specification of a combina-

tional logic block is incomplete, a latch will silently be inferred causing many frustrating bugs.

It could be possible to add more analysis to the Chisel compiler, to determine if a set of predicates covers all possibilities. But for now, we require a single predicate that is always true in the chain of conditional updates to a Wire.

#### 11.3 Finite State Machines

A common type of sequential circuit used in digital design is a Finite State Machine (FSM). An example of a simple FSM is a parity generator:

```
class Parity extends Module {
  val io = IO(new Bundle {
    val in = Input(Bool())
    val out = Output(Bool()) })

val s_even :: s_odd :: Nil = Enum(2)
val state = Reg(init = s_even)
when (io.in) {
    when (state === s_even) { state := s_odd }
    when (state === s_odd) { state := s_even }
}
io.out := (state === s_odd)
}
```

where Enum(2) generates two UInt literals. States are updated when in is true. It is worth noting that all of the mechanisms for FSMs are built upon registers, wires, and conditional updates.

Below is a more complicated FSM example which is a circuit for accepting money for a vending machine:

```
class VendingMachine extends Module {
 val io = IO(new Bundle {
   val nickel = Input(Bool())
   val s_idle :: s_5 :: s_10 :: s_15 :: s_ok :: Nil =
      Enum(5)
 val state = Reg(init = s_idle)
 when (state === s_idle) {
   when (io.nickel) { state := s_{-}5 }
   when (io.dime) { state := s_10 }
 when (state === s_5) {
   when (io.nickel) { state := s_10 }
   when (io.dime) { state := s_15 }
  when (state === s_10) {
   when (io.nickel) { state := s_15 }
   when (io.dime) { state := s_ok }
 when (state === s_15) {
   when (io.nickel) { state := s_ok }
   when (io.dime) { state := s_-ok }
 when (state === s_ok) {
   state := s_idle
 io.valid := (state === s_ok)
```

Here is the vending machine FSM defined with switch statement:

```
class VendingMachine extends Module {
  val io = IO(new Bundle {
    val nickel = Input(Bool())
    val dime = Input(Bool())
    val valid = Output(Bool())
  val s_idle :: s_5 :: s_10 :: s_15 :: s_ok :: Nil =
       Enum(5)
  val state = Reg(init = s_idle)
  switch (state) {
    is (s_idle) {
      when (io.nickel) { state := s_{-}5 }
      when (io.dime) { state := s_{-}10 }
    is (s 5) {
      when (io.nickel) { state := s_10 }
      when (io.dime) { state := s_15 }
    is (s_{-}10) {
      when (io.nickel) { state := s_15 }
      when (io.dime) { state := s_ok }
    is (s_{-}15) {
      when (io.nickel) { state := s_ok }
      when (io.dime) { state := s_{-}ok }
    is (s_ok) {
      state := s_idle
  io.valid := (state === s_ok)
```

#### 12 Memories

Chisel provides facilities for creating both read only and read/write memories.

#### 12.1 **ROM**

Users can define read only memories with a Vec:

```
Vec(inits: Seq[T])
Vec(elt0: T, elts: T*)
```

where inits is a sequence of initial Data literals that initialize the ROM. For example, users can create a small ROM initialized to 1, 2, 4, 8 and loop through all values using a counter as an address generator as follows:

```
val m = Vec(Array(1.U, 2.U, 4.U, 8.U))
val r = m(counter(UInt(m.length.W)))
```

We can create an n value sine lookup table using a ROM initialized as follows:

```
def sinTable (amp: Double, n: Int) = {
```

```
val times =
  Range(0, n, 1).map(i => (i*2*Pi)/(n.toDouble-1) - Pi)
val inits =
  times.map(t => SInt(round(amp * sin(t)), width = 32))
Vec(inits)
}
def sinWave (amp: Double, n: Int) =
  sinTable(amp, n)(counter(UInt(n.W))
```

where amp is used to scale the fixpoint values stored in the ROM.

#### 12.2 Mem

Memories are given special treatment in Chisel since hardware implementations of memory have many variations, e.g., FPGA memories are instantiated quite differently from ASIC memories. Chisel defines a memory abstraction that can map to either simple Verilog behavioral descriptions, or to instances of memory modules that are available from external memory generators provided by foundry or IP vendors.

Chisel supports random-access memories via the Mem construct. Writes to Mems are positive-edge-triggered and reads are either combinational or positive-edge-triggered.<sup>1</sup>

Ports into Mems are created by applying a UInt index. A 32-entry register file with one write port and two combinational read ports might be expressed as follows:

```
val rf = Mem(32, UInt(64.W))
when (wen) { rf(waddr) := wdata }
val dout1 = rf(waddr1)
val dout2 = rf(waddr2)
```

If the optional parameter seqRead is set, Chisel will attempt to infer sequential read ports when the read address is a Reg. A one-read port, one-write port SRAM might be described as follows:

```
val ram1rlw =
  Mem(1024, UInt(32.W))
val reg_raddr = Reg(UInt())
when (wen) { ram1rlw(waddr) := wdata }
when (ren) { reg_raddr := raddr }
val rdata = ram1rlw(reg_raddr)
```

Single-ported SRAMs can be inferred when the read and write conditions are mutually exclusive in the same when chain:

```
val ram1p = Mem(1024, UInt(32.W))
val reg_raddr = Reg(UInt())
when (wen) { ram1p(waddr) := wdata }
.elsewhen (ren) { reg_raddr := raddr }
```

<sup>1</sup>Current FPGA technology does not support combinational (asynchronous) reads (anymore). The read address needs to be registered.

```
val rdata = ram1p(req_raddr)
```

If the same Mem address is both written and sequentially read on the same clock edge, or if a sequential read enable is cleared, then the read data is undefined.

Mem also supports write masks for subword writes. A given bit is written if the corresponding mask bit is set.

```
val ram = Mem(256, UInt(32.W))
when (wen) { ram.write(waddr, wdata, wmask) }
```

#### 13 Interfaces & Bulk Connections

For more sophisticated modules it is often useful to define and instantiate interface classes while defining the IO for a module. First and foremost, interface classes promote reuse allowing users to capture once and for all common interfaces in a useful form. Secondly, interfaces allow users to dramatically reduce wiring by supporting *bulk connections* between producer and consumer modules. Finally, users can make changes in large interfaces in one place reducing the number of updates required when adding or removing pieces of the interface.

#### 13.1 Ports: Subclasses & Nesting

As we saw earlier, users can define their own interfaces by defining a class that subclasses Bundle. For example, a user could define a simple link for handshaking data as follows:

```
class SimpleLink extends Bundle {
  val data = Output(UInt(16.W))
  val valid = Output(Bool())
}
```

We can then extend SimpleLink by adding parity bits using bundle inheritance:

```
class PLink extends SimpleLink {
  val parity = Output(UInt(5.W))
}
```

In general, users can organize their interfaces into hierarchies using inheritance.

From there we can define a filter interface by nesting two PLinks into a new FilterIO bundle:

```
class FilterIO extends Bundle {
  val x = new PLink().flip
  val y = new PLink()
}
```

where flip recursively changes the "gender" of a bundle, changing input to output and output to input.

We can now define a filter by defining a filter class extending module:

```
class Filter extends Module {
  val io = IO(new FilterIO())
  ...
}
```

where the io field contains FilterIO.

#### 13.2 Bundle Vectors

Beyond single elements, vectors of elements form richer hierarchical interfaces. For example, in order to create a crossbar with a vector of inputs, producing a vector of outputs, and selected by a UInt input, we utilize the Vec constructor:

```
class CrossbarIo(n: Int) extends Bundle {
  val in = Vec(n, new PLink().flip())
  val sel = Input(UInt(sizeof(n).W))
  val out = Vec(n, new PLink())
}
```

where Vec takes a size as the first argument and a block returning a port as the second argument.

#### 13.3 Bulk Connections

We can now compose two filters into a filter block as follows:

```
class Block extends Module {
  val io = IO(new FilterIO())
  val f1 = Module(new Filter())
  val f2 = Module(new Filter())
  f1.io.x <> io.x
  f1.io.y <> f2.io.x
  f2.io.y <> io.y
}
```

where <> bulk connects interfaces of opposite gender between sibling modules or interfaces of same gender between parent/child modules. Bulk connections connect leaf ports of the same name to each other. After all connections are made and the circuit is being elaborated, Chisel warns users if ports have other than exactly one connection to them.

#### 13.4 Interface Views

Consider a simple CPU consisting of control path and data path submodules and host and memory interfaces shown in Figure 2. In this CPU we can see that the control path and data path each connect class Dpath extends Module {



Figure 2: Simple CPU involving control and data path submodules and host and memory interfaces.

only to a part of the instruction and data memory interfaces. Chisel allows users to do this with partial fulfillment of interfaces. A user first defines the complete interface to a ROM and Mem as follows:

```
class RomIo extends Bundle {
  val isVal = Input(Bool())
  val raddr = Input(UInt(32.W))
  val rdata = Output(UInt(32.W))
}

class RamIo extends RomIo {
  val isWr = Input(Bool())
  val wdata = Input(UInt(32.W))
}
```

Now the control path can build an interface in terms of these interfaces:

```
class CpathIo extends Bundle {
  val imem = RomIo().flip()
  val dmem = RamIo().flip()
  ...
}
```

and the control and data path modules can be built by partially assigning to this interfaces as follows:

```
class Cpath extends Module {
  val io = IO(new CpathIo())
  ...
  io.imem.isVal := ...
  io.dmem.isVal := ...
  io.dmem.isWr := ...
  }
}
```

```
val io = IO(new DpathIo())
...
io.imem.raddr := ...
io.dmem.raddr := ...
io.dmem.wdata := ...
...
```

We can now wire up the CPU using bulk connects as we would with other bundles:

```
class Cpu extends Module {
  val io = IO(new CpuIo())
  val c = Module(new CtlPath())
  val d = Module(new DatPath())
  c.io.ctl <> d.io.ctl
  c.io.dat <> d.io.dat
  c.io.imem <> io.imem
  d.io.imem <> io.imem
  c.io.dmem <> io.dmem
  d.io.dmem <> io.dmem
  d.io.dmem <> io.dmem
  d.io.host <> io.host
}
```

Repeated bulk connections of partially assigned control and data path interfaces completely connect up the CPU interface.

#### 14 Functional Module Creation

It is also useful to be able to make a functional interface for module construction. For instance, we could build a constructor that takes multiplexer inputs as parameters and returns the multiplexer output:

```
object Mux2 {
  def apply (sel: UInt, in0: UInt, in1: UInt) = {
    val m = new Mux2()
    m.io.in0 := in0
    m.io.in1 := in1
    m.io.sel := sel
    m.io.out
  }
}
```

where object Mux2 creates a Scala singleton object on the Mux2 module class, and apply defines a method for creation of a Mux2 instance. With this Mux2 creation function, the specification of Mux4 now is significantly simpler.

Selecting inputs is so useful that Chisel builds it in and calls it Mux. However, unlike Mux2 defined above, the builtin version allows any datatype on in0 and in1 as long as they are the same subclass of Data. In Section 15 we will see how to define this ourselves.

Chisel provides MuxCase which is an n-way Mux

```
MuxCase(default, Array(c1 -> a, c2 -> b, ...))
```

}

where each condition / value is represented as a tuple in a Scala array and where MuxCase can be translated into the following Mux expression:

```
Mux(c1, a, Mux(c2, b, Mux(..., default)))
```

Chisel also provides MuxLookup which is an n-way indexed multiplexer:

which can be rewritten in terms of:MuxCase as follows:

Note that the cases (eg. c1, c2) must be in parentheses.

# 15 Polymorphism and Parameterization

Scala is a strongly typed language and uses parameterized types to specify generic functions and classes. In this section, we show how Chisel users can define their own reusable functions and classes using parameterized classes.

This section is advanced and can be skipped at first reading.

#### 15.1 Parameterized Functions

Earlier we defined Mux2 on Bool, but now we show how we can define a generic multiplexer function. We define this function as taking a boolean condition and con and alt arguments (corresponding to then and else expressions) of type T:

```
def Mux[T <: Bits](c: Bool, con: T, alt: T): T { ... }</pre>
```

where T is required to be a subclass of Bits. Scala ensures that in each usage of Mux, it can find a common superclass of the actual con and alt argument types, otherwise it causes a Scala compilation type error. For example,

```
Mux(c, 10.U, 11.U)
```

yields a UInt wire because the con and alt arguments are each of type UInt.

We now present a more advanced example of parameterized functions for defining an inner product FIR digital filter generically over Chisel Num's. The inner product FIR filter can be mathematically defined as:

$$y[t] = \sum_{j} w_j * x_j[t-j] \tag{1}$$

where x is the input and w is a vector of weights. In Chisel this can be defined as:

```
def delays[T <: Data](x: T, n: Int): List[T] =
   if (n <= 1) List(x) else x :: Delays(RegNext(x), n-1)

def FIR[T <: Data with Num[T]](ws: Seq[T], x: T): T =
   (ws, Delays(x, ws.length)).zipped.
   map(_ * _ ).reduce(_ + _ )</pre>
```

where delays creates a list of incrementally increasing delays of its input and reduce constructs a reduction circuit given a binary combiner function f. In this case, reduce creates a summation circuit. Finally, the FIR function is constrained to work on inputs of type Num where Chisel multiplication and addition are defined.

#### 15.2 Parameterized Classes

Like parameterized functions, we can also parameterize classes to make them more reusable. For instance, we can generalize the Filter class to use any kind of link. We do so by parameterizing the FilterIO class and defining the constructor to take a zero argument type constructor function as follow:

```
class FilterIO[T <: Data](type: T) extends Bundle {
  val x = type.asInput.flip
  val y = type.asOutput
}</pre>
```

We can now define Filter by defining a module class that also takes a link type constructor argument and passes it through to the FilterIO interface constructor:

```
class Filter[T <: Data](type: T) extends Module {
  val io = IO(new FilterIO(type))
  ...
}</pre>
```

We can now define a PLink based Filter as follows:

```
val f = Module(new Filter(new PLink()))
```

A generic FIFO could be defined as shown in Figure 3 and used as follows:

```
class DataBundle extends Bundle {
  val A = UInt(32.W)
  val B = UInt(32.W)
object FifoDemo {
  def apply () = new Fifo(new DataBundle, 32)
class Fifo[T <: Data] (type: T, n: Int)</pre>
    extends Module {
  val io = IO(new Bundle {
    val enq_val = Input(Bool())
    val enq_rdy = Output(Bool())
    val deq_val = Output(Bool())
    val deq_rdy = Input(Bool())
    val enq_dat = type.asInput
    val deq_dat = type.asOutput
  })
                   = Reg(init = 0.U(sizeof(n).W))
  val enq_ptr
  val deq_ptr
                   = Reg(init = 0.U(sizeof(n).W))
  val is_full
                   = Reg(init = false.B)
  val do_enq
                   = io.enq_rdy && io.enq_val
                   = io.deq_rdy && io.deq_val
  val do_deg
  val is_empty
                   = !is_full && (enq_ptr === deq_ptr)
  val deq_ptr_inc = deq_ptr + 1.U
  val enq_ptr_inc = enq_ptr + 1.U
  val is_full_next =
    Mux(do_enq \&\& \sim do_deq \&\& (enq_ptr_inc === deq_ptr),
        true.B.
        Mux(do_deq && is_full, false.B, is_full))
  enq_ptr := Mux(do_enq, enq_ptr_inc, enq_ptr)
  deq_ptr := Mux(do_deq, deq_ptr_inc, deq_ptr)
  is_full := is_full_next
  val ram = Mem(n)
  when (do_eng) {
    ram(enq_ptr) := io.enq_dat
  io.enq_rdy := !is_full
  io.deg val := !is empty
  ram(deq_ptr) <> io.deq_dat
```

Figure 3: Parameterized FIFO example.

It is also possible to define a generic decoupled interface:

```
class DecoupledIO[T <: Data](data: T)
    extends Bundle {
    val ready = Input(Bool())
    val valid = Output(Bool())
    val bits = data.cloneType.asOutput
}</pre>
```

This template can then be used to add a handshaking protocol to any set of signals:

```
class DecoupledDemo
  extends DecoupledIO()( new DataBundle )
```

The FIFO interface in Figure 3 can be now be simplified as follows:

```
class Fifo[T <: Data] (data: T, n: Int)
    extends Module {
  val io = IO(new Bundle {
    val enq = new DecoupledIO( data ).flip()
    val deq = new DecoupledIO( data )
  })
  ...
}</pre>
```

# 16 Multiple Clock Domains

Chisel 3.0 does not yet support of multiple clock domains. That support will be coming shortly.

# 17 Acknowlegements

Many people have helped out in the design of Chisel, and we thank them for their patience, bravery, and belief in a better way. Many Berkeley EECS students in the Isis group gave weekly feedback as the design evolved including but not limited to Yunsup Lee, Andrew Waterman, Scott Beamer, Chris Celio, etc. Yunsup Lee gave us feedback in response to the first RISC-V implementation, called TrainWreck, translated from Verilog to Chisel. Andrew Waterman and Yunsup Lee helped us get our Verilog backend up and running and Chisel TrainWreck running on an FPGA. Brian Richards was the first actual Chisel user, first translating (with Huy Vo) John Hauser's FPU Verilog code to Chisel, and later implementing generic memory blocks. Brian gave many invaluable comments on the design and brought a vast experience in hardware design and design tools. Chris Batten shared his fast multiword C++ template library that inspired our fast emulation library. Huy Vo became our undergraduate research assistant and was the first to actually assist in the Chisel implementation. We appreciate all the EECS students who participated in the Chisel bootcamp and proposed and worked on hardware design projects all of which pushed the Chisel envelope. We appreciate the work that James Martin and Alex Williams did in writing and translating network and memory controllers and non-blocking caches. Finally, Chisel's functional programming and bit-width inference ideas were inspired by earlier work on a hardware description language called Gel [2] designed in collaboration with Dany Qumsiyeh and Mark Tobenkin.

#### References

[1] Bachrach, J., Vo, H., Richards, B., Lee, Y., Waterman, A., Avižienis, Wawrzynek, J., Asanović

- Chisel: Constructing Hardware in a Scala Embedded Language. in DAC '12.
- [2] Bachrach, J., Qumsiyeh, D., Tobenkin, M. *Hardware Scripting in Gel.* in Field-Programmable Custom Computing Machines, 2008. FCCM '08. 16th.