## COMS10015 lab. worksheet #10

Although some questions have a written solution below, for others it will be more useful to experiment in a hands-on manner (e.g., using a concrete implementation). The file

https://assets.phoo.org/COMS10015\_2024\_TB-4/csdsp/sheet/lab-10\_s.tar.gz

supports such cases.

## §1. C-class, or core questions

S1[C]. A theme throughout this question is the increased level of *design* required, versus implementation: you will need to think more about how to produce a solution, rather than just *re*produce one from, e.g., the lecture slot(s). Keep in mind that there is an associated, and intentional design space of valid solutions. What is presented here is therefore one approach, which emphasises clarity and ease of understanding, among many: do not automatically assume that some other, different approach is invalid.

Figure 1 describes the decoder implementation; an associated LogisimEvo implementation is reproduced within the archive provided.

• The *alu* signal controls a multiplexer towards the top of the data-path. The multiplexer output is connected to the input (or next value) of R', meaning the multiplexer selects the operation applied to an input x, e.g.,  $x = R_i$ , to produce an output r: by inspection, we can see that one of the following cases

$$alu = \begin{cases} 0 \implies r = 0 & \text{(hard-wired to constant)} \\ 1 \implies r = x + 1 & \text{(computed via adder component)} \\ 2 \implies r = x - 1 & \text{(computed via subtractor component)} \\ 3 \implies r = \text{undefined} \end{cases}$$

will apply depending on alu. Within the decoder, alu is the output of a multiplexer whose control signal is  $inst_{8,7,6}$ , i.e., the opcode field in the encoded instruction: basically, the idea is that we connect each i-th multiplexer input to whatever we want alu to be when  $inst_{8,7,6} = i$ . So, for example:

- if  $inst_{8,7,6} = 000_{(2)} = 0_{(10)}$ , this is an increment instruction: we want alu = 1 such that r = x + 1 is produced as output, so connect the 0-th multiplexer input to 1,
- if  $inst_{8,7,6} = 001_{(2)} = 1_{(10)}$ , this is an decrement instruction: we want alu = 2 such that r = x 1 is produced as output, so connect the 1-st multiplexer input to 2,
- in all other cases, we do not use r: we want alu = 0 such that r = 0 is produced as a default (or placeholder) output, so connect the i-th multiplexer input to 0 for each  $i \in \{2, 3, 4, 5, 6, 7\}$ .
- The wr signal controls a demultiplexer towards the bottom of the data-path. Each i-th demultiplexer output is connected to the enable input of  $R_i$ , meaning the demultiplexer allows (if wr = 1) or disallows (if wr = 0) update of (or write-back of a value into) that register. Within the decoder, wr is the output of a multiplexer whose control signal is  $inst_{8,7,6}$ , i.e., the opcode field in the encoded instruction: basically, the idea is that we connect each i-th multiplexer input to whatever we want wr to be when  $inst_{8,7,6} = i$ . So, for example:
  - if  $inst_{8,7,6} = 000_{(2)} = 0_{(10)}$ , this is an increment instruction: we want wr = 1 such that a write-back occurs, so connect the 0-th multiplexer input to 1,
  - if  $inst_{8,7,6} = 001_{(2)} = 1_{(10)}$ , this is an decrement instruction: we want wr = 1 such that a write-back occurs, so connect the 1-st multiplexer input to 1,
  - in all other cases, we want no write-back to occur: we want wr = 0, so connect the i-th multiplexer input to 0 for each  $i \in \{2, 3, 4, 5, 6, 7\}$ .
- The addr signal controls a multiplexer towards the middle of the data-path. Each i-th multiplexer input is connected to the output (or current value) of  $R_i$ , meaning the multiplexer selects a register to operate on. Within the decoder, addr is extracted directly from the corresponding field in the encoded instruction. That is, we have  $addr = inst_{5,4}$ .



**Figure 1:** *The decoder implementation for an example 4-register counter machine.* 

• The *target* signal acts as an input to a multiplexer towards the middle of the control-path. The multiplexer output is connected to the input (or next value) of PC', meaning the multiplexer makes control-flow decisions: by inspection, we can see that one of the following cases

$$jmp \land cmp = \begin{cases} 1 \Rightarrow PC' = target & \text{(hard-wired to signal)} \\ 0 \Rightarrow PC' = PC + 1 & \text{(computed via adder component)} \end{cases}$$

will apply depending on jmp and cmp. Put another way, the next instruction is fetched from either PC + 1 (if a branch is not required, i.e., either jmp = 0 or cmp = 0) or target (if a branch is required, i.e., both jmp = 1 and cmp = 1). Within the decoder, target is extracted directly from the corresponding field in the encoded instruction. That is, we have  $target = inst_{3,2,1,0}$ . In contrast, we can see by inspection that  $jmp = \neg inst_8 \land inst_7 \land \neg inst_6$ , or, if you prefer,

$$jmp = \begin{cases} 1 & \text{if } inst_{8,7,6} = 010_{(2)}, \text{ i.e., this is} & \text{a branch instruction} \\ 0 & \text{otherwise, i.e., this is not a branch instruction} \end{cases}$$

Note that *cmp* is produced as output from the comparator component within the data-path, not by the decoder: basically,

$$cmp = \begin{cases} 1 & \text{if } \mathsf{R}_{addr} = 0, \text{ i.e., the branch condition is} & \text{satisfied} \\ 0 & \text{if } \mathsf{R}_{addr} \neq 0, \text{ i.e., the branch condition is not satisfied} \end{cases}$$

• The *halt* signal is combined with, or, more specifically, gates the clock signals  $\Phi_1$  and  $\Phi_2$  towards the right-hand side of the data-path. The idea is that once a halt instruction is executed, the clock signals are disabled; if halt = 1, we gate  $\Phi_1$  and  $\Phi_1$  meaning, for example, that subsequent register updates are then disallowed. Within the decoder, we can see by inspection that  $halt = \neg inst_8 \land inst_7 \land inst_6$ , or, if you prefer,

$$halt = \begin{cases} 1 & \text{if } inst_{8,7,6} = 011_{(2)}, \text{ i.e., this is} & \text{a halt instruction} \\ 0 & \text{otherwise, i.e., this is not a halt instruction} \end{cases}$$

## §2. R-class, or revision questions

 $\triangleright$  **S2**[**R**]. There is a set of solutions available at

https://assets.phoo.org/COMS10015\_2024\_TB-4/csdsp/sheet/misc-revision\_s.pdf

## §3. A-class, or additional questions

 $\triangleright$  **S3[A].** A solution for the first part can be localised to the decoder. The idea is that we want to set *halt* = 1 either 1) when a halt instruction is decoded, or 2) when an invalid instruction is decoded. Put another way, we

$$halt = \begin{cases} 0 & \text{if } inst_{8,7,6} = 000_{(2)} = 0_{(10)} \\ 0 & \text{if } inst_{8,7,6} = 001_{(2)} = 1_{(10)} \\ 0 & \text{if } inst_{8,7,6} = 010_{(2)} = 2_{(10)} \\ 1 & \text{if } inst_{8,7,6} = 011_{(2)} = 3_{(10)} \\ 1 & \text{if } inst_{8,7,6} = 100_{(2)} = 4_{(10)} \\ 1 & \text{if } inst_{8,7,6} = 101_{(2)} = 5_{(10)} \\ 1 & \text{if } inst_{8,7,6} = 110_{(2)} = 6_{(10)} \\ 1 & \text{if } inst_{8,7,6} = 111_{(2)} = 7_{(10)} \end{cases}$$

There are various ways to implement this, but arguably the simplest would be to use the expression

$$halt = (\neg inst_8 \land inst_7 \land inst_6) \lor (inst_8)$$

to capture the two cases: the left-hand sub-expression identifies a halt instruction and the right-hand sub-expression identifies an invalid instruction. However, this would need to be updated to reflect the additional instructions that stem from other parts of the question. Doing so is arguably easier by adopting a multiplexer-based approach, in a similar way to, e.g., the *alu* signal: doing so means using  $inst_{8,7,6}$  as a control signal with the multiplexer selecting between 8 inputs, one for each possible opcode, which reflect the value of *halt* required for an associated instruction.

At a high level, "adding support for an instruction" involves (at least) two steps: 1) defining the semantics and encoding for the instruction, then 2) implementing changes to the control- and data-path that allow execution of instruction instances within a program. As such, we can approach the latter parts of this question using the same steps for each instruction type.

```
0B3_{(16)}
                   010110011_{(2)} \rightarrow L_0 : if R_3 = 0 then goto L_3 else goto L_1
 070_{(16)}
                   001110000_{(2)} \mapsto
                                               L_1 : R_3 \leftarrow R_3 - 1 then goto L_2
 080_{(16)}
                   010000000_{(2)}
                                       \mapsto L<sub>2</sub> : if R<sub>0</sub> = 0 then goto L<sub>0</sub> else goto L<sub>3</sub>
 097_{(16)}
                   010010111_{(2)} \rightarrow L_3 : \text{if } R_1 = 0 \text{ then goto } L_7 \text{ else goto } L_4
                   000110000_{(2)} \quad \mapsto \quad \mathsf{L}_4 \ : \mathsf{R}_3 \leftarrow \mathsf{R}_3 + 1 \ \text{then goto} \ \mathsf{L}_5
 030_{(16)} =
                   001010000_{(2)} \mapsto
                                               L_5 : R_1 \leftarrow R_1 - 1 then goto L_6
 050_{(16)}
 083_{(16)}
                   010000011_{(2)} \rightarrow L_6: if R_0 = 0 then goto L_3 else goto L_7
0AB_{(16)} =
                   010101011_{(2)} \mapsto
                                               L_7: if R_2 = 0 then goto L_{11} else goto L_8
 030_{(16)} =
                   000110000_{(2)} \rightarrow L_8 : R_3 \leftarrow R_3 + 1 then goto L_9
                   001100000_{(2)} \mapsto L_9 : R_2 \leftarrow R_2 - 1 \text{ then goto } L_{10}
 060_{(16)} =
 087_{(16)}
                   010000111_{(2)} \rightarrow L_{10} : if R_0 = 0 then goto L_7 else goto L_{11}
0C0_{(16)}
                   011000000_{(2)} \mapsto L_{11} : halt
```

**Figure 2:** A program for an example 4-register counter machine, which sets  $R_3$  equal to  $R_1 + R_2$ .

• The first step is arguably less difficult, in part because the instruction semantics are already defined; the remaining task is therefore to define an encoding for each instruction. Although we have some degree of freedom, the encoding *must* satisfy some required requirements and *will ideally* satisfy some further optional requirements. For example, it must allow instructions to be decoded unambiguously; ideally, it will also allow efficient decoding, use existing instruction formats, etc. So, for example, we could settle on

for the clear (top) and unconditional branch (bottom) instructions respectively.

• The second step is arguably more difficult, in part because the approach required is strongly dependent on 1) the instruction semantics, and 2) the control- and data-path used to support them; there is no generic approach to follow, for example. We focus on the approach itself in what follows, presenting an overview of the implementations only.

For the clear instruction, the implementation is straightforward. The multiplexer towards the top of the data-path controlled by op already allows selection of the value then written-back into a given register; this value is already 0 if op = 0. Noting that for this instruction we have  $inst_{8,7,6} = 100_{(2)} = 4_{(10)}$ , we make two changes within the decoder. First, for the multiplexer that outputs op, we connect the 4-th input to 0 so the value written-back into  $R_{addr}$  is selected to be 0. Second, for the multiplexer that outputs wr, we connect the 4-th input to 1 so write-back into  $R_{addr}$  is enabled.

For the unconditional branch instruction, the goal is to alter how control-flow decisions are made, i.e., to accommodate both (existing) conditional and (new) unconditional cases. Conceptually, the implementation is straightforward although more involved than for the clear instruction: control-flow decisions are made using a multiplexer towards the middle of the control-path, so implementation of unconditional branch instruction basically means altering the associated control signal. Within the control-path, we change the multiplexer control signal  $jmp \land cmp$  to  $jmp \land (cmp \lor uncond)$ . This means uncond can "override" cmp: provided jmp = 1, i.e., this is a branch instruction, then whenever uncond = 1 said branch will be taken irrespective of cmp. Noting that for this instruction we have  $inst_{8,7,6} = 101_{(2)} = 5_{(10)}$ , we make two changes within the decoder. First, we update the logic that produces jmp so that jmp = 1 for both the conditional and unconditional branches. We could do so by implementing the expression

```
imp = (\neg inst_8 \land inst_7 \land \neg inst_6) \lor (inst_8 \land \neg inst_7 \land inst_6),
```

or, more radically, via a similar per-instruction multiplexer style approach as used for op and wr. Second, we introduce the new output uncond: the easiest way to do so would be to simply set

$$uncond = inst_8 \land \neg inst_7 \land inst_6,$$

meaning uncond = 1 for the unconditional branch instruction only.

 $\triangleright$  S4[A]. Figure 2 describes the program implementation, which can be thought of as three parts: L<sub>0</sub> to L<sub>2</sub> clear (or zero) R<sub>3</sub>, L<sub>3</sub> to L<sub>6</sub> add R<sub>1</sub> to R<sub>3</sub>, L<sub>7</sub> to L<sub>10</sub> add R<sub>2</sub> to R<sub>3</sub>. Also note that it depends on having R<sub>0</sub> = 0, allowing

```
= (0,0,3,2,1)
 L_0 \rightarrow if R_3 = 0 then goto L_3 else goto L_1
 C_1
            (1,0,3,2,1)
            R_3 \leftarrow R_3 - 1 then goto L_2
 L_1
 C_2
             (2,0,3,2,0)
      \rightarrow if R_0 = 0 then goto L_0 else goto L_3
 L_2
 C_3
      = (0,0,3,2,0)
      \rightarrow if R<sub>3</sub> = 0 then goto L<sub>3</sub> else goto L<sub>1</sub>
 C_4
           (3,0,3,2,0)
      \rightarrow if R_1 = 0 then goto L_7 else goto L_4
 L_3
 C_5
             (4,0,3,2,0)
            R_3 \leftarrow R_3 + 1 then goto L_5
 C_6
            (5,0,3,2,1)
      \rightarrow R<sub>1</sub> \leftarrow R<sub>1</sub> – 1 then goto L<sub>6</sub>
 L_5
      = (6,0,2,2,1)
 C_7
      \rightarrow if R_0 = 0 then goto L_3 else goto L_7
 L_6
       = (3,0,2,2,1)
 C_8
       \rightarrow if R_1 = 0 then goto L_7 else goto L_4
 L_3
 C_9
            (4,0,2,2,1)
            R_3 \leftarrow R_3 + 1 then goto L_5
 L_4
C_{10}
      = (5,0,2,2,2)
     \rightarrow R<sub>1</sub> \leftarrow R<sub>1</sub> – 1 then goto L<sub>6</sub>
 L_5
C_{11} = (6,0,1,2,2)
 L_6 \rightarrow if R_0 = 0 then goto L_3 else goto L_7
C_{12} = (3,0,1,2,2)
 L_3 \rightarrow if R_1 = 0 then goto L_7 else goto L_4
C_{13}
             (4,0,1,2,2)
 L_4 \rightarrow R_3 \leftarrow R_3 + 1 then goto L_5
           (5,0,1,2,3)
C_{14} =
L_5 \sim R_1 \leftarrow R_1 - 1 then goto L_6
C_{15} = (6,0,0,2,3)
L_6 \rightarrow \text{if } R_0 = 0 \text{ then goto } L_3 \text{ else goto } L_7
C_{16} = (3,0,0,2,3)
L_3 \rightarrow if R_1 = 0 then goto L_7 else goto L_4
            (7,0,0,2,3)
C_{17}
L_7 \rightarrow if R_2 = 0 then goto L_{11} else goto L_8
C_{18} = (8,0,0,2,3)
L_8 \sim R_3 \leftarrow R_3 + 1 then goto L_9
C_{19} = (9,0,0,2,4)
 L_9 \rightarrow R_2 \leftarrow R_2 - 1 then goto L_{10}
             (10,0,0,1,4)
C_{20}
L_{10} \rightarrow \text{if } R_0 = 0 \text{ then goto } L_7 \text{ else goto } L_{11}
             (7,0,0,1,4)
C_{21}
L_7 \rightarrow if R_2 = 0 then goto L_{11} else goto L_8
C_{22} = (8, 0, 0, 1, 4)
L_8 \rightarrow R_3 \leftarrow R_3 + 1 then goto L_9
C_{23} = (9,0,0,1,5)
L_9 \rightarrow R_2 \leftarrow R_2 - 1 then goto L_{10}
            (10,0,0,0,5)
C_{24} =
            if R_0 = 0 then goto L_7 else goto L_{11}
\mathsf{L}_{10} \sim
             (7,0,0,0,5)
C_{25}
            if R_2 = 0 then goto L_{11} else goto L_8
 L_7
C_{26}
       =
             (11,0,0,0,5)
            halt
```

**Figure 3:** An example trace of the program described in Figure ??.

the construction of unconditional branches in  $L_2$ ,  $L_6$ , and  $L_{10}$ . That is, it does not utilise the additional clear and unconditional branch instructions considered above.

We can demonstrate that the program computes the correct result by producing an example trace of execution. Starting with the initial configuration

$$C_0 = (l = 0, v_0 = 0, v_1 = 3, v_2 = 2, v_3 = 1),$$

Figure 3 describes such a trace. Note that initially  $R_1 = 3$  and  $R_2 = 2$ , and the final configuration halts execution with  $R_3 = 3 + 2 = 5$  as expected.