### CSCI 2500 — Computer Organization Group Project (document version 1.2) — Due December 10, 2021 MIPS Processor in C

- This project is due by the Midnight EST on the above date via a Submitty gradeable.
- This project is a *group assignment*. You can have groups of up to four people.
- Start the project early. You can ask questions during office hours, in the Submitty forum, and during your lab session.
- NOTE: This document might change in the coming weeks, so please check for updates.

## 1 Project Overview

For the group project, we'll be taking Lab 5 and Lab 6 to the logical conclusion – you'll be implementing a full gate-level circuit representing the datapath for a reduced, but still Turing complete, MIPS ISA. While we have already developed our ALU, the rest of the datapath and control needs to be implemented. For most of the instructions and datapath, implementation details are provided in depth throughout the Chapter 4 slides as well as the book. However, another aspect of the project will be to expand on the provided datapath by developing your own implementations for a few other instructions. See the template file (project.c) for more details.

## 2 Supported Instructions

Our ISA will include the following instructions:

| Instruction          | Type   | Description         | Input format                            | Operation                        |
|----------------------|--------|---------------------|-----------------------------------------|----------------------------------|
| lw                   | I-type | Load Word           | lw reg1 reg2 offset                     | reg1 = M[reg2 + offset]          |
| sw                   | I-type | Store Word          | sw $reg1 reg2$ offset                   | M[reg2+offset] = reg1            |
| beq                  | I-type | Banch on equal      | beq reg1 reg2 offset                    | if $(reg1 == reg2) PC += offset$ |
| addi                 | I-type | Add immediate       | addi reg1 reg2 constant                 | reg1 = reg2 + constant           |
| and                  | R-type | Logical AND         | and reg1 reg2 reg3                      | reg1 = reg2 & reg3               |
| or                   | R-type | Logical OR          | or reg1 reg2 reg3                       | $reg1 = reg2 \mid reg3$          |
| add                  | R-type | Integer addition    | add $reg1 reg2 reg3$                    | reg1 = reg2 + reg3               |
| $\operatorname{sub}$ | R-type | Integer subtraction | sub reg1 reg2 reg3                      | reg1 = reg2 - reg3               |
| slt                  | R-type | Set less than       | slt reg $1 \text{ reg} 2 \text{ reg} 3$ | reg1 = (reg2 < reg3 ? 1 : 0)     |
| j                    | J-type | Jump                | j address                               | PC = address                     |
| jal                  | J-type | Jump and link       | jal address                             | RA = PC, PC = address            |
| jr                   | R-type | Jump register       | jr reg1                                 | PC = reg1                        |

The "input format" given above refers to the format of the assembly instructions you'll parse, convert to machine code, and then process through your circuit. **Note that for lw and sw the format is slightly different than what we've seen in the past**. This adjustment was made to simplify input processing. Also note that you'll need to map each register label (e.g., t0) to its 5-bit binary representation.

#### 2.1 Instruction Conversion

Your first step is to parse the input instructions into their 32-bit binary machine code representation. See the MIPS reference card on Submitty for the instruction format fields, the op codes, function codes, and register mappings (rs, rt, rd) for each instruction. For Lab 6, you'll be implementing this for a subset of instruction types and only the t0 and s0 registers. For this project, you'll need to consider all of the following the registers:

| Register Name       | Register # | Use                     |
|---------------------|------------|-------------------------|
| zero                | 0          | Constant value 0        |
| v0                  | 2          | Return value register 0 |
| a0                  | 4          | Argument register 0     |
| t0                  | 8          | Temporary register 0    |
| t1                  | 9          | Temporary register 1    |
| s0                  | 16         | Saved register 0        |
| s1                  | 17         | Saved register 1        |
| $\operatorname{sp}$ | 29         | Stack pointer           |
| ra                  | 31         | Return address          |

## 3 Memory

For simplicity, we'll have 3 separate  $32 \times 32$ -bit 2D arrays representing storage for the register file, instruction memory, and data memory. You won't need to construct D-flip-flops or similar circuits to store program state. For additional simplicity, memory is to be addressed on the word (**not byte**) boundary. What this means is:

- For updating your PC, you'll only need to add +1 instead of +4.
- For jump and branch addresses, you won't need to shift left by 2.
- Any jump value in an instruction will be the actual address to jump to.

To access and read/write a specific memory address, you can use the 5-to-32 decoder implemented in Lab 6 along with read/write control bits. The different memories are implemented in four separate functions, including Instruction\_Memory, Read\_Register, Write\_Register, and Data\_Memory.

#### 4 Control

There will be several sets of control bits that you'll need to determine for each instruction. These include the 4-bit ALU Control bits input to the ALU, the 2-bit ALUOp that is input into ALU Control circuit, as well as 9 other control bits. You'll primarily be calculating these values in the functions Control and ALU\_Control. The control bit are all specified below.

| Control Bit      | Size | Use                                                |
|------------------|------|----------------------------------------------------|
| ALUControl       | 4    | Output from ALU Control circuit to ALU             |
| RegDst           | 1    | TRUE if R-type instruction                         |
| Jump             | 1    | TRUE if jump instruction                           |
| Branch           | 1    | TRUE if branch instruction                         |
| ${\tt MemRead}$  | 1    | TRUE if reading from memory                        |
| ${\tt MemToReg}$ | 1    | TRUE if writing to register from memory            |
| ALU0p            | 2    | Output from Control circuit to ALU Control circuit |
| ${\tt MemWrite}$ | 1    | TRUE if writing to memory                          |
| ALUSrc           | 1    | TRUE if second input to ALU is immediate value     |
| RegWrite         | 1    | TRUE if register is being written                  |
| Zero             | 1    | TRUE if ALU result is Zero                         |

See the following tables for guidance on how to set these values for each instruction:

- ALUControl and ALUOp: Chapter-4\_MIPS\_Processor\_Pipelining-slides.pdf, slide "ALU Control"; Figure 4.13 from textbook
- Others: Chapter-4\_MIPS\_Processor\_Pipelining-slides.pdf, slides from "RegDst Control Line" to "MemToReg Control"; Figure 4.18, Figure 4.22 from textbook

However, **be careful** when implementing any SOP circuits using *only* the values in the provided tables. These tables do not include a few instructions that we're considering (j, jal, jr, addi). In addition, be aware that you'll likely to need to implement your own additional control lines for these specific instructions.

## 5 ALU, Adder, and Other Circuits

You'll need to implement additional circuitry for the ALU and 32-bit adders. You can mostly re-use your implementations from Lab 5 and Lab 6, with a couple of caveats.

- There are now four ALU control input bits, instead of just two
- We have an additional output for the control line Zero

You'll further need to implement a "sign-extender" circuit for instructions with a 16-bit immediate value field. You are also free to implement any additional "helper circuits", as you deem necessary. It will likely be useful to implement things like 3-input or 4-input AND/OR gates.

## 6 Datapath

The datapath describes the way that all of the above control lines and circuits are hooked up to each other. Recall that we're considering the entire processor as one big combinational circuit. Each subcircuit is similarly a combinational circuit of some inputs and some outputs. You'll need to implement the functionality of the datapath by feeding an output from one circuit to the input in another circuit in some specified order. A representative figure to base your implementation off of is given by slide "Datapath With Jumps Added" in Chapter-4\_MIPS\_Processor\_Pipelining-slides.pdf or in Figure 4.24 in the book. However, again be aware that this figure doesn't consider some of the additional instructions that we're including. As part of your project writeup, you'll need to show how you modified the datapath to account for these new instructions.

The datapath will primarily be implemented in the function updateState(). Although the processor we're implementing doesn't have explicit pipeline stages, it might still help you figure out your implementation by considering each pipeline stage at a time. E.g., you should process all of the circuits that would be in the DECODE stage of the pipeline before processing the circuits that would be in the EXECUTE stage.

## 7 Project Rules

The rules for your project implementation will be similar to Lab 5 and Lab 6 but with a few additions. Basically, anything you code up for your processor should be representable as some actual combinational circuit. Generally, for loops over fixed iteration counts are OK, while all if-else statements are NOT. You are free to re-use any code from any group members' prior assignments. Group work should be as evenly distributed among members as is possible. Try and start as early as is reasonably possible.

The primary purpose of the project is for you to demonstrate how basic logic gates can be combined together into complex circuits. As a rule, whatever you implement in your solution file should be a loose representation of an implementable circuit. The below are guidelines to this end:

- 1. No modifications to main().
- 2. You are allowed to implement additional basic gates e.g., a three input AND gate to simplify your other circuits.
- 3. If desired, you can modify the function prototypes given in the template.
- 4. No if() or if()-else() logical control statements, or equivalent.
- 5. You can use basic for loops only (for int i = 0; i < N; ++i). Different loop structures are not allowed, as they can easily be used as a logical equivalent to if-else statements.
- 6. No calls to any external or library functions.
- 7. The above rules are to basically ensure that you're sticking to using gates for all logical control and operations. No tricky workarounds are allowed.

### 8 Input and Output

The input format will be a list of assembly instructions. The output will be for each processed instruction: the PC, the binary instruction, the state of all 32 words of memory, and the state of all 32 registers. Generally, you should expect positive or negative numbers for all inputs except for the shift amount for shift left logical. Note that if you implement the circuits as given in the csci2500-f21-ch03b-slides.pdf, you should have to do no additional work to account for negative inputs. See below for example inputs and outputs:

```
bash$ cat arith.txt
addi t0 zero 12
addi t1 zero 13
add s0 t0 t1
sub s1 t0 t1
and a0 t0 t1
or v0 t0 t1
bash$ ./a.out < arith.txt
PC: 0
Instruction: 0010000000001000000000000001100
Instruction: 0010000000001001000000000001101
Register: 0 0 0 0 0 0 0 12 13 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 32 0 0
PC: 2
Instruction: 00000001000010011000000000100000
Register: 0 0 0 0 0 0 0 12 13 0 0 0 0 0 25 0 0 0 0 0 0 0 0 0 0 32 0 0
PC: 3
Instruction: 00000001000010011000100000100010
Register: 0 0 0 0 0 0 0 12 13 0 0 0 0 0 25 -1 0 0 0 0 0 0 0 0 32 0 0
PC: 4
Instruction: 00000001000010010010000000100100
Register: 0 0 0 0 12 0 0 0 12 13 0 0 0 0 0 25 -1 0 0 0 0 0 0 0 0 0 32 0 0
PC: 5
Instruction: 00000001000010010001000000100101
Register: 0 0 13 0 12 0 0 0 12 13 0 0 0 0 0 25 -1 0 0 0 0 0 0 0 0 0 32 0 0
bash$ cat load_store.txt
addi t0 zero 25
add t1 t0 t0
addi sp sp -2
sw t0 sp 0
```

```
sw t1 sp 1
lw s0 sp 0
lw s1 sp 1
addi sp sp 2
bash$ ./a.out < load_store.txt</pre>
Instruction: 00100000000010000000000000011001
PC: 1
Instruction: 000000010000100001001000001
PC: 2
PC: 3
PC: 4
PC: 5
PC: 6
PC: 7
```

# 9 Submission and Grading Criteria

Due to the complexity and duration of the project, there will be a few different checkpoints and deadlines that you'll need to be aware of. The final submission will consist of two parts – one being the autograded component and the other being the project writeup.

• November 10 - Lab 6: Instruction parsing and 32-bit ALU implementation

- November 19 Finalize your groups on Submitty
- December 10 Final submissions to Submitty for autograded portion. Final submissions to Submitty for project writeup.

Since final submission is due on the last day of the semester, it will not be possible to use any late days for the team project.

Do your best to stick to the above rules, as the TA grade is mostly based on your adherence to them. As a reminder, you **must** use C for the coding part of the project. Your code **must** successfully compile and run on Submitty, which uses Ubuntu v20.04.

#### 9.1 Grading Breakdown

The grading breakdown is as follows:

- 1. Autograding: 50%
  - Standard visible and hidden test cases
- 2. TA grading: 50%
  - Solution is representative of an implementable circuit
  - Code quality
    - Code has clear and logical organization and is well structured. For example, functions have well defined responsibilities, there are no huge "God" functions, arguments and return values are properly used to pass data between callers and callees, there are no (or very few) global data items, etc.
    - Code is properly formatted, commented, and consistently follows some C style guidelines (see https://www.gnu.org/prep/standards/html\_node/Writing-C.html or http://www.literateprogramming.com/indhill-annot.pdf as examples).
  - Project writeup
    - Fully explains design and implementation choices for each circuit
    - Includes modified datapath from Figure 4.24 for new instructions
    - Explains contributions of each group member