### ECE/CS 552: INTRODUCTION TO COMPUTER ARCHITECTURE

# Project Description – Stage 1

First Stage Project Submission: <u>April 17<sup>th</sup> (30% of grade)</u> Final Stage Project Demo Due: <u>May 6<sup>th</sup> & 8<sup>th</sup> (70% of grade)</u>

The project should be completed in a group of two or three students.

Note: (rx) stands for the content of register rx.

## 1. WISC-S14 ISA Specifications

WISC-S14 is a 16-bit computer with a load/store architecture. Design, test, and synthesize this architecture using Verilog.

WISC-S14 has a register file, a 3-bit FLAG register, and sixteen instructions. The register file comprises sixteen 16-bit registers. Register \$0 is hardwired to 0x0000. Register \$15 has a special purpose. Register \$15 serves as a Return Address register for JAL instruction. The FLAG register contains three bits: Zero (Z), Overflow (V), and Sign bit (N).

WISC-S14's instructions can be categorized into four major classes: Arithmetic, Memory, Load Immediate and Control.

## 1.1 Arithmetic Instructions

Eight arithmetic and logical instructions belong to this category. They are ADD, ADDZ, SUB, AND, NOR, SLL, SRL, SRA.

The ADD, ADDZ, SUB, AND, and NOR, instructions have a three address format. The assembly level syntax for these instructions is

## Opcode rd, rs, rt

The two operands are (rs) and (rt) and the destination is register rd.

The ADD, ADDZ, and SUB instructions respectively add, add if zero, and subtract the two operands (rs – rt) in two's-complement representation and save the result in register rd (NOTE: ADDZ only stores the result if the Z flag was set by the previous instruction). NOTE: These instructions will use saturating arithmetic. Meaning if a result exceeds the most positive number  $(2^15-1)$  then the result is saturated to  $(2^15-1)$ . Likewise if the result were to underflow the most negative number  $(-2^15)$  then the result would be saturated to  $-2^15$ .

The AND, and NOR instructions respectively perform bitwise AND, and bitwise NOR, operations on the two operands and save the result in register rd.

The SLL, SRL, SRA, and instructions have the following assembly level syntax.

#### Opcode rd, rs, imm

The imm field is a 4-bit immediate operand in unsigned representation for the SLL, SRL, and SRA instructions.

SLL, SRL, and SRA shift (rs) by number of bits specified in the imm field and saves the result in register rd. SRA is shift right arithmetic, SRL is shift logical, finally SLL is shift left logical..

The machine level encoding for the eight arithmetic/logic instructions is

#### 0aaa dddd ssss tttt

where aaa represents the opcode (see Table 2), dddd and ssss respectively represent the rd and rs registers. The tttt field represents either the rt register or the imm field.

#### 1.2 Memory Instructions

There are two instructions which belong to this category: LW, SW. The assembly level syntax for the LW and SW instructions is

## Opcode rt, rs, offset

The LW instruction loads register rt with contents from the memory location specified by register rs plus the immediate offset. The signed value offset is sign-extended and added to the contents of register rs to compute the address of the memory location to load.

The SW instruction saves (rt) to the location specified by the register rs plus the immediate offset. The address of the memory location is computed as in LW.

The machine level encoding of these two instructions is

#### 10aa tttt ssss oooo

where aa specifies the opcode, tttt identifies rt, ssss identifies rs, and oooo is the offset in 2's complement representation.

## 1.3 Load Immediate Instruction

There are two instructions which belong to this category: LHB, LLB. The assembly level syntax for the LHB and LLB instructions is: LLB Rx, 0xYY

LHB Rx, 0xYY

Where Rx is the register being loaded, and 0xYY is the 8-bit immediate value. For LLB the 8-bit immediate value will be sign extended and loaded into Rx.

LHB instruction loads the most significant 8 bits of register rd with the bits in the immediate field. The least significant 8 bits of the register rd are **left unchanged**.

The machine level encoding for these instructions is

#### 101a dddd uuuu uuuu

where a is a bit to differentiate LLB vs LHB, dddd specifies the destination registers, and uuuuuuuu is the 8-bit immediate value.

## 1.4 Control Instructions

There are four instructions which belong to this category: B, JAL, JR, and HLT

The B (Branch) instruction conditionally jumps to the address obtained by adding the 9-bit immediate (signed) offset to the contents of the program counter. Assume that the value of the program counter used in this addition is the address of the next instruction (i.e., address of Branch instruction + 1).

The eight possible conditions are Equal (EQ), Not Equal (NEQ), Greater Than (GT), Less Than (LT), Greater Than or Equal (GTE), Less Than or Equal (LTE), Overflow (OVFL), and Unconditional (UNCOND). Many of these conditions are determined based on the 3-bit flag N, V, and Z. Instruction that set flags are outlined in the table below:

| Function | Flags Set |
|----------|-----------|
| ADD      | N,Z,V     |
| ADDZ     | N,Z,V     |
| SUB      | N,Z,V     |
| AND      | Z         |
| NOR      | Z         |
| SLL      | Z         |
| SRL      | Z         |
| SRA      | Z         |

The True condition corresponds to an unconditional branch. The status of the condition is obtained from the FLAG register (definition of each flag is in section 3.3). The assembly level syntax for this instruction is

#### B cond, Label

The machine level encoding for this instruction is

#### Opcode ccci iiii iiii

where ccc specifies the condition as in Table 1 and iiiiiiiii represents the 9-bit signed offset in twoscomplement representation.

|  | Table 1: | Encodii | ng for | Brancl | h cond | itions |
|--|----------|---------|--------|--------|--------|--------|
|--|----------|---------|--------|--------|--------|--------|

| ccc | Condition                                        |  |
|-----|--------------------------------------------------|--|
| 000 | Not Equal $(Z = 0)$                              |  |
| 001 | Equal $(Z = 1)$                                  |  |
| 010 | Greater Than $(Z = N = 0)$                       |  |
| 011 | Less Than (N = 1)                                |  |
| 100 | Greater Than or Equal ( $Z = 1$ or $Z = N = 0$ ) |  |
| 101 | Less Than or Equal ( $N = 1$ or $Z = 1$ )        |  |
| 110 | Overflow (V = 1)                                 |  |
| 111 | Unconditional                                    |  |

The JAL (jump and link) instruction saves the contents of the program counter (address of the Call instruction + 1) to the Return Address register (\$15) and jumps to the procedure whose starting address is partly specified in the instruction. The assembly level syntax for this instruction is

## JAL target

The machine level encoding for this instruction is

## Opcode gggg gggg gggg

where gggg gggg gggg specifies a 12-bit signed immediate offset that is added to the current PC (which is PC+1 relative to the JAL instruction PC) to form the target jump address.

The JR (jump register) instruction jumps to the address specified by (rs). When rs == \$15, JR is used as return-from-procedure-call instruction. The assembly level syntax for this instruction is

IR rs

The machine level encoding for this instruction is (ssss encodes the name of register rs)

### Opcode xxxx tttt xxxx

The HLT instruction essentially freezes the whole machine. PC stops advancing and the pipe is stalled.

Function Opcode ADD 0000 **ADDZ** 0001 SUB 0010 0011 AND **NOR** 0100 SLL 0101 **SRL** 0110 **SRA** 0111 1000 LW SW 1001 LHB 1010 LLB 1011 В 1100 1101 JAL IR 1110 HLT 1111

Table 2: Table of opcodes

# 2. Memory System

The address space of the WISC-S14 processor is 16 bits. Each memory location is 16 bits in size (word-addressable), so the total memory capacity is  $2^{16}$ x16 bits =  $2^{20}$  bits = 1 Mbit = 128 Kbytes.

For the first stage of the project, the processor will have separate instruction and data memories. The instruction memory has a 16-bit address input, a read enable, and a 16-bit data output. The data memory has a 16-bit address input, a 16-bit data input, a 16-bit data output, a write enable signal, and a read enable signal (both read and write cannot be active in the same clock cycle). If the write signal is asserted, the memory will write the data input bits to the address specified by the address. The read operation is active when the read enable signal is high. Both instruction and data memories perform their operations in a single clock cycle.

Verilog modules will be provided for both memories.

For the second stage of the project, the instruction and data memories will be replaced with cache memories backed up by a slower main memory. **Details of the timing and information of the cache and memory system will be announced later this semester.** 

# 3. Implementation

## 3.1 Pipelining

Your design must use a five stage pipeline (IF, ID, EX, MEM, WB) similar to that in the text. The design will make use of Verilog modules you developed as part of the homework assignments during the course of the semester. You must implement hazard detection so that your pipeline correctly handles all data and control dependences.

## 3.2 Reset Sequence

WISC-S14 has an active low reset input (rst\_n). Instructions are executed when rst\_n is high. If rst\_n goes low for one clock cycle, the contents of the state of the machine is reset and starts execution at address 0x0000.

## 3.3 Flags

Flag bits are stored in the FLAG register and are used in conditional jump. There are three bits in the FLAG register: Zero (Z), Overflow (V), and Sign bit (N). Only five the arithmetic instructions (ADD, ADDZ, SUB) can change all three flags (Z,OV,N). The logical instructions (AND, NOR, SLL, SRL, SRA) can change the Z FLAG.

The Z flag is set if and only if the output of the operation is zero.

The V flag is set by the ADD, ADDZ and SUB instructions if and only if the operation results in an overflow. Overflow must be set based on treating the arithmetic values as 16-bit signed integers.

The N flag is set if and only if the result of the ADD, ADDZ, and SUB instruction is negative. **Note that N** flag should be implemented as (sign-bit XOR overflow).

Other Instructions, including load/store instructions and control instructions, do not change the contents of the FLAG register.

## 4. Interface

Your top level verilog should be file called *cpu.v*. It should have a simple 3 signal interface: *clk*, *rst\_n*, and *hlt*.

| Signal Interface of <i>cpu.v</i> |                   |                                                                                                                                              |  |
|----------------------------------|-------------------|----------------------------------------------------------------------------------------------------------------------------------------------|--|
| Signal:                          | <b>Direction:</b> | <b>Description:</b>                                                                                                                          |  |
| clk                              | in                | System clock                                                                                                                                 |  |
| rst_n                            | in                | Active low reset. A low on this signal resets the processor and                                                                              |  |
|                                  |                   | causes execution to start at address 0x0000                                                                                                  |  |
| hlt                              | out               | When your processor encounters the HLT instruction it will assert<br>this signal once it is finished processing the instruction prior to the |  |
|                                  |                   | HLT. This signal should also be connected to the <i>hlt</i> input of the                                                                     |  |
|                                  |                   | RF, which causes the RF to dump its register contents at the end                                                                             |  |
|                                  |                   | of the test.                                                                                                                                 |  |

# 5. Submission Requirements

First stage project submission will be done as a tarball. You must port all of your verilog file to a CAE linux directory. It should be flat, no sub-directories. Just all your verilog file in one spot.

UnixPrompt>tar -cvf project.tar \*.v

Then submit the *project.tar* file to the dropbox for Project Stage 1 on the Learn@UW site.

## Grading criteria will be based on:

- 1. The number of our tests your design passed
- 2. The synthesizability of your design and its synthesized size if it did synthesize.
- 3. The geometric mean of your CPI