# Register File (of eight 16-bit registers)

#### DDCO Assignment 2a

October 3, 2017

In a modern computer, it typically takes hundreds of clock cycles to fetch data from main memory to the microprocessor (even cache memory reads take a few clock cycles). So if the ALU in the microprocessor operated directly on data from memory, it would spend most of its time waiting. To achieve high performance, microprocessors incorporate a register file. The register file, a key component of microprocessors, is a bank of registers used as a temporary high-speed storage into which data is fetched from memory. The register file then provides inputs to the ALU. As you may recall, in assignment 1 you constructed an ALU that received two 16-bit inputs and produced one 16-bit output.

## 1 Register File Implementation

So in this assignment, your task is to construct a register file, from which two 16-bit values can be read, and to which one 16-bit value written, every clock cycle. Therefore, the register file requires two read ports and one write port. The register file should contain eight (16-bit) registers. Each of the read and write ports need to be able address any of the eight registers. Register address varies from 0 to 7, hence each of the two read and one write addresses need to be three bits wide.

Also, register 0 should always be zero<sup>1</sup>. Specifically, when register address 0 is supplied on any of the read ports, the corresponding output should be zero, and when register address 0 is supplied on the write port, the corresponding input data should be ignored (not written to any register).

The inputs to the register file are clk, reset, wr, rd\_addr\_a[2:0], rd\_addr\_b[2:0], wr\_addr[2:0], d\_in[15:0], and its outputs are d\_out\_a[15:0], d\_out\_b[15:0]. rd\_addr\_a and d\_out\_a form read port a just as rd\_addr\_b and d\_out\_b form read port b. wr, wr\_addr and d\_in form the write port.

Read operation: For port a, the address of the register to be read is applied to rd\_addr\_a and at the next positive clock edge, the contents of

<sup>&</sup>lt;sup>1</sup>Processor architectures such as MIPS, and the legendary CDC 6600, had this feature of a register 0 permanently tied to 0. Doing so can optimize some computation, such as not initializing a register to zero before using it, zero being somewhat frequently required.

the corresponding register should be available on d\_out\_a. Port b should operate in the same manner.

Write operation: When wr is high the value applied to d\_in should, at the next clock positive clock edge, be stored in the register whose address is specified by wr\_adder. When wr is low, inputs d\_in and wr\_adder should be ignored.

So in this manner the register file should be able to perform two read operations and one write operation (if wr is high) every clock cycle.

You can construct the registers using the dfrl flip flop (positive edge triggered D flip flop with reset and load enable) supplied in lib.v. All dfrl clock and reset inputs should be connected to the global clock and reset inputs supplied to the reg\_file module.

## 2 Integrate with ALU



Figure 1: Integration of alu and reg file to form reg alu.

The two read outputs and the write input of the of the register file implemented above need to be connected respectively to the two inputs and one output of the ALU implemented in the previous assignment. Also, the register file read and write ports would need "outside" connection as well so that operands can be written in the register file and the results read out. The

register file thus needs to be integrated with the ALU, in a straightforward manner, as shown in figure 1.

Inside the reg\_alu module one simply instantiates the alu and reg\_file modules and also the 2:1 16-bit mux to mediate the writes into the register file, and wires up the above three instantiations as shown in the figure. The mux is required to select whether data to be written to the reg\_file (d\_in) comes from the alu or "outside", the sel control input to the mux being used to make the selection. Also, the cout alu output is registered in a flip-flop<sup>2</sup>. All the inputs and outputs of reg\_file are inputs and outputs of the reg\_alu module (d\_in reg\_alu input being routed via mux to d\_in reg\_file input, all other inputs and outputs are connected directly). In addition reg\_alu has as inputs the mux sel described above and the op input to the alu. Finally reg\_alu also has as an additional output the registered cout.

The reg\_file, reg\_alu and other related modules need to be submitted in the reg\_alu.v file.

#### 3 Design and Simulation

You can use the alu.v and augmented lib.v supplied for basic components. The reg\_file module can be tested using the supplied tb\_reg\_file.v with the commands:

```
iverilog -o tb_reg_file lib.v alu.v reg_alu.v tb_reg_file.v
vvp tb_reg_file
```

The reg\_alu module can be tested using the supplied tb\_reg\_alu.v with the commands:

```
iverilog -o tb_reg_alu lib.v alu.v reg_alu.v tb_reg_alu.v
vvp tb_reg_alu
```

Note that above reg\_alu.v is used in both cases (it is okay for reg\_alu.v to contain both reg\_file and reg\_alu even if you are testing just one of the two).

The general approach and the tools used remain the same as last assignment.

<sup>&</sup>lt;sup>2</sup>In a microprocessor, an instruction may make use of the cout output of the previous instruction. So it is sufficient to store cout for only one clock cycle due to which no load enable to flip flop is required, so the dfr component in lib.v can be used (dfr, though not shown in figure, has to be supplied with clk and reset as well).