# Slovenská technická univerzita v Bratislave Fakulta informatiky a informačných technológií FIIT-3028-52076

# Martin Dinja Simulator for instruction pipelining

Bakalárska práca

Vedúci práce: Ing. Ján Hudec PhD.

May 2025

# Slovenská technická univerzita v Bratislave Fakulta informatiky a informačných technológií

FIIT-3028-52076

# Martin Dinja Simulator for instruction pipelining

Bakalárska práca

Študijný program: Software Engineering

Študijný odbor: 9.2.5 Software Engineering

Miesto vypracovania: Institute of Informatics and Software Engineering, FIIT

STU, Bratislava

Vedúci práce: Ing. Ján Hudec PhD.

May 2025

**Declaration** 

Slovenská technická univerzita v Bratislave Ústav počítačového inžinierstva a ap-

likovanej informatiky Fakulta informatiky a informačných technológií 2024/2025

ZADANIE BAKALÁRSKEJ PRÁCE

Autor práce: Martin Dinja Študijný program: informatika Študijný odbor: in-

formatika Evidenčné číslo: FIIT-16768-120774 ID študenta: 120774 Vedúci práce:

Ing. Ján Hudec, PhD. Vedúci pracoviska: Ing. Katarína Jelemenská, PhD.

Názov práce: Simulátor pre prúdové spracovanie inštrukcií

Jazyk, v ktorom sa práca vypracuje: slovenský jazyk

Specifikácia zadania: Pre potreby výučby v predmete Princípy počítačového in-

žinierstva je účelné používať názorný simulátor pre prúdové spracovanie inštrukcií

na zvolenej architektúre procesora. Z hľadiska bezproblémovej použiteľnosti na

rozdielnych platformách navrhnuté riešenie musí byť spracované ako webová ap-

likácia. Inšpiráciou pre vývoj by mal byť aj používaný simulátor MIPSim. Ana-

lyzujte princípy prúdového spracovania inštrukcií a existujúce riešenia simulátorov

takéhoto spôsobu práce procesora. Navrhnite a implementujte webovú aplikáciu

s funkcionalitou vlastnej koncepcie simulátora prúdového spracovania inštrukcií,

ktorý bude umožňovať editovať program pre procesor so zvoleným inštrukčným

súborom, spustiť, krokovať a vykonať simuláciu prúdového spracovania inštrukcií

na procesore, nastavovať a prehliadať obsah pamäti, registrov procesora, vizuálne

sledovať stav simulácie jednotlivých fáz prúdového spracovania inštrukcií. Spracu-

jte podrobný návod na použitie navrhnutého simulátora. Riešenie overte, otestu-

jte pre praktické použitie, vyhodnoťte dosiahnuté výsledky a navrhnite možnosti

rozšírenia a zdokonalenia simulátora.

Rozsah práce: 40

Termín odovzdania práce: 12. 05. 2025

| I hereby declare that I have completed this work independently, based on consultations and using the cited literature. |
|------------------------------------------------------------------------------------------------------------------------|
| In Bratislava, 21.05.2025                                                                                              |
|                                                                                                                        |
| Martin Dinja                                                                                                           |
|                                                                                                                        |
|                                                                                                                        |

# Acknowledgment

I would like to express my gratitude to my supervisor, Ing. Ján Hudec PhD., for his guidance, support, and patience throughout the course of this thesis.

Annotation

Slovak University of Technology Bratislava

FACULTY OF INFORMATICS AND INFORMATION TECHNOLOGIES

Degree Course: Software Engineering

Author: Martin Dinja

Diploma Thesis: Simulator for instruction pipelining

Supervisor: Ing. Ján Hudec PhD.

May 2025

The aim of this thesis is to address the problem of outdated and difficult to use simulators for instruction pipelining, -mainly the archaic MIPSIM used by the Principles of Computer Engineering course at Faculty of Informatics and Information Technologies, Slovak University of Technology in Bratislava- by introducing a new simulator. It is designed to be used by students and teachers to better understand the concept of instruction pipelining and its inner workings. This thesis analyzes the instruction pipelining concept, the existing simulators, their shortcomings, and proposes a new simulator that addresses those shortcomings, and describes its implementation. The new simulator is implemented using modern web technologies and is designed to be user friendly and easy to use. The simulator is evaluated by students and teachers to determine its usability and effectiveness in teaching and learning instruction pipelining.

Anotácia

Slovenská technická univerzita v Bratislave

FAKULTA INFORMATIKY A INFORMAČNÝCH TECHNOLÓGIÍ

Študijný program: Informatika

Autor: Martin Dinja

Bakalárska práca: Simulator na prúdove spracovanie inštrukcií

Vedúci bakalárskej práce: Ing. Ján Hudec PhD.

Máj 2025

inštrukcií.

Cieľom tejto práce je vyriešiť problém zastaraných a ťažko použiteľných simulátorov pre prúdove spracovanie inštrukcií -hlavne archaického MIPSIMu používaného v predmete Základy Počítačového Inžinierstva na Slovenskej technickej univerzita v Bratislave v ústave počítačového inžinierstva a aplikovanej informatikyzavedením nového simulátora. Je navrhnutý tak, aby ho mohli používať študenti a učitelia na lepšie pochopenie koncepcie prúdoveho spracovania inštrukcií a jeho vnútorného fungovania. Táto práca analyzuje koncept prúdoveho spracovania inštrukcií, existujúce simulátory a ich nedostatky. Navrhuje nový simulátor, ktorý tieto nedostatky rieši, a opisuje jeho implementáciu. Nový simulátor je implementovaný pomocou moderných webových technológií a je navrhnutý tak, aby bol užívateľsky prívetivý a ľahko použiteľný. Simulátor je hodnotený študentmi a učiteľmi s cieľom určiť jeho použiteľnosť a účinnosť pri výučbe a učení sa pipelingu

# Contents

| 1 | Intr | oducti | ion                               | 1  |
|---|------|--------|-----------------------------------|----|
| 2 | Ana  | alysis |                                   | 3  |
|   | 2.1  | Backg  | round                             | 3  |
|   |      | 2.1.1  | Historical Context                | 3  |
|   |      | 2.1.2  | RISC vs CISC                      | 4  |
|   | 2.2  | Instru | ction Pipelining                  | 5  |
|   |      | 2.2.1  | Introduction                      | 5  |
|   |      | 2.2.2  | Instruction Pipeline Stages       | 6  |
|   |      | 2.2.3  | Pipeline Stalls                   | 6  |
|   |      | 2.2.4  | Pipeline Hazards                  | 6  |
|   |      |        | 2.2.4.1 Structural Hazards        | 7  |
|   |      |        | 2.2.4.2 Data Hazards              | 7  |
|   |      |        | 2.2.4.3 Control Hazards           | 9  |
|   | 2.3  | MIPS   |                                   | 9  |
|   |      | 2.3.1  | Overview                          | 9  |
|   |      | 2.3.2  | Registers                         | 10 |
|   |      |        | 2.3.2.1 General-Purpose Registers | 10 |
|   |      |        | 2.3.2.2 \$at Register             | 11 |

|     |         | 2.3.2.3    | Floating Point Registers                  | 12 |
|-----|---------|------------|-------------------------------------------|----|
|     |         | 2.3.2.4    | HI and LO Registers                       | 12 |
|     | 2.3.3   | Instructi  | on Formats                                | 12 |
|     |         | 2.3.3.1    | R-type Format                             | 13 |
|     |         | 2.3.3.2    | I-type Format                             | 14 |
|     |         | 2.3.3.3    | J-type Format                             | 15 |
|     | 2.3.4   | Assembl    | y Language                                | 15 |
|     | 2.3.5   | MIPS Pi    | peline                                    | 17 |
|     |         | 2.3.5.1    | Explanation of General Symbols            | 18 |
|     |         | 2.3.5.2    | Instruction Fetch (IF) Stage              | 19 |
|     |         | 2.3.5.3    | Instruction Decode (ID) Stage             | 19 |
|     |         | 2.3.5.4    | Execute (EX) Stage $\dots$                | 20 |
|     |         | 2.3.5.5    | Memory Access (MEM) Stage                 | 21 |
|     |         | 2.3.5.6    | Write-Back (WB) Stage                     | 22 |
|     | 2.3.6   | Pipeline   | Hazards                                   | 22 |
|     | 2.3.7   | Floating   | Point Operations                          | 23 |
|     | 2.3.8   | Expandi    | ng the design with more jump capabilities | 24 |
|     | 2.3.9   | Example    | execution of each instruction type        | 25 |
|     |         | 2.3.9.1    | R-type Instruction Execution              | 26 |
|     |         | 2.3.9.2    | J-type Instruction Execution              | 30 |
| 2.4 | Existin | ng Solutio | ns Analysis                               | 31 |
|     | 2.4.1   | Overview   | v                                         | 31 |
|     | 2.4.2   | MIPSIM     | Case Study                                | 32 |
|     | 2.4.3   | Other So   | olutions Analysis                         | 33 |
|     | 2.4.4   | MIPS pi    | peline simulator requirements             | 36 |
| 2.5 | Featur  | e Analysi  | s                                         | 36 |
|     | 2.5.1   | Instructi  | on Input                                  | 37 |

# Contents

|              |      | 2.5.2    | Instruction Execution  | 37 |
|--------------|------|----------|------------------------|----|
|              |      | 2.5.3    | Visualization          | 37 |
|              |      | 2.5.4    | Interactability        | 38 |
|              |      | 2.5.5    | User Interface         | 38 |
|              |      | 2.5.6    | Accessibility          | 39 |
|              | 2.6  | Analys   | sis Conclusion         | 39 |
| 3            | Solı | ıtion F  | Proposal               | 1  |
| 4            | Imp  | lemen    | tation                 | 3  |
| 5            | Solı | ıtion V  | Verification           | 5  |
| 6            | Con  | clusio   | n                      | 7  |
|              | 6.1  | Summ     | ary                    | 7  |
|              | 6.2  | Future   | e Work                 | 7  |
| $\mathbf{A}$ | Firs | st App   | $\operatorname{endix}$ | 15 |
| В            | Con  | itents o | of Included CD–ROM     | 17 |

# Chapter 1

# Introduction

Instruction pipelining is an interestingly simple concept... But to truly understanding this idea of parallelism can be challenging. It feels like reading music without ever hearing a song. The flow, the stalls, the hazards—they were words without a melody. And many tools that try to help with visualizing these notes are outdated in today's day and age. This thesis was born out of a desire to transform that silence into a beautiful symphony, crafting an instrument that would allow learners to not just see, but experience the song of instruction pipelining.

In contrast to how a non pipelined processor works, a pipelined processor executes multiple instructions simultaneously. One at each stage of the pipeline. This allows for a higher throughput of instructions, but also introduces a new set of challenges.

The problem is that there is no modern, accessible, user-friendly tool that allows for the visualization of instruction pipelining in a way that is easy to understand. Most tools are either outdated, too complex to use, or lack the necessary features to provide a comprehensive learning experience. This creates a barrier to entry, which makes it difficult for students to learn about this interesting topic.

To bridge the gap. This thesis aims to analyze existing solutions, identify their limitations, and propose a comprehensive solution in the form of a web-based instruction pipeline simulator.

# Chapter 2

# Analysis

# 2.1 Background

## 2.1.1 Historical Context

Pipelining as a concept existed far before even the first definitive RISC processor was developed. The machines like the IBM 7030(1959) and CDC 6600(1964) we're the first general-purpose pipelined machines. After that pipelining was constantly improving and by the 1980s most of the terminology and concepts we use today were already established[13]. In the 1970s and 1980s in response to the increasing complexity of CISC(Complex Instruction Set Computer) processors researchers realized that simplifying the instruction set could lead to better performance. That is because in CISC "25% of the instructions in the instructions' set make up 95% of the execution time. This means that about 75% of the hardware-supported instructions often are not used at all"[7]. This insight is how the first RISC(Reduced Instruction Set Computer) projects were developed[1] and they were originally designed to have pipelining in mind[13]. The first RISC system

- as considered by Michael J. Flynn - was the IBM 801[3], which was developed in the early 1980s in an effort to make a digital telephone switch able to handle 300 phone calls per second[2]. The most notable mention especially in the context of this thesis would be the Stanford MIPS architecture developed by John L. Hennessy and David A. Patterson in the 1980s. Its main goal was to attain high performance in the execution of compiled code by closely relating the instruction to the microcode instructions[6]. MIPS is still used today in embedded systems, networking, and educational settings. For a more detailed overview of the history of pipelining and RISC, see "The history and use of pipelining computer architecture" by Pantazi et al.[13] and "The evolution of RISC technology at IBM" by Cocke and Markstein et al.[2].

# 2.1.2 RISC vs CISC

The main difference between RISC and CISC is the complexity of the instruction set. CISC processors have complex instructions requiring multiple clock cycles and the use of microcode to execute, while RISC processors have simple instructions that can be executed in a single clock cycle[7]. This makes RISC processors easier to implement and optimize, as well as more efficient in terms of power consumption and performance. And since the simple instructions can be executed in a single clock cycle, this makes RISC processors more suitable for pipelining. CISC processors, on the other hand, are more difficult to pipeline due to the complexity of the instructions. However, when it comes to program size, Compiled CISC programs are generally smaller than compiled RISC programs, because CISC instructions are more powerful and can perform more operations in a single instruction[7]. This is why CISC processors are still used in many applications, such as desktop computers and servers, because of the backwards compatibility with older software and the ability to perform complex operations in a single instruction. RISC proces-

sors, on the other hand, are more commonly used in embedded systems, mobile devices, and other applications where power consumption and performance are critical [17].

# 2.2 Instruction Pipelining

## 2.2.1 Introduction

Pipelining is a method used to enhance a processor's throughput by overlapping the execution phases of multiple instructions. By dividing the execution of an instruction into several stages and executing multiple instructions concurrently, the processor can handle several instructions simultaneously, thereby increasing its throughput[11]. Figure 2.1 illustrates the concept of pipelining.

|               | t0 | t1 | t2 | t3  | t4  | t5  | t6  | t7  | t8 |
|---------------|----|----|----|-----|-----|-----|-----|-----|----|
| Instruction 1 | IF | ID | EX | MEM | WB  |     |     |     |    |
| Instruction 2 |    | IF | ID | EX  | MEM | WB  |     |     |    |
| Instruction 3 |    |    | IF | ID  | EX  | MEM | WB  |     |    |
| Instruction 4 |    |    |    | IF  | ID  | EX  | MEM | WB  |    |
| Instruction 5 |    |    |    |     | IF  | ID  | EX  | MEM | WB |

Figure 2.1: Pipelining

As seen in Figure 2.1, this example pipeline is divided into five stages, and looking at any column we can see what instruction would be in what stage at the given time. For example, at time  $t_3$ , instruction 1 is in the MEM stage, instruction 2 is in the EX stage, instruction 3 is in the ID stage and instruction 4 is in the IF stage. This is how pipelining works and what is meant by overlapping the execution phases of multiple instructions. The stages of the pipeline are described in the next section 2.2.2.

# 2.2.2 Instruction Pipeline Stages

The classical RISC pipeline consists of five stages: instruction fetch, instruction decode, execute, memory access, and write-back [19][5].

- Instruction Fetch (IF): Fetches the instruction from memory using the program counter, which is then incremented.
- Instruction Decode (ID): Decodes the instruction to determine the operation and fetches operands from the register file.
- Execute (EX): Performs the operation on the fetched operands.
- Memory Access (MEM): Accesses memory for load/store instructions; otherwise, bypassed.
- Write-back (WB): Writes the result back to the destination register.

# 2.2.3 Pipeline Stalls

Pipeline stalls, also known as pipeline bubbles, are empty stages that occur when an instruction cannot move on to the next stage due to a hazard or dependency. Stalls decrease the efficiency of the pipeline and result in wasted clock cycles. [14].

# 2.2.4 Pipeline Hazards

A hazard is a condition that occurs when the pipeline cannot execute the next instruction in the cycle, or its execution will lead to unintended behavior. There are three types of hazards: structural, data, and control hazards[11].

#### 2.2.4.1 Structural Hazards

Structural hazards occur when the hardware cannot support the combination of instructions in the pipeline. For example, if two instructions require the same hardware resource, such as the register control unit. This stalls the pipeline or corrupts a result. There are multitude of ways to avoid them, such as scheduling instructions to avoid conflicts, or adding extra hardware to support multiple instructions at once[16].

#### 2.2.4.2 Data Hazards

Data hazards occur when an instruction depends on a previous instruction. There are four main types of data hazards: read-after-write (RAW), write-after-read (WAR), write-after-write (WAW), and read-after-read (RAR).

Read-After-Write (RAW): Also known as a true dependency it occurs when an instruction reads a register before a previous instruction writes to it.

**Example**: Consider the following instructions:

add \$t0, \$t1, \$t2 sub \$t3, \$t0, \$t4

'add' writes to register \$t0, and 'sub' reads from it, so the 'sub' instruction cannot be executed until the 'add' instruction has completed.

**Solution**: Use techniques such as forwarding (or bypassing), where we add extra paths to the pipeline to allow the result of an instruction to be forwarded to the next instruction that needs it.

Write-After-Read (WAR): Also known as an anti-dependency it occurs when an instruction writes to a register after a previous instruction reads from it.

**Example**: Consider the following instructions:

```
add $t1, $t0, $t2
sub $t0, $t3, $t4
```

'sub' tries to write to register \$t0 after 'add' has read from it. **Solution**: This hazard is rare in RISC architectures and nonexistent in the classical RISC pipeline since reads are done in the ID stage and writes in the WB stage. Otherwise, it can be avoided due to the use of register renaming, which ensures that each instruction has a unique destination register.

Write-After-Write (WAW): Also know as an output dependency it occurs when two instructions write to the same register.

**Example**: Consider the following instructions:

```
add $t0, $t1, $t2
sub $t0, $t3, $t4
```

'sub' tries to write to register \$t0 before 'add' has completed writing to it.

**Solution**: Similar to WAR hazards, nonexistent in the classical RISC pipeline since register writes are in order. Otherwise, register renaming can be used to avoid WAW hazards by ensuring that each write operation targets a unique register.

**Read-After-Read (RAR)**: Occurs when read operations are performed sequentially.

Further detail is unnecessary because it is immediately obvious that this hazard is not possible in classical RISC pipelines, as reads do not change the state of the registers.

#### 2.2.4.3 Control Hazards

Control hazards occur when the processor cannot determine the instruction to fetch next due to a branch instruction changing the program counter. This can lead to incorrect execution of instructions and wasted clock cycles. There are several techniques to mitigate control hazards, such as branch prediction, delayed branching and branch target buffers.[12]

# 2.3 MIPS

### 2.3.1 Overview

MIPS, which stands for Microprocessor without Interlocked Pipeline Stages, is a RISC architecture that has made a significant impact in the world of computing. Developed by MIPS Tech LLC[9], a company founded in 1984 by Stanford University researchers including John L. Hennessy and David A. Patterson, MIPS has become a cornerstone in embedded systems, networking, and education. Its load/store architecture and 5-stage pipeline, detailed in section 2.3.5, make it a model of simplicity and efficiency. This straightforward design is why MIPS is an excellent choice for teaching computer architecture. Especially because "Computer Organization and Design" by Patterson and Hennessy[14], which can be considered the bible of computer architecture, uses MIPS as the primary example throughout the book. Over the years, MIPS has evolved through several versions, including MIPS I, II, III, IV, V, MIPS32, and MIPS64, each adding new features and capabilities. In this section, we'll focus on the MIPS I architecture to grasp the fundamental concepts of the MIPS architecture and the inner workings of the MIPS pipeline.

# 2.3.2 Registers

The MIPS architecture has 32 32-bit general-purpose registers and 32 floating point registers, numbered from \$0 to \$31 and \$f0 to \$f31 respectively. For integer multiplication and division, there are two special registers, HI and LO, which store the high and low 32 bits of the result. And of course the program counter, which is used to store the address of the next instruction to be executed. The MIPS registers are used to store data, addresses, and control information, and are accessed using load and store instructions. The registers are divided into several categories, each with a specific purpose.

### 2.3.2.1 General-Purpose Registers

General purpose registers are used to store data and perform arithmetic and logical operations, pass arguments to functions, store return values, and hold temporary values. Even though these registers could be used for any purpose, there are conventions on how they should be used:

| Register Number | Name      | Description              |
|-----------------|-----------|--------------------------|
| \$0             | \$zero    | Constant value 0         |
| \$1             | \$at      | Assembler temporary      |
| \$2-\$3         | \$v0-\$v1 | Function return values   |
| \$4-\$7         | \$a0-\$a3 | Function arguments       |
| \$8-\$15        | \$t0-\$t7 | Temporary registers      |
| \$16-\$23       | \$s0-\$s7 | Saved registers          |
| \$24-\$25       | \$t8-\$t9 | More temporary registers |
| \$26-\$27       | \$k0-\$k1 | Reserved for OS kernel   |
| \$28            | \$gp      | Global pointer           |
| \$29            | \$sp      | Stack pointer            |
| \$30            | \$fp      | Frame pointer            |
| \$31            | \$ra      | Return address           |

Table 2.1: MIPS Registers and Their Names

## 2.3.2.2 \$at Register

The \$at register is reserved for the assembler to store temporary values during translation of high-level code to machine code. It is not preserved across function calls.

## \$v0-\$v1 Registers

These registers store function return values.

#### \$a0-\$a3 Registers

These registers pass arguments to functions.

#### \$t0-\$t9 Registers

Temporary registers for storing values that do not need to be preserved across function calls.

### \$s0-\$s7 Registers

Saved registers that preserve values across function calls.

### \$k0-\$k1 Registers

Reserved for the operating system kernel.

## \$gp Register

The global pointer register, used to store the base address of the global data segment. It is preserved across function calls.

#### \$sp Register

The stack pointer register, which tracks the top of the stack. It is preserved across function calls.

### \$fp Register

The frame pointer register, used to access the current stack frame, local variables, and function arguments. It is preserved across function calls.

### \$ra Register

The return address register, which stores the address to return to after a function call. It is preserved across function calls.

### 2.3.2.3 Floating Point Registers

MIPS supports the IEEE 754 single precision and double precision formats[14] where the single precision format is 32 bits long and the double precision format is 64 bits long. A double precision floating point register is really an even-odd pair of single precision registers[14]. For example \$f2 and \$f3 together form a double precision register referred to as \$f2, and same for all the other even-odd pairs. The floating point registers are used for floating point operations and are not preserved across function calls. They are numbered from \$f0 to \$f31 and don't have a definitive name and usage conventions. MIPS does not directly support floating point arithmetic[6], instead, these operations are implemented with integer operations or by using a coprocessor.

#### 2.3.2.4 HI and LO Registers

The HI and LO registers are used to store the high and low 32 bits of the result of a multiplication or division operation. The HI register stores the high 32 bits, and the LO register stores the low 32 bits. These registers are not preserved across function calls.

### 2.3.3 Instruction Formats

The MIPS instruction set consists of three instruction formats: R-type, I-type, and J-type. Each format is used for different types of instructions, such as arithmetic, logical, and control instructions. The R-type format is used for arithmetic and logical instructions, the I-type format is used for immediate instructions, and the J-

type format is used for jump instructions. The machine format of each instruction is 32 bit.

## 2.3.3.1 R-type Format

The R-type format is used for arithmetic and logical instructions that operate on registers. It takes 3 arguments: the destination register(rd), the source register(rs), and the second source register(rt). Example:

The machine format of the instruction is as follows:

| opcode | rs     | $\mathbf{rt}$ | $^{\mathrm{rd}}$ | shamt  | funct  |
|--------|--------|---------------|------------------|--------|--------|
| 6 bits | 5 bits | 5 bits        | 5 bits           | 5 bits | 6 bits |

Table 2.2: R-type Format

• **opcode**: The operation code.

• **rs**: The source register.

• rt: The second source register.

• rd: The destination register.

• **shamt**: The shift amount.

• funct: The function code.

The interesting thing in the R-type format is that the opcode is always 0, and the funct field is used to determine the operation to be performed. And shamt is used in shift instructions such as sll, srl, sra.

### 2.3.3.2 I-type Format

The I-type format is used for immediate instructions that operate on registers and immediate values. It has 3 arguments: the source register(rs), the destination register(rd), and the immediate value. Example:

```
addi rd, rs, imm
```

The machine format of the instruction is as follows:

| opcode | rs     | rt     | imm     |
|--------|--------|--------|---------|
| 6 bits | 5 bits | 5 bits | 16 bits |

Table 2.3: I-type Format

• opcode: The operation code.

• **rs**: The source register.

• rt: The destination register.

• imm: The immediate value.

Other than just performing arithmetic operations, the I-type format is also used for load and store instructions, and branch instructions.

Examples:

```
lw rt, imm(rs) # Load word
sw rt, imm(rs) # Store word
beq rs, rt, imm # Branch if equal
```

The load and store instructions are the only instructions that have a different assembly syntax then the rest of the instructions.

### 2.3.3.3 J-type Format

The J-type format is used for jump instructions. It has only one argument: the jump address. Example:

#### j addr

The machine format of the instruction is as follows:

| opcode | addr    |
|--------|---------|
| 6 bits | 26 bits |

Table 2.4: J-type Format

- opcode: The operation code corresponding to the jump instruction.
- addr: The pseudo-direct address of the jump target. The 32-bit address is formed by shifting the 26 bit address left by 2 bits and concatenating the 4 most significant bits of the program counter.

The J-type format is used for jump and jump-and-link instructions. The jump-and-link instruction stores the return address in register \$ra before jumping to the target address.

# 2.3.4 Assembly Language

The conventional general notation for MIPS assembly language is as follows:

The label is optional and is used to mark a specific location in the program. The instruction is the operation to be performed, and the arguments are the operands of the instruction. The arguments can be registers, immediate values, or memory addresses depending on the instruction. The arguments are separated by commas, and the instruction is followed by a newline character. The assembly language

is case-insensitive, and whitespace is used to separate tokens. Comments can be added using the '#' character, and the assembler ignores everything after it. The only exception to this rigid structure is the load and store instructions, which have a different syntax. Example:

The load instruction loads a word from memory at the address rs+imm and stores it in register rt. The store instruction stores the word in register rt to memory at the address rs+imm.

Of course an assembly language can be designed in any way, but this is the most common way MIPS assembly is written.

Instructions can be categorized into six groups: arithmetic, logical, comparison, data transfer, conditional branch, and unconditional jump.

The arithmetic, logical and comparison instructions operate on registers and immediate values, while the data transfer instructions load and store data from memory. The conditional branch instructions change the program counter based on a condition, and the unconditional jump instructions jump to a specific address. Table 2.5 shows examples of instructions from each group.

| Group              | Example Instruction | Description                                             |
|--------------------|---------------------|---------------------------------------------------------|
| Arithmetic         | add rd, rs, rt      | Add rs and rt and store the result in rd                |
| Logical            | and rd, rs, rt      | Bitwise AND of rs and rt and store the result in rd     |
| Comparison         | slt rd, rs, rt      | Set rd to 1 if rs $<$ rt, 0 otherwise                   |
| Data Transfer      | lw rt, imm(rs)      | Load word from memory at address rs+imm and store in rt |
| Conditional Branch | beq rs, rt, label   | Branch to label if rs and rt are equal                  |
| Unconditional Jump | j addr              | Jump to address addr                                    |

Table 2.5: Example MIPS Assembly Instructions for Each Group

There also exist exception handling instructions that are used to handle exceptions

and interrupts. However, these are not used in the context of this thesis.

## 2.3.5 MIPS Pipeline

The MIPS pipeline consists of five stages: instruction fetch(IF), instruction decode(ID), execute(EX), memory access(MEM), and write-back(WB). Each stage is responsible for a specific part of the instruction execution process allowing it to overlap the execution of multiple instructions to improve performance and throughput. Here each stage will be described in detail according to the design shown in Figure 2.2, which is the same design described in Patterson and Hennessy's "Computer Organization and Design" book[14].



Figure 2.2: High-Level MIPS Pipeline Diagram with Control Unit

Figure 2.2 shows a high-level diagram of the MIPS pipeline with a control unit. It can get even more complex with the addition of forwarding and hazard detection units. But this current state is sufficient for the explanation of the pipeline stages.

Note: This design doesn't feature support for any J-type instructions and many other more complicated instructions for simplicity's sake, and since they are not necessary to make a turning complete processor. Also, note that the J-type could be simulated by using an I-type branch instructions whose condition is a tautology. For example, beq\$zero, \$zero, label would be equivalent to a jump instruction. MIPSIM is based on a design that handles jump instructions in such a way. But adding support for real unconditional jump instructions is not that difficult and will be covered in section 2.3.8.

### 2.3.5.1 Explanation of General Symbols

Before diving into the details of each pipeline stage, it is important to understand the general symbols used in the pipeline diagram.

Multiplexers (MUX): The rounded rectangles represent multiplexers, which select one of the input signals based on a control signal. In the diagram there are only 2-to-1 multiplexers which only require one bit of control signal to select between two inputs. But in more complex designs, there could be 3 or 4-to-1 multiplexers which would require more bits of control signal.

**PC:** The small square box with PC written in it represents the program counter register, which stores the address of the next instruction to be fetched.

**Stage Register:** The vertical dashed rectangles represent the stage registers. There are 4 of them, one for each transition between stages. A stage register stores the necessary values passed on from the previous stage to the next. It acts as a buffer between stages, allowing them to operate independently.

**Buses:** The lines connecting everything together, both dashed and solid, are buses, which carry data and control signals between components. A bus that carries data is a data bus, and a bus that carries control signals is a control bus.

Control Signals: The control signals are dashed lighter lines originating from the Control unit. They are generated by the control unit and determine which operations to perform at each stage. Which input should a multiplexer select, which operation should the ALU perform, etc.

#### 2.3.5.2 Instruction Fetch (IF) Stage

The IF stage is responsible for fetching the instruction from memory using the program counter(PC), which is incremented by 4 after each fetch to point to the next instruction. After the instruction has been fetched from memory, it is saved to the IF/ID stage register to be passed to the next stage.

### 2.3.5.3 Instruction Decode (ID) Stage

The ID stage is responsible for generating the control signals for the pipeline and decoding the instruction to determine the operation to be performed.

The control unit receives the operation code of the instruction and generates the necessary control signals for the pipeline. The control signals are then passed to other stages through the stage registers and get used to control the multiplexers and other components in the pipeline. Note that the *RegWrite* signal is not immediately 'applied' in the ID Stage; instead, it is carried all the way to the write-back stage, where it is then applied to the register subsystem. Since this diagram is a high-level diagram, this nuance is not shown to avoid making the diagram more cluttered, inconsistent, and harder to read.

The Register Subsystem is responsible for reading the registers rs and rt from the

register file and passing the values to the EX stage. The register file is read using the rs and rt fields of the instruction, and the 32-bit values are passed to the EX stage through the ID/EX stage register.

The 16-bit immediate value is sign-extended to 32 bits and passed to the EX stage, as well as instruction bits from 20-16 and 15-11 which are used to detriment the write register used in the WB stage. This is because the I and R type instructions have their destination register at different positions in the instruction, so depending on the instruction type, the correct bits representing the destination register are passed to the WB stage.

### 2.3.5.4 Execute (EX) Stage

The EX stage is responsible for performing the operation on the operands received from the ID stage, calculating the branch address for branch instructions and determining the write register.

The ALU performs the operation based on the ALUOp control signal, which makes its way to the ALU control unit. In this simplified design this signal has two bits which is all we need to be able to specify which operation to carry out to satisfy load/store instructions, branch instructions, or R-type arithmetic instructions.

| Instruction | ALUOp | Function                 |
|-------------|-------|--------------------------|
| Load/Store  | 00    | add                      |
| Branch      | 01    | subtract                 |
| R-Type      | 10    | dependent on funct field |

Table 2.6: ALUOp Control Signal Values

In more complex designs, the ALUOp signal would have more bits to allow for or, xor, and, etc. operations. This signal gets passed to the ALU control unit which

generates the ALU control signal.

The ALU receives the two operands from the ID stage and performs the operation based on the ALU control signal. The ALU control also receives the *funct* field or the lower 6 bits of the instruction, which is used to determine the operation to be performed by the ALU if the instruction is an R-type instruction. The result of the operation is then passed to the MEM stage through the EX/MEM stage register.

Branch address calculation is done by shifting the immediate value left by 2 bits and adding it to PC+4. The result is then passed to the MEM stage through the EX/MEM stage register.

And lastly, the write register is determined by the RegDst control signal, which is passed to the MEM stage through the EX/MEM stage register.

## 2.3.5.5 Memory Access (MEM) Stage

The MEM stage is responsible for accessing memory for load and store instructions, and branching to the target address for branch instructions.

For load instructions, the data is read from memory at the address calculated in the EX stage and passed to the WB stage through the MEM/WB stage register. And for store instructions, the data is written to memory at the address calculated in the EX stage. The *MEMWrite* and *MEMRead* control signals determine whether to write or read from memory. If both are set to 0, the Data Memory unit is unused.

For branch instructions we have an AND gate that checks if the ALU Zero control signal is set to 1, and if it is, that means the condition that was evaluated in the EX stage is true. Based on that and the branch control signal, the multiplexer is

sets either the branch address calculated in the EX stage or the PC+4 value as the new PC value.

## 2.3.5.6 Write-Back (WB) Stage

The WB stage is responsible for writing the result of the operation to the destination register. Using the *MEMToReg* control signal, the multiplexer selects if the load data from the MEM stage or the ALU result from the EX stage should be written to the register file. The register file is then written to if the *RegWrite* control signal is set to 1.

## 2.3.6 Pipeline Hazards

From the described pipeline stages, it would be immediately noticeable to someone somewhat versed in computer architecture that this pipeline design is very simple and doesn't handle any kind of hazards. The first issue that would arise is the structural hazard that could occur when reading and writing to the register file, since we have two stages that use the register subsystem at the same time. The second issue would be data hazards, especially read-after-write hazards, since the pipeline doesn't have any kind of forwarding or bypassing unit. And the third issue would be control hazards, since the pipeline doesn't have any kind of branch prediction or branch target buffer, and also since there is no flushing feature that would eliminate unwanted instructions that entered the pipeline before a branch instruction had the chance to update the PC. These are all issues that need to be addressed in a more complex pipeline design and far outpace the scope of this section. For a more detailed understanding of hazards and the design of pipelines that handle them, refer to "The Processor" chapter in "Computer Organization and Design" by Patterson and Hennessy[14].

However, it is still interesting to understand how these hazards would theoretically be handled in this pipeline design.

For structural hazards, the simplest solution is to perform write operations at the beginning of a clock cycle and read operations at the end of it. More technically, this means writing at the rising edge of the clock and reading at the falling edge.

For data hazards, one solution would be stall the pipeline until the data is ready, which is slow, or a better solution would be adding a forwarding unit that would allow the result of an instruction to be forwarded to the next instruction that needs it. One case where forwarding cannot save the day is when an instruction tries to read a register following a load instruction that writes the same register [14]. In that case in addition to a forwarding unit, a load-use hazard detection unit that would stall the pipeline until the load instruction has completed would be necessary.

For control hazards, solutions include stalling the pipeline or introducing a flushing mechanism to clear stage registers holding instructions that should not be executed. To improve the performance of determining when to flush, a branch prediction unit can be added. This unit predicts the outcome of branch instructions and decides whether to flush based on the prediction [14].

## 2.3.7 Floating Point Operations

It may have been noticed that floating point operations have not been discussed until now. This is because the MIPS architecture does not directly support floating point arithmetic, as mentioned in section 2.3.2.3. Instead, floating point operations are typically handled by a coprocessor, such as the MIPS Floating Point Unit (FPU)[10].

The FPU is a specialized unit designed to handle floating point operations, communicating with the main processor through a coprocessor interface. It comes equipped with its own set of registers, control signals, and instructions tailored for floating point arithmetic[14]. The MIPS architecture provides specific instructions for loading and storing floating point values, performing arithmetic operations, and converting between integer and floating point formats.

Typically, the FPU is either implemented as a separate chip or integrated into the main processor as a coprocessor. The main reason to have it on a separate chip is because of the lack of transistors in the 1980's. But since the early 1990s, microprocessors have integrated floating point (and just about everything else) on chip[14].

## 2.3.8 Expanding the design with more jump capabilities

As it currently stands, the pipeline design doesn't support unconditional jump instructions. But no worries since adding that is not too complicated. Here is a diagram of the pipeline with support for jump instructions.



Figure 2.3: High-Level MIPS Pipeline Diagram with Control Unit and Support for Unconditional Jump Instructions

The only thing that was added was a multiplexer that selects the jump address from the ID stage if the instruction is a jump instruction. In this implementation the entirety of the jump executes in the ID stage. The jump address gets calculated, and the control unit generates the high Jump control signal and immediately passes it to the multiplexer without going through the stage registers. This extended design will suffice for the purposes of explaining each instruction type execution in the pipeline which will be done in the next section.

## 2.3.9 Example execution of each instruction type

In this section, the execution of each instruction type in the MIPS pipeline will be discussed. This will include the subtypes of each instruction type, such as load and store instructions for the I-type format.

### 2.3.9.1 R-type Instruction Execution

### Arithmetic and logical R-type instructions

add \$t0, \$t1, \$t2

Machine format:

| opcode | $\mathbf{r}\mathbf{s}$ | $\mathbf{rt}$ | $\operatorname{rd}$ | shamt | funct |
|--------|------------------------|---------------|---------------------|-------|-------|
| 0      | 9                      | 10            | 8                   | 0     | 32    |

Table 2.7: ADD Instruction Machine Format

#### ADD Control signals:

| Control Signal | Value |
|----------------|-------|
| ALUOp          | 10    |
| ALUSrc         | 0     |
| RegDst         | 1     |
| MemRead        | 0     |
| MemWrite       | 0     |
| MemToReg       | 0     |
| RegWrite       | 1     |
| Branch         | 0     |
| Jump           | 0     |

Table 2.8: Control Signals for ADD Instruction

As we can see the only control signals that are set to 1 is RegWrite, and RegDst. RegWrite is set to 1 because we are writing to a register, and RegDst is set to 1 because the destination register is in the rd field of the instruction. The ALUOp is set to 10 to identify an R-type isntruction that has a funct field that we want to use as the operation to be performed by the ALU. ALUSrc is set to 0 because

we are not using an immediate value, and the ALU is getting its inputs from the register file. MemRead, MemWrite, MemToReg, Jump, and Branch are all set to 0 because we are not reading or writing to memory, and we are not branching nor jumping. First the instruction is fetched from memory and the next pc value is calculated by adding 4 to PC. Moving forward this value will be referred to as NPC(Next Program Counter). Then in the ID stage the control signals are generated, and the registers t1 and t2 are read from the register file. In the EX stage the ALU performs the addition operation and together with t0 the result is passed to the MEM stage. The MEM stage updates the PC to NPC and passes the ALU result from the EX stage to the WB stage, where it is written to register t0.

### Arithmetic and logical I-type instructions

addi \$t0, \$t1, 100

Machine format:

| opcode | rs | $\mathbf{rt}$ | imm |
|--------|----|---------------|-----|
| 8      | 9  | 8             | 100 |

Table 2.9: ADDI Instruction Machine Format

ADDI Control signals:

| Control Signal | Value |
|----------------|-------|
| ALUOp          | 00    |
| ALUSrc         | 1     |
| RegDst         | 1     |
| MemRead        | 0     |
| MemWrite       | 0     |
| MemToReg       | 0     |
| RegWrite       | 1     |
| Branch         | 0     |
| MemWrite       | 0     |
| Jump           | 0     |

Table 2.10: Control Signals for ADDI Instruction

The control signals are the same as for the R-type instruction, except for ALUSrc which is set to 1 because we are using an immediate value. The execution of the instruction is the same as for the R-type instruction, except that the ALU is performing an addition operation with an immediate value instead of a register value. Since this is an I-type instruction, there is no funct field for the ALU control unit to use. Therefore, the ALUOp signal is set to 00, which means that the ALU should perform an addition operation.

#### Load and store instructions

lw \$t0, 100(\$t1)
sw \$t0, 100(\$t1)

Machine format:

| opcode | rs | $\mathbf{rt}$ | imm |
|--------|----|---------------|-----|
| 35     | 9  | 8             | 100 |

Table 2.11: Lw Instruction Machine Format

| opcode | $\mathbf{r}\mathbf{s}$ | $\mathbf{rt}$ | imm |
|--------|------------------------|---------------|-----|
| 43     | 9                      | 8             | 100 |

Table 2.12: Sw Instruction Machine Format

## LW and SW Control signals:

| Control Signal | SW Value | LW Value |
|----------------|----------|----------|
| ALUOp          | 00       | 00       |
| ALUSrc         | 1        | 1        |
| RegDst         | 0        | 1        |
| MemRead        | 1        | 1        |
| MemWrite       | 1        | 0        |
| MemToReg       | 0        | 1        |
| RegWrite       | 0        | 1        |
| Branch         | 0        | 0        |
| Jump           | 0        | 0        |

Table 2.13: Control Signals for LW Instruction

In the example of the lw instructions, the control signals are the same as for the R-type instruction, except for MemRead and MemToReg, which are set to 1 because we are reading from memory and writing the data to a register. Most of the execution is the same as for the R-type instruction. The instruction gets fetched from memory, the registers t1 and t0 are read from the register file, and the ALU calculates the memory address by adding the immediate value to the value in register t1. The memory address is then passed to the MEM stage, where the data is read from memory and passed to the WB stage, where it is written to register t0. Almost the same goes for the sw instruction, except that the data is written to memory instead of being read from memory because of the MemWrite control signal being set to 1 and the MemToReg control signal being set to 0. And also nothing is written to the register file because of the RegWrite control signal being set to 0.

### 2.3.9.2 J-type Instruction Execution

j 100

Machine format:

| opcode | addr |
|--------|------|
| 2      | 100  |

Table 2.14: J Instruction Machine Format

### J Control signals:

| Control Signal | Value |
|----------------|-------|
| ALUOp          | 00    |
| ALUSrc         | 0     |
| RegDst         | 0     |
| MemRead        | 0     |
| MemWrite       | 0     |
| MemToReg       | 0     |
| RegWrite       | 0     |
| Branch         | 0     |
| Jump           | 1     |

Table 2.15: Control Signals for J Instruction

All the control signals are low except for the *Jump* control signal which is set to 1. The execution of the jump instruction is quite simple and requires the least amount of hardware. This is visible in the section 2.3.8 where adding support for an unconditional jump instruction was explained. First the instruction gets fetched from memory, then the jump address is calculated in the ID stage, by shifting the 26-bit address left by 2 bits and concatenating the 4 most significant bits of the PC. The concatenation of the most significant bits of the PC is done so that the jump address is relative to current 256 MB segment of the PC value. Or more accurately, the jump address is relative to the address of the instruction

following the jump instruction. And finally after the *Jump* control signal is set, the multiplexer selects the jump address and the PC is updated to the jump address. All other stages are bypassed since the jump instruction doesn't require any other operations to be performed.

## 2.4 Existing Solutions Analysis

## 2.4.1 Overview

There are many simulators available for simulating the MIPS and other pipelines. Simulators like WinDLX[4], VisualMIPS[18], MARS[15], WebMIPS, WebRISC-V[8] just to name a few. However, the most important one in the context of this thesis is MIPSIM[4], as it is the main inspiration and motivation for it. Even though it's the same concept, the simulators differ in their informativeness, user experience, feature set, and accessibility. These factors are arguably the most important when a simulator is chosen for educational purposes. The more informative a simulator is, the more can be learned from it. The better the user experience, the more focus will go into learning rather than figuring out how to use the simulator. The more features a simulator has, the more versatile it is in teaching different topics. And the more accessible a simulator is, the more students can use it. If a simulator is a headache to use and doesn't provide enough information to the user, then it won't be very effective as a learning tool. In this section, the existing solutions will be analyzed based on these factors to see how they compare to each other. But before that, the inspiration for this thesis, MIPSIM, will be briefly discussed.

## 2.4.2 MIPSIM Case Study

MIPSIM is a MIPS pipeline simulator that was developed by Herbert Grünbacher and Maziar Khosravipour at Vienna University of Technology. It is a MS-Windows (32 bit and 16 bit) based pipeline simulator for the MIPS processor which models a simple pipeline without hazard detection and forwarding units[4]. Much like the one described in section 2.3.5.



Figure 2.4: MIPSIM User Interface with an Overlay Plugin

Upon launching MIPSIM, one is greeted with a straightforward interface: the instruction memory on the left and a large pipeline diagram on the right. Additional windows for data memory and registers can be enabled as needed. Adding instructions is done via an icon that opens a new window, allowing the input of instructions. Memory, Register states, and instructions are editable and can be saved to their own file, adding a layer of flexibility. The execution controls are

intuitive, featuring play, pause, stop, step, and skip-to-end buttons, making the simulation process simple to interact with.

However, the instruction window is where frustrations begin. Instructions are entered into rows, resembling a spreadsheet, but without any modern conveniences like syntax highlighting, autocomplete, or sanity checks. An instruction is either correct and saved, or incorrect and an error pops up preventing the user from doing anything before the instruction is correctly input. This lack of feedback is manageable, but the real headache comes when there is a need to reorder instructions. Unlike spreadsheet editors, MIPSIM offers no repositioning features. Moving an instruction means rewriting it as a NOP and then re-entering it in the desired position. This can be quite cumbersome, especially when trying to move multiple instructions or let alone the whole program, because an instruction needs to be inserted at the beginning of the program.

Despite these usability issues, the pipeline visualization is quite informative, clearly showing the datapaths used by each instruction. However, it falls short in labeling the inputs of various components, which can be confusing.

Originally, MIPSIM lacked proper debugging support, and some features visible in Figure 2.4 were added via an overlay plugin, which also enhances stability. In summary, MIPSIM is a simple, old, platform limited, somewhat cumbersome, yet sufficiently informative MIPS pipeline simulator that has served its purpose well, despite its quirks.

## 2.4.3 Other Solutions Analysis

Arguably, the most fundamental aspect of a pipelining simulator is the underlying processor design. Many simulators, including MIPSIM, are based on a straightforward MIPS pipeline design that often lacks hazard detection or forwarding units,

or at least obscures these components. This is similar to the design described in section 2.3.5. Numerous simulators, such as MIPS Visualizer, do not incorporate pipelining at all, opting instead for a single-cycle design limiting its feature set. Conversely, more sophisticated simulators like WebMIPS support both forwarding and hazard detection units, and even more sophisticated ones like DrMIPS allow the user to choose between different CPU designs.

However, regardless of the processor design, a simulator's educational value is significantly diminished without a user-friendly interface and informative features. WebMIPS excels in providing detailed information, displaying the bits of each machine instruction and the values of control signals. Unfortunately, its pipeline diagram is static and does not reflect the execution of instructions as MIPSIM or DrMIPS does, serving only as a visual aid. On the positive side, WebMIPS includes interactive elements that explain their functions when clicked, which is an excellent approach. This informativeness is very beneficial but also unfortunately, as is the case with WebMIPS, it can make the interface cluttered and hard to navigate. The interface of WebMIPS, while functional, may not be immediately intuitive, suggesting room for improvement despite a relatively shallow learning curve.

DrMIPS on the other hand shows exactly what WebMIPS does, except that it tie everything together in a much more intuitive design including a reactive pipeline diagram. It also, as previously mentioned, features multiple cpu designs which is great for teaching purposes. Even though the interface has some quirks and possible minor improvements. The most major problem with DrMIPS is that it is Windows only, which is a big downside for students that use other operating systems.

That is where web based simulators come in to play. They are accessible to everyone on any device that supports a modern web browser, and they don't require any installation. WebRISC-V, WebMIPS, WeMIPS, VisualMIPS and a bunch of other web based simulators are great for this reason. Unfortunately none of them are as good as DrMIPS. The best out of the bunch is WebMIPS, but the issues mentioned before may be a dealbreaker for some students if not most students.

It would be important to mention that most simulators, such as VisualMIPS, WeMIPS, MARS, and SPIM, do not display a pipeline diagram, as they are not primarily designed as teaching tools. Nevertheless, they offer commendable user experience features. MARS, in particular, stands out as a robust and comprehensive simulator capable of testing real MIPS assembly code. It has a conventional and user-friendly interface that should be familiar to software engineering students, featuring syntax highlighting, autocomplete, and other functionalities that make writing code a breeze. Although MARS is not intended to teach the MIPS pipeline, its exceptional user experience makes it the best among the simulators reviewed.

Each simulator analyzed has its own strengths and weaknesses when it comes to educational use. While some excel in certain areas more than others, none of them are satisfactory in all aspects. If we could combine and adjust the features of all these simulators to meet the needs of students, we would have an ideal MIPS pipeline educational simulator. Better yet, a pipelining simulator that supports multiple architectures and offers a customizable feature set tailored to student requirements would be the ultimate solution.

- MIPSIM[4]
- WinDLX[4]
- WebMIPS. (n.d.). Retrieved from http://webmips.org
- MARS (MIPS Assembler and Runtime Simulator). (n.d.). Retrieved from

http://courses.missouristate.edu/KenVollmar/mars/

- DrMIPS. (n.d.). Retrieved from http://drmips.org
- WeMIPS. (n.d.). Retrieved from http://wemips.org
- MIPSim. (n.d.). Retrieved from http://mipsim.org
- WebRISC-V. (n.d.). Retrieved from http://webriscv.org
- VisualMIPS[18]
- SPIM

## 2.4.4 MIPS pipeline simulator requirements

From MIPSIM and from generally looking at the existing solutions, there are the minimum features that a MIPS pipeline simulator should have to be considered a minimum viable product. The most important ones would be the ability to input instructions, the ability to execute the instructions, and the ability to visualize the output of those instructions. But that wouldn't quite cut it if the simulator is supposed to also teach students about the MIPS pipeline. The more information the simulator provides about the execution of the instructions, the better. This includes the ability to see the values of the registers and the data memory, the ability to see the control signals, and the ability to see the values of the datapaths in the pipeline. The more information the simulator provides, the more effective it will be in teaching students about the MIPS pipeline and pipelining in general.

# 2.5 Feature Analysis

In this section, the features that make up a MIPS pipeline simulator will be discussed, and the existing solutions will be compared based on those features.

## 2.5.1 Instruction Input

The ability to input instructions is the most basic feature that realistically any simulator or code execution environment should have. And for decades that has been the conventional text editor. Every simulator analyzed, except for MIPSIM has one. Some are basic like the one in WebMIPS, and some are more advanced like the one in MARS. Which one is better is subjective, but the more advanced the editor is, the easier it will be for students that can really take advantage of its additional features. And if something is unwanted, it can always be turned off.

### 2.5.2 Instruction Execution

This is the feature that allows students execute their programs, a feature that every simulator has. Only thing that differs in each simulator are the execution controls. The most basic controls are play, pause, stop, and step, while more advanced controls would be the ability to set breakpoints, or the ability to change the speed of the execution. The more controls the better, but even the most basic ones are sufficient for the simulator to be effective.

#### 2.5.3 Visualization

This is the feature that allows students to see what happened to the instructions they executed. This can range just from seeing the values of the registers and the data memory, to seeing the individual instruction going through the pipeline diagram. The visualization is the broadest feature, because there is so much that can be shown to the user. The more information the better, but the most basic information that should be shown is the values of the registers and the data memory, and the pipeline diagram. The pipeline diagram is the most important part of the

visualization, because it shows the user how the instructions are executed, which is very important when it comes to teaching pipeline execution. WebMIPS, MIPSIM and DrMIPS are examples of simulators that have a pipeline diagram, with MIPSIM and DrMIPS actually show the instructions going through the pipeline, while WebMIPS just shows the pipeline diagram. DrMIPS actually has multiple cpu designs available, both single cycle and pipelined, and it shows the datapaths of the CPU, in each one. MARS, VisualMIPS, and SPIM are examples of simulators that don't have a pipeline diagram, but they do show the values of the registers and the data memory.

## 2.5.4 Interactability

Since the simulators should be analyzed from the perspective of a teaching tool, interactability is a very important feature. Poking around and observing changes is the single most effective way to learn about something. That even stems from the scientific method. The more interactive the simulator is, the more effective it will be in teaching students. The interactability can manifest itself in many ways. Editing a register value, creating custom instructions, or even just clicking on a component in the pipeline diagram to see what it does. A good example of this is WebMIPS and DrMIPS which have interactive elements that explain their functions when clicked. And most of the simulators have alterable data memory and register values, which is a good start. The more interactive the simulator is, the better.

### 2.5.5 User Interface

User Interface is something that can make or break a simulator. It is the glue that holds everything together. If the user interface is bad, the simulator will be bad. The user interface should be intuitive, easy to use, and not cluttered. Example of a

good user interface is MARS, which is simple, easy to use, and not cluttered. The user interface should also be customizable, so that the user can turn off features that they don't need, or turn on features that they do need. On the other spectrum is WebMIPS, which is very cluttered and a bit more difficult to navigate.

## 2.5.6 Accessibility

Accessibility plays a crucial role in the effectiveness of a teaching tool. The more accessible a simulator is, the broader its potential user base. Web-based simulators stand out in this regard, as they can be accessed from any device with a modern web browser, eliminating the need for installation. This makes them highly convenient for students who may be using different operating systems or devices. On the other hand, simulators that are limited to a single platform, such as Windows-only applications, restrict access to users with compatible devices. Among the simulators reviewed, WebMIPS, WebRISC-V, and WeMIPS are the most accessible due to their web-based nature. In contrast, MIPSIM and DrMIPS are less accessible, as they are confined to Windows environments.

## 2.6 Analysis Conclusion

This chapter has provided an overview of pipelining in general, went into the specifics of the MIPS architecture and its pipeline, and analyzed existing solutions and their features. The analysis has shown that while there are many simulators available, none of them are satisfactory in all aspects. Some simulators excel in certain areas, such as user experience or informativeness, but fall short in others. The most important features of a MIPS pipeline simulator have been identified, including instruction input, execution, visualization, interactability, user interface, and accessibility. A simulator that combines the best features of existing solutions

and offers a customizable feature set tailored to student requirements would be the ultimate solution. The next chapter will introduce the design of the MIPS pipeline simulator, focusing on the features identified in this analysis.

Solution Proposal

Implementation

Solution Verification

# Conclusion

- 6.1 Summary
- 6.2 Future Work

# Resumé

# References

- [1] Samuel O Aletan. "An overview of RISC architecture". In: Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing: technological challenges of the 1990's. 1992, pp. 11–20.
- [2] John Cocke and V. Markstein. "The evolution of RISC technology at IBM". In: IBM Journal of Research and Development 34.1 (1990), pp. 4–11. DOI: 10.1147/rd.341.0004.
- [3] M.J. Flynn. Computer Architecture: Pipelined and Parallel Processor Design.

  Computer Science Series. Jones and Bartlett, 1995. ISBN: 9780867202045.

  URL: https://books.google.sk/books?id=JS-010T19dsC.
- [4] H. Grunbacher and H. Khosravipour. "WinDLX and MIPSim pipeline simulators for teaching computer architecture". In: *Proceedings IEEE Symposium and Workshop on Engineering of Computer-Based Systems*. 1996, pp. 412–417. DOI: 10.1109/ECBS.1996.494568.
- [5] Yan He and Xiangning Chen. "Survey and Comparison of Pipeline of Some RISC and CISC System Architectures". In: 2023 8th International Conference on Computer and Communication Systems (ICCCS). IEEE. 2023, pp. 785–790.
- [6] John Hennessy et al. "MIPS: A microprocessor architecture". In: ACM SIG-MICRO Newsletter 13.4 (1982), pp. 17–22.

- [7] T. Jamil. "RISC versus CISC". In: *IEEE Potentials* 14.3 (1995), pp. 13–16.DOI: 10.1109/45.464688.
- [8] Gianfranco Mariotti and Roberto Giorgi. "WebRISC-V: A 32/64-bit RISC-V pipeline simulation tool". In: SoftwareX 18 (2022), pp. 1-7. ISSN: 2352-7110. DOI: 10.1016/j.softx.2022.101105. URL: https://doi.org/10.1016/j.softx.2022.101105.
- [9] MIPS. MIPS. https://mips.com/company/. [Online; accessed 4-December-2024]. 2024.
- [10] S. Mirapuri, M. Woodacre, and N. Vasseghi. "The Mips R4000 processor".
   In: *IEEE Micro* 12.2 (1992), pp. 10–22. DOI: 10.1109/40.127580.
- [11] Rashid F Olanrewaju et al. "Design and Implementation of a five stage pipelining architecture simulator for RiSC-16 instruction set". In: *Indian J Sci Technol* 10.3 (2017), pp. 1–9.
- [12] Amit Pandey. "Study of data hazard and control hazard resolution techniques in a simulated five stage pipelined RISC processor". In: 2016 International Conference on Inventive Computation Technologies (ICICT). Vol. 2. IEEE. 2016, pp. 1–4.
- [13] Iro Pantazi-Mytarelli. "The history and use of pipelining computer architecture: MIPS pipelining implementation". In: 2013 IEEE Long Island Systems, Applications and Technology Conference (LISAT). 2013, pp. 1–7. DOI: 10.1109/LISAT.2013.6578243.
- [14] David A Patterson and John L Hennessy. Computer organization and Design. Morgan Kaufmann, 1994.
- [15] Kenneth Vollmar Pete Sanderson. MARS. https://dpetersanderson.github.io/. [Online; accessed 4-December-2024]. 2024.

- [16] Todd A Proebsting and Christopher W Fraser. "Detecting pipeline structural hazards quickly". In: *Proceedings of the 21st ACM SIGPLAN-SIGACT symposium on Principles of programming languages*. 1994, pp. 280–286.
- [17] Leonid Ryzhyk. "The arm architecture". In: Chicago University, Illinois, EUA~(2006).
- [18] VisualMIPS. VisualMIPS. https://visualmips.github.io/. [Online; accessed 4-December-2024]. 2024.
- [19] Wikipedia contributors. Classic RISC pipeline Wikipedia, The Free Encyclopedia. https://en.wikipedia.org/w/index.php?title=Classic\_RISC\_pipeline&oldid=1255528196. [Online; accessed 6-November-2024]. 2024.

# Appendix A

First Appendix

# Appendix B

# Contents of Included CD–ROM

CD–ROM included to the thesis contains following files:

- /file1 First file
- /file2 Second file