# RISC-V Intro & Control Flow

Discussion 4: September 20, 2021

### 1 RISC-V: A Rundown

RISC-V is an assembly language, which is comprised of simple instructions that each do a single task such as addition or storing a chunk of data to memory.

For example, on the left is a line of C code and on the right is a chunk of RISC-V code that accomplishes the same thing.

```
// x -> s0, &y -> s1

int x = 5, y[2]; addi s0, x0, 5

y[0] = x; sw s0, 0(s1)

y[1] = x * x; mul t0, s0, s0

sw t0, 4(s1)
```

[1.1] Can you figure out what each line in the RISC-V code is doing?

```
addi s0, x0, 5 does x = 5.

sw s0, 0(s1) does y[0] = x.

mul t0, s0, s0 calculates x * x.

sw t0, 4(s1) saves the result of the multiplication with y[1] = x * x.
```

### 2 Registers

In RISC-V, we have two methods of storing data: main memory and registers. Registers are much faster than using main memory, but are very limited in space (32 bits each). You should ALWAYS use the names of registers, e.g.  $\mathfrak{s0}$  rather than  $\mathfrak{x8}$ ; the one exception to this rule is the zero register  $\mathfrak{x0}$ , as it is often shorter to write  $\mathfrak{x0}$  than its name  $\mathsf{zero}$ , and the purpose of the register is still easy to tell with either identifier. The below table of register names is reproduced from the RISC-V green card.

| Register(s)    | Alt.             | Description                                                       |
|----------------|------------------|-------------------------------------------------------------------|
| x0             | zero             | The zero register, always zero                                    |
| x1             | ra               | The return address register, stores where functions should return |
| x2             | $^{\mathrm{sp}}$ | The stack pointer, where the stack ends                           |
| x5-x7, x28-x31 | t0-t6            | The temporary registers                                           |
| x8-x9, x18-x27 | s0-s11           | The saved registers                                               |
| x10-x17        | a0-a7            | The argument registers, a0-a1 are also return value               |

2.1 Can you convert each instruction's registers to the other form?

```
add s0, zero, a1 --> add x8, x0, x11 or x18, x1, x30 --> or s2, ra, t5
```

As a reminder, you should ALWAYS use the named registers (e.g.  $\mathfrak{s0}$  rather than  $\mathfrak{x8}$ ).

# 3 Basic Instructions

For your reference, here are some of the basic instructions for arithmetic operations and dealing with memory (Note: ARG1 is argument register 1, ARG2 is argument register 2, and DR is destination register):

| [inst] | [destination register] [argument register 1] [argument register 2]             |
|--------|--------------------------------------------------------------------------------|
| add    | Adds the two argument registers and stores in destination register             |
| xor    | Exclusive or's the two argument registers and stores in destination register   |
| mul    | Multiplies the two argument registers and stores in destination register       |
| sll    | Logical left shifts ARG1 by ARG2 and stores in DR                              |
| srl    | Logical right shifts ARG1 by ARG2 and stores in DR                             |
| sra    | Arithmetic right shifts ARG1 by ARG2 and stores in DR                          |
| slt/u  | If ARG1 < ARG2, stores 1 in DR, otherwise stores 0, u does unsigned comparison |
| [inst] | [register] [offset]([register containing base address])                        |
| sw     | Stores the contents of the register to the address+offset in memory            |
| lw     | Takes the contents of address+offset in memory and stores in the register      |
| [inst] | [argument register 1] [argument register 2] [label]                            |
| beq    | If $ARG1 == ARG2$ , moves to label                                             |
| bne    | If ARG1 != ARG2, moves to label                                                |
| [inst] | [destination register] [label]                                                 |
| jal    | Stores the next instruction's address into DR and moves to label               |

You may also see that there is an "i" at the end of certain instructions, such as addi, slli, etc. This means that ARG2 becomes an "immediate" or an integer instead of using a register. There are also immediates in some other instructions such as  $\mathbf{sw}$  and  $\mathbf{lw}$ . Note that the size (maximum number of bits) of an immediate in any given instruction depends on what type of instruction it is (more on this soon!).

3.1 Assume we have an array in memory that contains int \*arr = {1,2,3,4,5,6,0}. Let register s0 hold the address of the element at index 0 in arr. You may assume integers are four bytes and our values are word-aligned. What do the snippets of RISC-V code do? Assume that all the instructions are run one after the other in the same context.

```
a) lw t0, 12(s0)
                      -->
                              Sets t0 equal to arr[3]
```

b) sw t0, 
$$16(s0)$$
 --> Stores t0 into arr[4]

#### 4 C to RISC-V

4.1 Translate between the C and RISC-V verbatim.

```
\mathbf{C}
                                       RISC-V
// s0 -> a, s1 -> b
                                       addi s0, x0, 4
// s2 -> c, s3 -> z
                                       addi s1, x0, 5
int a = 4, b = 5, c = 6, z;
                                       addi s2, x0, 6
                                       add s3, s0, s1
z = a + b + c + 10;
                                       add s3, s3, s2
                                       addi s3, s3, 10
// s0 -> int * p = intArr;
                                       sw x0, 0(s0)
// s1 -> a;
                                       addi s1, x0, 2
*p = 0;
                                       sw s1, 4(s0)
int a = 2;
                                       slli t0, s1, 2
p[1] = p[a] = a;
                                       add t0, t0, s0
                                       sw s1, 0(t0)
// s0 -> a, s1 -> b
                                          addi s0, x0, 5
int a = 5, b = 10;
                                           addi s1, x0, 10
if(a + a == b) {
                                           add t0, s0, s0
   a = 0;
                                           bne t0, s1, else
} else {
                                           xor s0, x0, x0
   b = a - 1;
                                          jal x0, exit
}
                                       else:
                                           addi s1, s0, -1
                                       exit:
// computes s1 = 2^30
                                           addi s0, x0, 0
// assume int s1, s0; was declared above
                                           addi s1, x0, 1
s1 = 1;
                                           addi t0, x0, 30
for(s0 = 0; s0 != 30; s0++) {
                                       loop:
                                           beq s0, t0, exit
   s1 *= 2;
                                           add s1, s1, s1
                                           addi s0, s0, 1
                                           jal x0, loop
                                       exit:
```

```
// s0 -> n, s1 -> sum
// assume n > 0 to start
for(int sum = 0; n > 0; n--) {
    sum += n;
}

addi s1, x0, 0
loop:
beq s0, x0, exit
add s1, s1, s0
add s0, s0, -1
    jal x0, loop
exit:
```

# 5 RISC-V with Arrays and Lists

Comment what each code block does. Each block runs in isolation. Assume that there is an array, int arr[6] = {3, 1, 4, 1, 5, 9}, which starts at memory address 0xBFFFFF00, and a linked list struct (as defined below), struct 11\* 1st, whose first element is located at address 0xABCD0000. Let s0 contain arr's address 0xBFFFFF00, and let s1 contain 1st's address 0xABCD0000. You may assume integers and pointers are 4 bytes and that structs are tightly packed. Assume that 1st's last node's next is a NULL pointer to memory address 0x00000000.

```
struct ll {
         int val;
         struct 11* next;
     }
        t0, 0(s0)
5.1
     lw
     lw t1, 8(s0)
     add t2, t0, t1
     sw t2, 4(s0)
     Sets arr[1] to arr[0] + arr[2].
5.2
     loop: beq s1, x0, end
           lw
                 t0, 0(s1)
           addi t0, t0, 1
                 t0, 0(s1)
            SW
                 s1, 4(s1)
           1w
           jal x0, loop
      end:
```

Increments all values in the linked list by 1.

```
5.3 add t0, x0, x0
loop: slti t1, t0, 6
beq t1, x0, end
slli t2, t0, 2
add t3, s0, t2
lw t4, 0(t3)
sub t4, x0, t4
```

Negates all elements in arr.

## 6 RISC-V Calling Conventions

6.1 How do we pass arguments into functions?

Use the 8 arguments registers a0 - a7.

6.2 How are values returned by functions?

Use a0 and a1 as the return value registers.

6.3 What is sp and how should it be used in the context of RISC-V functions?

sp stands for stack pointer, and it represents the boundary between stored data and free space on the stack. Because the stack grows downward, we subtract from sp to create more space (moving the stack pointer down), and add to sp to free space (moving the stack pointer back up). The stack is mainly used to save (and later restore) the value of registers that may be overwritten.

6.4 Which values need to saved by the caller, before jumping to a function using jal?

Registers a0 - a7, t0 - t6, and ra.

6.5 Which values need to be restored by the callee, before returning from a function?

Registers sp, gp (global pointer), tp (thread pointer), and s0 - s11. Note that we don't use gp and tp very often in this course.

6.6 In a bug-free program, which registers are guaranteed to be the same after a function call? Which registers aren't guaranteed to be the same?

Registers a0 - a7, t0 - t6, and ra are not guaranteed to be the same after a function call (which is why they must be saved by the caller). Registers sp, gp, tp, and s0 - s11 are guaranteed to be the same after a function call (which is why the callee must restore them before returning).

### 7 Writing RISC-V Functions

7.1 Write a function sumSquare in RISC-V that, when given an integer n, returns the summation below. If n is not positive, then the function returns 0.

$$n^2 + (n-1)^2 + (n-2)^2 + \ldots + 1^2$$

For this problem, you are given a RISC-V function called square that takes in a single integer and returns its square.

First, let's implement the meat of the function: the squaring and summing. We will be abiding by the caller/callee convention, so in what register can we expect the parameter n? What registers should hold square's parameter and return value? In what register should we place the return value of sumSquare?

```
add
           s0, a0, x0
                        # Set s0 equal to the parameter n
      add
           s1, x0, x0
                        # Set s1 (accumulator) equal to 0
loop: beq
           s0, x0, end
                       # Branch if s0 reaches 0
                        # Set a0 to the value in s0, setting up
      add
           a0, s0, x0
                        # args for call to function square
                        # Call the function square
      jal ra, square
          s1, s1, a0
                        # Add the returned value into s1
      addi s0, s0, -1
                        # Decrement s0 by 1
      jal x0, loop
                        # Jump back to the loop label
                        # Set a0 to s1 (desired return value)
end:
      add
           a0, s1, x0
```

5.2 Since sumSquare is the callee, we need to ensure that it is not overriding any registers that the caller may use. Given your implementation above, write a prologue and epilogue to account for the registers you used.

```
# Make space for 3 words on the stack
prologue: addi sp, sp -12
                             # Store the return address
               ra, 0(sp)
               s0, 4(sp)
                             # Store register s0
               s1, 8(sp)
                             # Store register s1
epilogue: lw
               ra, 0(sp)
                             # Restore ra
          1w
               s0, 4(sp)
                             # Restore s0
               s1, 8(sp)
                             # Restore s1
          addi sp, sp, 12
                             # Free space on the stack for the 3 words
                             # Return to the caller
          jr
               ra
```

Note that ra is stored in the prologue and epilogue even though it is a caller-saved register. This is because if we call multiple functions within the body of sumSquare, we'd need to save ra to the stack on every call, which would be redundant — we might as well save it in the prologue and restore it in the epilogue along with the callee-saved registers. For this reason, in functions that don't call other functions, it is generally safe to refrain from saving/restoring ra in the prologue/epilogue as long as nothing else is overwriting it.