

# Computer Organization

Lab3 MIPS(2)

Instructions





### Types of Instructions

- Data transfer
- Calculation
- Jump related to the instruction execution
- How to determine the execution order of instructions
  - PC register and its updation
- Practice: p1(1-1,1-2,1-3),p2(2-1,2-2,2-3)
- Tips: Big-endian vs Little-endian



## Summary of MIPS32 Instruction Types

- Function
  - data transfer(load, store)
  - calculation(arithmetic, Bitwise operation)
  - Jump instructions related to instruction execution
- Relation with data
  - Immedate is a operand vs all the operands are from registers
  - treat data as **signed** vs treat data as **Unsigned 指今末尾是さかい**
- Coding of Instruction: R, I, J 格含美型
- Basic instruction vs Pseduo instruction
  - lui, ori vs la) 伪裕全. 完成2条基本描字保作.
  - add \$t0,\$zero,\$t1 vs move \$t0,\$t1

名账旗 伪计通过加注范围 不能写



# **Common Operations**

| Description                     | Op-code | Operand                      |
|---------------------------------|---------|------------------------------|
| Add with Overflow               | add     | destination, src1, src2      |
| Add without Overflow            | addu    | destination, src1, src2      |
| AND                             | and     | destination, src1, immediate |
| Divide Signed                   | div     | destination/src1, immediate  |
| Divide Unsigned                 | divu    |                              |
| Exclusive-OR                    | xor     |                              |
| Multiply                        | mul     |                              |
| Multiply with Overflow          | mulo    |                              |
| Multiply with Overflow Unsigned | mulou   |                              |
| NOT OR                          | nor 🦴   | <b>多辑按位运算</b>                |
| OR                              | or /    | 244 1512                     |
| Set Equal                       | seq     |                              |
| Set Greater                     | sgt     |                              |
| Set Greater/Equal               | sge     |                              |
| Set Greater/Equal Unsigned      | sgeu    | 0 50 54 1)4                  |
| Set Greater Unsigned            | sgtu    | 姚维勒                          |
| Set Less                        | slt     | •                            |
| Set Less/Equal                  | sle     |                              |
| Set Less/Equal Unsigned         | sleu    |                              |
| Set Less Unsigned               | sltu    |                              |
| Set Not Equal                   | sne     |                              |
| Subtract with Overflow          | sub     |                              |
| Subtract without Overflow       | subu    |                              |

| Description             | Op-cod | e Operand        |
|-------------------------|--------|------------------|
| Rotate Left             | rol 😘  | 位                |
| Rotate Right            | ror    |                  |
| Shift Right Arithmetic  | sra    |                  |
| Shift Left Logical      | sll    |                  |
| Shift Right Logical     | srl    |                  |
| Absolute Value          | abs    | destination,src1 |
| Negate with Overflow    | neg    | destination/src1 |
| Negate without Overflow | negu   |                  |
| NOT                     | not    |                  |
| Move                    | move   | destination,src1 |
| Multiply                | mult   | src1,src2        |
| Multiply Unsigned       | multu  |                  |



## Signed vs Unsigned (caculation)

Run the two demos, which one will invoke the exception (arithmetic overflow), why?

```
.include "macro_print_str.asm"
.data
    tdata: .word 0x11111111
.text
main:
    lw $t0,tdata
    addu $a0,$t0,$t0
    li $v0,1
    syscall 11 17 (a)
     print_string("\n")
    add $a0,$t0,$t0
    li $v0,1
    syscall
    end #A
```

```
.include "macro_print_str.asm"
.data
    tdata: .word 0x71111111
.text
main:
   Iw $t0,tdata 都是 word 不用填充
    addu $a0,$t0,$t0
   li$v0,1 > 有符号校刊印输出
    syscall
    print_string("\n")
   add $a0,$t0,$t0 everflow
                  校值溢出,循溪传改符号位
    li $v0,1
    syscall
    end #B
```



# **Bit-wise Logic Operation(1)**

| Instruction name                             | description                                                                                                                                                                                                                                                                                    |
|----------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| and<br>(AND)<br>and dst,sr1,sr2(im)          | Computes the <b>Logical AND</b> of two values. This instruction ANDs (bit-wise) the contents of src1 with the contents of src2, or it can AND the contents of src1 with the immediate value. The immediate value is <b>NOT</b> sign extended. AND puts the result in the destination register. |
| or mulilett<br>(OR)<br>or dst,sr1,sr2(im)    | Computes the <b>Logical OR</b> of two values. This instruction ORs (bit-wise) the contents of src1 with the contents of src2, or it can OR the contents of src1 with the immediate value. The immediate value is <b>NOT</b> sign extended. OR puts the result in the destination register      |
| xor<br>(Exclusive-OR)<br>xor dst,sr1,sr2(im) | Computes the <b>XOR</b> of two values. This instruction XORs (bit-wise) the contents of src1 with the contents of src2, or it can XOR the contents of src1 with the immediate value. The immediate value is <b>NOT</b> sign extended. Exclusive-OR puts the result in the destination register |
| not<br>(NOT)<br>not dst,src1                 | Computes the <b>Logical NOT</b> of a value. This instruction complements (bit-wise) the contents of src1 and puts the result in the destination register.                                                                                                                                      |
| nor<br>(NOT OR)<br>nor dst,sr1,sr2           | Computes the <b>NOT OR</b> of two values. This instruction combines the contents of src1 with the contents of src2 (or the immediate value). NOT OR complements the result and puts it in the destination register.                                                                            |



# Bit-wise Logic Operation(2)

Run the demo and answer the question:

```
3/特殊 rep:
pc 松字相关
hi P字况字物
lo 荷
```

```
.data
    dvalue1: .byte 27
    dvalue2: .byte 4
.text
    lb $t0,dvalue1
    lb $t1,dvalue2
    div $t0,$t1
    mfhi $a0 将hi 移到 $a0
    li$v0,1 $7 $7 $ $400
    syscall
    li $v0,10
    syscall
```

```
.data
     dvalue1: .byte 27
     dvalue2: .byte 4
.text
     lb $t0,dvalue1
     lb $t1,dvalue2
     sub $t1,$t1,1
     and $a0,$t0,$t1
     li $v0,1
     syscall
     li $v0,10
     syscall
```

Q1: Is the output of two demos the same?

Q2: If use 5 instead of 4 as the initial value on dvalue2, is the output of two demos the same?

Q3: On which situation could use 'and' operation to get the remainder instead of division?

Q4: Do the logic operations work quicker than arithmetic operations?



# **Shift Operation**

| Туре                         | Instruction name                                                                                                                                                               | Description                                                                                                                                                                        | ShiftOperator src1, src2                                                                                                                                             |  |
|------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
|                              | Shifts the contents of a register left (toward the sign bit) and inserts  (Shift Left Logical)  Zeros at the least-significant bit. 18. 19. 19. 19. 19. 19. 19. 19. 19. 19. 19 |                                                                                                                                                                                    | The contents of src1 specify the value to shift, and the contents of src2 or the immediate value specify the amount to                                               |  |
| shift                        | (Shift Right Arithmetic )                                                                                                                                                      | Shifts the contents of a register right (toward the least-significant bit) and inserts the sign bit at the most-significant bit.                                                   | shift.  If src2 (or the immediate value) is greater                                                                                                                  |  |
| srl<br>(Shift Right Logical) |                                                                                                                                                                                | Shifts the contents of a register right (toward the least-significant bit) and inserts zeros at the most-significant bit.                                                          | than 31 or less than 0, src1 shifts by the result of src2 MOD 32.                                                                                                    |  |
|                              | <b>rol</b><br>(Rotate Left )                                                                                                                                                   | Rotates the contents of a register left (toward the sign bit).  This instruction inserts in the least-significant bit any bits that were shifted out of the sign bit.              | The contents of src1 specify the value to shift, and the contents of src2 (or the immediate value) specify the amount to shift. Rotate Left/right puts the result in |  |
| rotate                       |                                                                                                                                                                                |                                                                                                                                                                                    | the destination register.                                                                                                                                            |  |
|                              | <b>ror</b><br>(Rotate Right )                                                                                                                                                  | Rotates the contents of a register right (toward the least-significant bit). This instruction inserts in the sign bit any bits that were shifted out of the least-significant bit. | If src2 (or the immediate value) is greater than 31, src1 shifts by the result of src2 MOD 32.                                                                       |  |

# Practice1-1

Here is a demo to meet the following function: get the integer from input, judge whether the data is odd, if it is odd then print 1, else print 0.

1-1-1). Run the demo to see if the function of the code is ok? if not please find the reason and modify the code to meet the design expectations.

1-1-2). Which is(are) basic instruction(s) in the following set: li, move, nor, sra, and, syscall?

```
.include "macro_print_str.asm"
.data
.text
main:
    print string("please input an integer : ")
    li $v0,5
               ox ffff ffff
    syscall
                                        please input an integer : 3
              $t0, $v0
    move
                                        it is an odd number (0: false, 1:true) : 1
              $t1) $zero, $zero
    nor
                                        -- program is finished running --
    sra $t2, $t1,
                      31
    and $a0, $t2,
                      $t0
    print string("it is an odd number (0: false,1:true) : ")
    li $v0,1
    syscall
    end
```



1-2: Calculate the checksum of data. Here are the steps of calculation:

- 1) The data is grouped in 16bit units
- 2) The data of each group is accumulated. 🐉 🚧
- 3) The accumulation is divided into two parts according to the bit width.

  the first part is the low-address part of subscripts 0 to 15

  the second part is the part of the accumulated sum that exceeds the bit width of 16bit.



- 4) Then the first part and the second part are accumulated again(if there is no 16bit in second part, fill in 0 on the high-bit bits).
  - 求反為 The analysis and a second at a decimal at the

5) The one's complement of accumulated sum is the checksum

The data is grouped as: 0x7f00, 0x0001, 0x7f00, 0x0001,0x0011, 0xcbe0,0x2ee0, 0x000a,0x6162, calculate its checksum and print it out in hexdecimal.

tips: the part2 of the accumulated sum (step3) is 0x0002

# Practice1-3

There are 5 shift operations list on page8, practice it's function in MIPS and implement them by Verilog.

- 1-3-1: Write a demo in MIPS to practice 5 shift operations list on page 8 of this slice
- 1-3-2: Implement 5 shift operations list on page 8 of this slice in Verilog, build the testbench to verify its function. It's suggested to build a module with two inputs(src1, src2), five outputs(osll,oslr,osla,orol,oror), the bitwise of all inputs and outputs is 32. the output port "osll" is the output of operation sll src1,src2, and so on.

Here are some suggestions for the testcase, the value of src1 could be 0x8000\_0000, 0x0000\_0001, 0XFFFF\_FFFF, 0x00000\_FFFF, the value of src2 could be 0x0010\_0000, 0x00010\_0001, 0x8010\_0000, 0x8010\_0001, 0x0001\_1111, 0x00011\_1111.

| veri | 109 | Ş | 44 | 移往 |
|------|-----|---|----|----|
|      | J   | 1 | 24 | <  |

| Tuno   | Instruction name                     |
|--------|--------------------------------------|
| Type   | instruction name                     |
|        | <b>sll</b><br>(Shift Left Logical)   |
| shift  | sra<br>(Shift Right Arithmetic )     |
|        | <b>srl</b><br>(Shift Right Logical ) |
| rotate | rol<br>(Rotate Left )                |
|        | ror<br>(Rotate Right )               |

# Who' determine the execution order of instructions

- The CPU takes the value of the PC register as the address and fetches the corresponding instruction from the memory.
  - PC register maintains the address of the instruction currently being executed.
  - After the current instruction is executed, the value of the PC register will be updated to determine the next instruction to be executed.

| cpt | Address    | Code       | Basic                      | Source                           |
|-----|------------|------------|----------------------------|----------------------------------|
|     | 0x00400000 | 0x24020008 | addiu \$2,\$0,0x00000008   | 7: li \$v0,8 #to get a string    |
|     | 0x00400004 | 0x3c011001 | lui \$1,0x00001001         | 8: la \$a0, sid                  |
|     | 0x00400008 | 0x34240008 | ori \$4,\$1,0x00000008     |                                  |
|     | 0x0040000c | 0x24050009 | addiu \$5, \$0, 0x00000009 | 9: li \$a1,9                     |
|     | 0x00400010 | 0x0000000c | syscall                    | 10: syscall                      |
|     | 0x00400014 | 0x24020004 | addiu \$2,\$0,0x00000004   | 13: li \$v0,4 #to print a string |
|     | 0x00400018 | 0x3c011001 | lui \$1,0x00001001         | 14: la \$a0, s1                  |
|     | 0x0040001c | 0x34240000 | ori \$4,\$1,0x00000000     |                                  |
|     | 0x00400020 | 0x0000000c | syscall                    | 15: syscall                      |
|     | 0x00400024 | 0x2402000a | addiu \$2,\$0,0x0000000a   | 16: li \$v0,10 #to exit          |
| 1   | 0x00400028 | 0x0000000c | syscall                    | 17: syscall                      |

| c | 0x00400000 J-44          |
|---|--------------------------|
| С | 0x00400004<br>0x00400008 |
| С | 0x00400008 444           |
| c | 0x0040000c               |
|   |                          |
| с |                          |

### How to update the value of PC register?

Check if the current instruction is non-jump

- 普通路令
- If the current instruction is non-jump instruction: PC = PC+4
- If the current instruction is jump instruction 微報な
  - If the current instruction is unconditional jump pc = destination address
  - If the current instruction is conditional jump
    - If the condition is met: PC = destination address
    - If the condition is not met: PC = PC + 4



# **Conditional Jump**

| basic instruction              | usage 利的是否相等                                                                 |
|--------------------------------|------------------------------------------------------------------------------|
| えんえン beq \$t0,\$t1,labelx      | branch to instruction addressed by the labelx if \$t0 and \$t1 are equal     |
| 見たなだが、<br>bne \$t0,\$t1,labelx | branch to instruction addressed by the labelx if \$t0 and \$t1 are NOT equal |

| pseudo instruction   | basic instruction                                          | usage                                                                                                                                 |
|----------------------|------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------|
| blt \$t0,\$t1,lable  | slt <b>\$1</b> , \$t0, \$t1<br>bne <b>\$1</b> ,\$0, lable  | # branch to instruction addressed by the label if \$t0 is less than \$t1, data in \$t0 and \$t1 are taken as signed number            |
| ble \$t0,\$t1,lable  | slt <b>\$1</b> ,\$t1,\$t0<br>beq <b>\$1</b> ,\$0,lable     | # branch to instruction addressed by the label if \$t0 is less or equal than \$t1, data in \$t0 and \$t1 are taken as signed number   |
| bltu \$t0,\$t1,lable | sltu <b>\$1</b> , \$t0, \$t1<br>bne <b>\$1</b> ,\$0, lable | # branch to instruction addressed by the label if \$t0 is less than \$t1, data in \$t0 and \$t1 are taken as unsigned number          |
| bleu \$t0,\$t1,lable | sltu <b>\$1</b> ,\$t1,\$t0<br>beq <b>\$1</b> ,\$0,lable    | # branch to instruction addressed by the label if \$t0 is less or equal than \$t1, data in \$t0 and \$t1 are taken as unsigned number |
| bgt, bge, bgtu, bgeu | 常用作中转                                                      |                                                                                                                                       |



#### **Branch**

Are the running results of two demos the same? This Modify them without changing the result by using **ble** or **blt** instead

```
.include "macro_print_str.asm"
.text
      print string("please input your score (0~100):")
      li $v0,5
      syscall
      move $t0,$v0
case1:
      bge $t0,60,passLable
case2:
      i failLable
passLable:
      print string("\nPASS (exceed or equal 60) ")
      j caseEnd
failLable:
      print_string("\nFaild(less than 60)")
      j caseEnd
caseEnd:
      end
```

```
.include "macro_print_str.asm"
.text
      print_string("please input your score (0~100):")
      li $v0,5
      syscall
      move $t0,$v0
case1:
      bge $t0,60,passLable
      i case2
case2:
      i failLable
passLable:
      print string("\nPASS (exceed or equal 60) ")
      j caseEnd
failLable:
      print_string("\nFaild(less than 60)")
      j caseEnd
caseEnd:
      end
```



### Loop

Compare the operations of loop which calculats the sum from 1 to 10 in java and MIPS.

#### Code in Java:

```
public class CalculateSum{
  public static void main(String [] args){
    int i = 0;
    int sum = 0;
    for(i=0;i<=10;i++)
        sum = sum + i;
    System.out.print("The sum from 1 to 10 : " + sum );
  }
}</pre>
```

#### **Code in MIPS:**

```
.include "macro print str.asm"
.data
.text
     add $t1,$zero,$zero
     addi $t0,$zero,0
     addi $t7,$zero,10
calcu:
     addi $t0,$t0,1
                       #i++
     add $t1,$t1,$t0 #sum+=i
     bgt $t7,$t0,calcu #if(t7>t0) t0==t7
     print string ("The sum from 1 to 10 : ")
     move $a0,$t1
     li $v0,1
     syscall
     end
```



#### Demo #1

The following code is expected to get 10 integers from the input device, and print it as the following sample.

Will the code get desired result? If not, what happened?

```
#piece 1/3
.include "macro_print_str.asm"
.data
                            10
     arrayx:
                 .space
                            "\nthe arrayx is:"
     str:
                 .asciiz
.text
main:
     print string("please input 10 integers: ")
     add $t0,$zero,$zero
     addi $t1,$zero,10
     la $t2,arrayx
```

```
#piece 2/3
loop_r:
     li $v0,5
     syscall
     sw $v0,($t2)
      addi $t0,$t0,1
     addi $t2,$t2,4
      bne $t0,$t1,loop r
      la $a0,str
      li $v0,4
      syscall
      addi $t0,$zero,0
      la $t2,arrayx
```

```
please input 10 integers: 0

1

2

3

4

5

6

7

8

9

the arrayx is:0 1 2 3 4 5 6 7 8 9

program is finished running —
```

The function of following code is to get 5 integers from input device, and find the min value and max value of them.

There are 4 pieces of code, write your code based on them.

Can it find the real min and max?

```
#piece ?/4
.include "macro_print_str.asm"
.data
     min: .word 0
     max: .word 0
.text
     lw $t0,min
     lw $t1,max
     li $t7,5
     li $t6,0
     print_string("please input 5
integer:")
loop:
     li $v0,5
     syscall
     bgt $v0,$t1,get_max
     j get_min
```

```
#piece ?/4
get_max:
    move $t1,$v0
    j get_min
get_min:
    bgt $v0,$t0,judge_times
    move $t0,$v0
    j judge_times
```

```
#piece ?/4
judge_times:
addi $t6,$t6,1
bgt $t7,$t6,loop
```

```
#piece ?/4

print_string("min:")

move $a0,$t0

li $v0,1

syscall

print_string("max:")

move $a0,$t1

li $v0,1

syscall

end
```



### **Practice2-1,2-2**

- Answer the questiones on page 17 and 18
- Read a character, judge whether the binary representation of the character's ascii code is palindrome. For example, the ascii code of 'f' (102 in decimal, 0110\_0110 in binary) is a binary palindrome, the ascii code of space(32 in decimal, 0010\_0000 in binary) is not.

Tips: You can get more information from Mars' help page.

| ASCII printable characters |       |    |   |     |     |
|----------------------------|-------|----|---|-----|-----|
| 32                         | space | 64 | @ | 96  |     |
| 33                         | !     | 65 | A | 97  | а   |
| 34                         | "     | 66 | В | 98  | b   |
| 35                         | #     | 67 | C | 99  | C   |
| 36                         | \$    | 68 | D | 100 | d   |
| 37                         | %     | 69 | E | 101 | e   |
| 38                         | &     | 70 | F | 102 | f   |
| 39                         |       | 71 | G | 103 | g   |
| 40                         | (     | 72 | Н | 104 | h   |
| 41                         | )     | 73 | 1 | 105 | i   |
| 42                         | *     | 74 | J | 106 | j   |
| 43                         | +     | 75 | K | 107 | k   |
| 44                         | ,     | 76 | L | 108 | - 1 |
| 45                         |       | 77 | M | 109 | m   |
| 46                         |       | 78 | N | 110 | n   |
| 47                         | 1     | 79 | 0 | 111 | 0   |
| 48                         | 0     | 80 | P | 112 | p   |
| 49                         | 1     | 81 | Q | 113 | q   |
| 50                         | 2     | 82 | R | 114 | r   |
| 51                         | 3     | 83 | S | 115 | S   |
| 52                         | 4     | 84 | Т | 116 | t   |
| 53                         | 5     | 85 | U | 117 | u   |
| 54                         | 6     | 86 | V | 118 | V   |
| 55                         | 7     | 87 | W | 119 | w   |
| 56                         | 8     | 88 | X | 120 | X   |
| 57                         | 9     | 89 | Y | 121 | y   |
| 58                         | :     | 90 | Z | 122 | Z   |
| 59                         | ;     | 91 | [ | 123 | {   |
| 60                         | <     | 92 | 1 | 124 | -   |
| 61                         | =     | 93 | ] | 125 | }   |
| 62                         | >     | 94 | ٨ | 126 | ~   |
| 63                         | ?     | 95 | _ |     |     |
|                            |       |    |   |     |     |



#### Practice2-3

Implement the circuit described on page 13 in Verilog.

Build a testbench to verify its function.

NOTES: the width of all the register is 32bits.

#### The logic described on page 13:

- ➤ Check if the current instruction is non-jump
  - $\triangleright$  If the current instruction is non-jump instruction: PC = PC+4
  - ➤ If the current instruction is jump instruction
    - $\triangleright$  If the current instruction is unconditional jump pc = destination address
    - ➤ If the current instruction is conditional jump
      - > If the condition is met: PC = destination address
      - $\triangleright$  If the condition is not met: PC = PC + 4



## Tips: Big-endian vs Little-endian(1)

The CPU's **byte ordering scheme** (or **endian issues**) affects memory organization and defines the relationship between address and byte position of data in memory.

- > a **Big-endian** system means byte 0 is always the most-significant (leftmost) byte.
- > a Little-endian system means byte 0 is always the least-significant (rightmost) byte.





Figure 1-2: Little-endian Byte Ordering



# Tips: Big-endian vs Little-endian(2)

Run the demo to anwer the question:

Does your simulator work on big-endian or little-endian, explain the reasons.

```
.include "macro_print_str.asm"
.data

tdata0: .byte 0x11,0x22,0x33,0x44
tdata: .word 0x44332211
.text
main:

lb $a0,tdata
li $v0,34
syscall
end
```

```
.include "macro_print_str.asm"
.data
    tdata0: .byte  0x11,0x22,0x33,0x44
    tdata: .word  0x44332211
.text
main:
    Ih $a0,tdata
    li $v0,34
    syscall
    end
```

print integer in hexadecimal

34

\$a0 = integer to print

Displayed value is 8 hexadecimal digits, left-padding with zeroes if necessary.

# Tips: Big-endian or Little-endian?

```
.include "macro print str.asm"
.data
      tdata0: .word 0x00112233, 0x44556677
.text
main:
      la $t0,tdata0
      Ib $a0,($t0)
      li $v0,34
      syscall
      la $t0,tdata0
      lb $a0,1($t0)
      syscall
      Ib $a0,2($t0)
      syscall
      lb $a0,3($t0)
      syscall
      lw $a0,4($t0)
      syscall
      end
```

Run the demo to anwer the question:

Q1. What's the output of this demo?

A. **0**x00000033**0**x00000002**20**x00000001**10**x0000000**00**x44556677

B.**0**x0000000<mark>00</mark>x00000011**0**x0000000<mark>220</mark>x0000000<mark>330</mark>x44556677

C.**0**x0000004**40**x0000000<mark>550</mark>x0000000<mark>660</mark>x0000000<mark>770</mark>x00112233

D.**0**x000000<mark>770</mark>x000000<del>660</del>x0000000<del>550</del>x0000000<del>440</del>x33221100

Q2. Does your simulator work on big-endian or little-endian, explain the reasons.

| print integer in hexadecimal 34 \$a0 = integer to print | Displayed value is 8 hexadecimal digits, left-padding with zeroes if necessary. |
|---------------------------------------------------------|---------------------------------------------------------------------------------|
|---------------------------------------------------------|---------------------------------------------------------------------------------|