

## Faculty of Media Engineering and Technology Dept. of Computer Science and Engineering Dr. Milad Ghantous

## CSEN 702: Microprocessors Winter 2022 $Quiz\ 2\ Version\ 1\ -\ {\color{red} solution}$

| Name:   |                       |                |                | ID:              | Group:          |                |                         |
|---------|-----------------------|----------------|----------------|------------------|-----------------|----------------|-------------------------|
| •       | `                     | - /            |                |                  | Γomasulo arc    |                | some clock cycle X, the |
|         | _                     |                | in a sentence  |                  | O               |                |                         |
| RS A1:  |                       |                |                |                  |                 |                |                         |
| 1       |                       | ADD            | 2.8            |                  | 0               | M1             |                         |
| Explan  |                       |                | L. DCA         | 4 77 0 11        |                 |                |                         |
|         |                       |                |                | , •              | •0              |                | 8 is valid and was      |
|         |                       |                | be able to s   |                  |                 | neaning that   | 3 is invalid and this   |
| RS M1   |                       |                | be able to s   | otar i executii. | <sup>1</sup> 8· |                |                         |
| 1       |                       | DIV            | 2.8            | 3                | 0               | 0              |                         |
| Explan  |                       | instruction    | -hat is curror | atly ovocuting   | r cinco both f  | ields Q are ze | Wood                    |
|         |                       | mstruction (   | mat is curren  | my execum        | z since both i  | ields & are ze | 105.                    |
| Store b |                       | <u> </u>       | Λ.1            |                  |                 |                |                         |
| 1       | 10+F                  | 15             | A1             |                  |                 |                |                         |
| Explana | ation: <mark>s</mark> | tore is issued | with address 1 | 10(R5) and it    | is waiting for  | A1 to finish.  |                         |
| Registe | er file:              |                |                |                  |                 |                |                         |
| Reg     | Q                     |                |                |                  |                 |                |                         |
| F2      | M1                    |                |                |                  |                 |                |                         |
| F3      | A1                    |                |                |                  |                 |                |                         |
| F4      | 0                     |                |                |                  |                 |                |                         |

## Explanation:

Register F2 will be computed by M1 (DIV) when it finishes, while F3 is the destination of the ADD instruction in A1, and F4 is currently valid and no one is producing it.

Part B) Reverse engineer into assembly. Take any necessary assumptions. (2 pts)

DIV.D F2, F7,F8 SUB.D F3,F7, F2 S.D F3, 10(R5)

Question 2- Consider the following code to be scheduled on a Tomasulo's architecture with 2 add/sub reservation stations, 2 mult/div reservation stations, 2 load buffers and 2 store buffers.

Multiplication takes 2 execution cycles, addition takes 1 execution cycles, loads and stores take 1 execution cycles each, but the first load misses and takes 3 cycles to execute.

F5 is available in the register file and contains the value 9.0. Also register R2 is available and contains value 2. (12 pts)

L.D F2, 10(R2) ADD.D F3, F2 ,F2 ADD.D F4, F3, F3 MUL.D F5, F5, F5 S.D F4, 10(R2)

- Fill in the contents of the corresponding components on pages 3 to 6, each clock cycle, starting with cycle 1 up until cycle 7 inclusive. (8 pts)
- However, fill the table on Page 2, fully, until code completion. (4 pts)
- Write down a summary of events in each cycle inside the discussion box.
- Note that the A field is omitted in the RS of Add and Mult.
- If 2 instructions want to write result in the same cycle, use FIFO order.

|                  | Issue                  | Execute | Write result |
|------------------|------------------------|---------|--------------|
| L.D F2, 10(R2)   | 1                      | 24      | 5            |
| ADD.D F3, F2 ,F2 | 2                      | 6       | 7            |
| ADD.D F4, F3, F3 | 3                      | 8       | 9            |
| MUL.D F5, F5, F5 | 4                      | 56      | 8            |
| S.D F4, 10(R2)   | 5 (6 is also accepted) | 10      | 11           |

6 in case someone considers the load published in 5 and exits, so the store, having the same address can now be issued)

| <b>-</b> | Busy | ор | Vj | Vk | Qj | Qk |
|----------|------|----|----|----|----|----|
| A1       | 0    |    |    |    |    |    |
| A2       | 0    |    |    |    |    |    |

|    | busy | ор | Vj | Vk | Qj | Qk |
|----|------|----|----|----|----|----|
| M1 | 0    |    |    |    |    |    |
|    |      |    |    |    |    |    |
| M2 | 0    |    |    |    |    |    |
|    |      |    |    |    |    |    |

|   |   | Busy | Address |
|---|---|------|---------|
| L | 1 | 1    | 12      |
| L | 2 | 0    |         |

|    | busy | Address | v | Q |
|----|------|---------|---|---|
| S1 | 0    |         |   |   |
| S2 | 0    |         |   |   |

|    | Qi | Reg value | Discussion     |
|----|----|-----------|----------------|
| F2 | L1 |           | Load is issued |
| F3 |    |           |                |
| F4 |    |           |                |
| F5 | 0  | 9         |                |

|    | Busy | op  | Vj | Vk | Qj | Qk |
|----|------|-----|----|----|----|----|
| A1 | 1    | add |    |    | L1 | L1 |
| A2 |      |     |    |    |    |    |

|    | busy | ор | Vj | Vk | Qj | Qk |
|----|------|----|----|----|----|----|
| M1 |      |    |    |    |    |    |
|    |      |    |    |    |    |    |
| М2 |      |    |    |    |    |    |
|    |      |    |    |    |    |    |

|    | Busy | Address |
|----|------|---------|
| L1 | 1    | 12      |
| L2 | 0    |         |

|    | busy | Address | v | Q |
|----|------|---------|---|---|
| S1 | 0    |         |   |   |
| S2 | 0    |         |   |   |

|    | Qi | Reg value | Discussion                                    |
|----|----|-----------|-----------------------------------------------|
| F2 | L1 |           | Add is issued<br>Load starts will finish at 6 |
| F3 | A1 |           |                                               |
| F4 |    |           |                                               |
| F5 | 0  | 9         |                                               |

|    | Busy | ор  | Vj | Vk | Qj | Qk |
|----|------|-----|----|----|----|----|
| A1 | 1    | add |    |    | L1 | L1 |
| A2 | 1    | add |    |    | A1 | A1 |

|    | busy | op | Vj | Vk | Qj | Qk |
|----|------|----|----|----|----|----|
| м1 |      |    |    |    |    |    |
|    |      |    |    |    |    |    |
| M2 |      |    |    |    |    |    |
|    |      |    |    |    |    |    |

|    | Busy | Address |
|----|------|---------|
| L1 | 1    | 12      |
| L2 | 0    |         |

|    | busy | Address | v | Q |
|----|------|---------|---|---|
| S1 | 0    |         |   |   |
| S2 | 0    |         |   |   |

|    | Qi | Reg value | Discussion           |
|----|----|-----------|----------------------|
| F2 | L1 |           | Second add is issued |
| F3 | A1 |           |                      |
| F4 | A2 |           |                      |
| F5 | 0  | 9         |                      |

|    | Busy | op  | Vj | Vk | Qj | Qk |
|----|------|-----|----|----|----|----|
| A1 | 1    | add |    |    | L1 | L1 |
| A2 | 1    | add |    |    | A1 | A1 |

|    | busy | op   | Vj | Vk | Qj | Qk |
|----|------|------|----|----|----|----|
| M1 | 1    | mult | 9  | 9  |    |    |
|    |      |      |    |    |    |    |
| M2 |      |      |    |    |    |    |
|    |      |      |    |    |    |    |

|    | Busy | Address |
|----|------|---------|
| L1 | 1    | 12      |
| L2 | 0    |         |

|    | busy | Address | v | Q |
|----|------|---------|---|---|
| S1 | 0    |         |   |   |
| S2 | 0    |         |   |   |

|    | Qi | Reg value | Discussion     |
|----|----|-----------|----------------|
| F2 | L1 |           | mult is issued |
| F3 | A1 |           |                |
| F4 | A2 |           |                |
| F5 | M1 | 9         |                |

|    | Busy | ор  | Vj      | Vk      | Qj | Qk |
|----|------|-----|---------|---------|----|----|
| A1 | 1    | add | Mem[12] | Mem[12] |    |    |
| A2 | 1    | add |         |         | A1 | A1 |

|    | busy | op   | Vj | Vk | Qj | Qk |
|----|------|------|----|----|----|----|
| M1 | 1    | mult | 9  | 9  |    |    |
|    |      |      |    |    |    |    |
| м2 |      |      |    |    |    |    |
|    |      |      |    |    |    |    |

|    | Busy | Address |
|----|------|---------|
| L1 | 0    |         |
| L2 | 0    |         |

|    | busy | Address | v | Q  |
|----|------|---------|---|----|
| S1 | 1    | 12      |   | A2 |
| S2 | 0    |         |   |    |

|    | Qi | Reg value | Discussion                                       |
|----|----|-----------|--------------------------------------------------|
| F2 | 0  | Mem[12]   | Load publishes, taken by Al<br>Mult is executing |
| F3 | A1 |           | Store is issued                                  |
| F4 | A2 |           |                                                  |
| F5 | М1 | 9         |                                                  |

|    | Busy | ор  | Vj      | Vk      | Qj | Qk |
|----|------|-----|---------|---------|----|----|
| A1 | 1    | add | Mem[12] | Mem[12] |    |    |
| A2 | 1    | add |         |         | A1 | A1 |

|    | busy | op   | Vj | Vk | Qj | Qk |
|----|------|------|----|----|----|----|
| м1 | 1    | mult | 9  | 9  |    |    |
|    |      |      |    |    |    |    |
| M2 |      |      |    |    |    |    |
|    |      |      |    |    |    |    |

|    | Busy | Address |
|----|------|---------|
| L1 | 0    |         |
| L2 | 0    |         |

|    | busy | Address | V | Q  |
|----|------|---------|---|----|
| S1 | 1    | 12      |   | A2 |
| S2 | 0    |         |   |    |

|    | Qi | Reg value | Discussion                                                                                          |
|----|----|-----------|-----------------------------------------------------------------------------------------------------|
| F2 | 0  | Mem[12]   | <ul><li>Nothing happens in terms of values.</li><li>Al is executing and will publish in 7</li></ul> |
| F3 | A1 |           | <ul> <li>Mult is executing and will publish in 8<br/>(because Al publishes in 7)</li> </ul>         |
| F4 | A2 |           | • Store is also executing and will write result in 7 (this is ok since store doesn't                |
| F5 | М1 | 9         | use the bus)                                                                                        |

|    | Busy | ор  | Vj      | Vk      | Qj | Qk |
|----|------|-----|---------|---------|----|----|
| A1 | 0    |     |         |         |    |    |
| A2 | 1    | add | 2*m[12] | 2*m[12] |    |    |

|    | busy | ор   | Vj | Vk | Qj | Qk |
|----|------|------|----|----|----|----|
| M1 | 1    | mult | 9  | 9  |    |    |
|    |      |      |    |    |    |    |
| M2 |      |      |    |    |    |    |
|    |      |      |    |    |    |    |

|    | Busy | Address |
|----|------|---------|
| L1 | 0    |         |
| L2 | 0    |         |

|    | busy | Address | v | Q |
|----|------|---------|---|---|
| S1 | 0    |         |   |   |
| S2 | 0    |         |   |   |

|    | Qi | Reg value | Discussion                             |
|----|----|-----------|----------------------------------------|
| F2 | 0  | Mem[12]   | A1 published taken by A2 and reg file. |
| F3 | 0  | 2*Mem[12] |                                        |
| F4 | A2 |           |                                        |
| F5 | М1 | 9         |                                        |