Alexandria University
Faculty of Engineering
Computer Architecture
Spring 2025

# **Cache Coherence Report**

Mohamed Atef Yousef

20011630

# 1. Introduction:

In this lab, we explored the operation of a multi-core cache system using the **MOESI** (Modified, Owned, Exclusive, Shared, Invalid) cache coherence protocol. The primary objective was to simulate and analyze how cache lines transition between states based on memory operations performed by four processors (P0–P3). We used a Python-based **MOESI** simulator (<u>GitHub Repository</u>) that randomly generates read and write instructions and visually shows the impact on each L1 cache.

The goal was to observe and explain how each instruction affects the cache line states across the processors, identifying the cause processor, transition type, and resulting coherence impact across the system. This hands-on experiment enhanced our understanding of distributed memory consistency and the detailed workings of MOESI transitions in shared-memory systems.

# **Example**

# Cycle 1

| CPU 0               |                |              |      |              |      |            |        |            |      | Actions                      |
|---------------------|----------------|--------------|------|--------------|------|------------|--------|------------|------|------------------------------|
| Instructions:       |                | L1 - Block ( | 0:   | L1 - Block 1 |      | L1 - Block | 2:     | L1 - Block | 3:   | Start Simulation             |
| Executing:          | CALC           | State:       | 1    | State:       | 1    | State:     | 1      | State:     | 1    | Start Simulation             |
| Last executed:      | Idle           | Address:     | 000  | Address:     | 000  | Address:   | 000    | Address:   | 000  | Stop Simulation              |
|                     |                | Data:        | 0000 | Data:        | 0000 | Data:      | 0000   | Data:      | 0000 | Single Step                  |
|                     |                |              |      |              |      |            |        |            |      | Single Step                  |
| CPU 1 Instructions: |                | L1 - Block ( | n-   | L1 - Block 1 |      | L1 - Block | g.     | L1 - Block | 3.   | Custom instruction:          |
| Executing:          | CALC           | State:       |      | State:       |      | State:     | z.<br> | State:     | . I  | P0 ~                         |
| Last executed:      | Idle           | Address:     | 000  | Address:     | 000  | Address:   | 000    | Address:   | 000  | Add                          |
|                     |                | Data:        | 0000 | Data:        | 0000 | Data:      | 0000   | Data:      | 0000 |                              |
|                     |                |              |      |              |      |            |        |            |      |                              |
| CPU 2               |                |              |      |              |      |            |        |            |      | <b>Debug Options</b>         |
| Instructions:       |                | L1 - Block ( | 0:   | L1 - Block 1 |      | L1 - Block | 2:     | L1 - Block | 3:   | Instruction time (s): 1      |
| Executing:          | WRITE 011;7f09 | State:       | 1    | State:       | 1    | State:     | М      | State:     | 1    |                              |
| Last executed:      | ldle           | Address:     | 000  | Address:     | 000  | Address:   | 011    | Address:   | 000  | Change instruction time (s): |
|                     |                | Data:        | 0000 | Data:        | 0000 | Data:      | 7f09   | Data:      | 0000 |                              |
|                     |                |              |      |              |      |            |        |            |      | Change                       |
| CPU 3               |                |              |      |              |      |            |        |            |      |                              |
| Instructions:       |                | L1 - Block ( | 0:   | L1 - Block 1 |      | L1 - Block | 2:     | L1 - Block | 3:   | Generate Instruction         |
| Executing:          | WRITE 100;7f83 | State:       | М    | State:       | 1    | State:     | 1      | State:     | 1    |                              |
| Last executed:      | ldle           | Address:     | 100  | Address:     | 000  | Address:   | 000    | Address:   | 000  |                              |
|                     |                | Data:        | 7f83 | Data:        | 0000 | Data:      | 0000   | Data:      | 0000 |                              |

CPU 0: DECODED CALC

CPU 1: DECODED CALC

CPU 2: DECODED WRITE 3 32521

BUS: REQ FROM CPU 2: FOUND COPIES []

CPU 3: DECODED WRITE 4 32643

BUS: REQ FROM CPU 3: FOUND COPIES []

| Instruction     | Cause<br>Processor | Transition        | P0<br>Effect | P1<br>Effect | P2<br>Effect | P3<br>Effect |
|-----------------|--------------------|-------------------|--------------|--------------|--------------|--------------|
| WRITE 011, 7F09 | P2                 | $I \to M$         | -            | -            | -            | -            |
| WRITE 100, 7F83 | Р3                 | $I \rightarrow M$ | _            | _            | _            | _            |

 $I \rightarrow M$  (Invalid  $\rightarrow$  Modified):

Happens when a processor writes to a memory address not currently in its cache (WRITE MISS), and no other caches have a copy.

| MOESI S             | imulator by J.A | A <i>Ibarra</i> |            |              |      |            |      |            |      |                              |
|---------------------|-----------------|-----------------|------------|--------------|------|------------|------|------------|------|------------------------------|
| CPU 0               |                 |                 |            |              |      |            |      |            |      | Actions                      |
| Instructions:       |                 | L1 - Block (    | 0:         | L1 - Block 1 |      | L1 - Block | 2:   | L1 - Block | 3:   | Charle Classification        |
| Executing:          | READ 011        | State:          | 1          | State:       | 1    | State:     | S    | State:     | 1    | Start Simulation             |
| Last executed:      | CALC            | Address:        | 000        | Address:     | 000  | Address:   | 011  | Address:   | 000  | Stop Simulation              |
|                     |                 | Data:           | 0000       | Data:        | 0000 | Data:      | 7f09 | Data:      | 0000 | Single Step                  |
|                     |                 |                 |            |              |      |            |      |            |      | Single Step                  |
| CPU 1               |                 |                 |            |              |      |            |      |            |      | Custom instruction:          |
| Instructions:       |                 | L1 - Block (    | 0:         | L1 - Block 1 | :    | L1 - Block |      | L1 - Block | 3:   | PO V                         |
| Executing:          | READ 101        | State:          | 1          | State:       | 1    | State:     | E    | State:     | 1    |                              |
| Last executed:      | CALC            | Address:        | 000        | Address:     | 000  | Address:   | 101  | Address:   | 000  | Add                          |
|                     |                 | Data:           | 0000       | Data:        | 0000 | Data:      | 0000 | Data:      | 0000 |                              |
| CPU 2               |                 |                 |            |              |      |            |      |            |      | <b>Debug Options</b>         |
| Instructions:       |                 | L1 - Block (    | <b>0</b> : | L1 - Block 1 |      | L1 - Block | 2:   | L1 - Block | 3:   | Instruction time (s): 1      |
| Executing:          | CALC            | State:          | 1          | State:       | 1    | State:     | 0    | State:     | 1    |                              |
| Last executed:      | WRITE 011;7f09  | Address:        | 000        | Address:     | 000  | Address:   | 011  | Address:   | 000  | Change instruction time (s): |
|                     |                 | Data:           | 0000       | Data:        | 0000 | Data:      | 7f09 | Data:      | 0000 |                              |
|                     |                 |                 |            |              |      |            |      |            |      | Change                       |
| CPU 3 Instructions: |                 | L1 - Block (    | 0:         | L1 - Block 1 |      | L1 - Block | 2:   | L1 - Block | 3:   |                              |
| Executing:          | WRITE 100;7f55  | State:          | М          | State:       | 1    | State:     | 1    | State:     | 1    | Generate Instruction         |
| Last executed:      | WRITE 100;7f83  | Address:        | 100        | Address:     | 000  | Address:   | 000  | Address:   | 000  |                              |
|                     |                 | Data:           | 7f55       | Data:        | 0000 | Data:      | 0000 | Data:      | 0000 |                              |

CPU 0: DECODED READ 3

CPU 0: READ MISS. PROBING...

BUS: REQ FROM CPU 0: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F4C0>]

CPU 1: DECODED READ 5

CPU 0: READ HIT ON PROBE

CPU 1: READ MISS. PROBING...

BUS: REQ FROM CPU 0: UPDATING CPU: 2 ADDRESS: 3 TYPE: RHP

CPU 2: DECODED CALC

CPU 3: DECODED WRITE 4 32597

BUS: REQ FROM CPU 1: FOUND COPIES []

CPU 1: READ MISS ON PROBE. READ FROM MEMORY.

BUS: REQ FROM CPU 1: READING 5 FROM MEMORY

WRITE HIT

BUS: REQ FROM CPU 3: WRITING 32643 TO ADDRESS 4

| Instruction | Cause<br>Processor | Transition        | P0<br>Effect | P1<br>Effect | P2<br>Effect      | P3<br>Effect |
|-------------|--------------------|-------------------|--------------|--------------|-------------------|--------------|
| READ 011    | PO                 | $I \rightarrow S$ | -            | _            | I $\rightarrow$ O | -            |
| READ 101    | P1                 | $I \rightarrow E$ | _            | _            | _                 | _            |

## $I \rightarrow S$ (Invalid $\rightarrow$ Shared) - on READ 011 by P0:

CPU 0 reads data it doesn't have (READ MISS), but a valid copy exists in CPU 2.

CPU 2 shares the data, changing its state to Owned (0).

CPU 0's cache line becomes Shared (S).

## I $\rightarrow$ E (Invalid $\rightarrow$ Exclusive) - on READ 101 by P1:

CPU 1 reads data not present in any other cache (READ MISS, no copies). Since it's the only copy, it gets the line in Exclusive (E) state.

### WRITE 100, 7F55 by P3:

The transition is not shown, likely because it was a write hit (already had the line in M or E).

No state change across processors is noted.



CPU 0: DECODED WRITE 5 32660

CPU 1: DECODED WRITE 4 32555

BUS: REQ FROM CPU 0: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F010>]

CPU 2: DECODED WRITE 4 32811

BUS: REQ FROM CPU 0: UPDATING CPU: 1 ADDRESS: 5 TYPE: WM

CPU 3: DECODED READ 3

BUS: REQ FROM CPU 1: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F730>]

CPU 3: READ MISS. PROBING...

BUS: REQ FROM CPU 1: UPDATING CPU: 3 ADDRESS: 4 TYPE: WM

BUS: REQ FROM CPU 2: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F010>]

BUS: REO FROM CPU 2: UPDATING CPU: 1 ADDRESS: 4 TYPE: WM

BUS: REQ FROM CPU 3: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7EF80>,

<CPU.CPU object at 0x0000021F84B7F4C0>]

CPU 3: READ HIT ON PROBE

BUS: REQ FROM CPU 3: UPDATING CPU: 0 ADDRESS: 3 TYPE: RHP

BUS: REQ FROM CPU 3: UPDATING CPU: 2 ADDRESS: 3 TYPE: RHP

| Instruction     | Cause<br>Processor | Transition    | P0<br>Effect | P1<br>Effect      | P2<br>Effect | P3<br>Effect      |
|-----------------|--------------------|---------------|--------------|-------------------|--------------|-------------------|
| WRITE 101, 7F94 | PO                 | $I \to M$     | -            | E $\rightarrow$ I | -            | -                 |
| WRITE 100, 7F2B | P1                 | $I \ \to \ M$ | -            | -                 | -            | $M \rightarrow I$ |
| WRITE 100, 802B | P2                 | $I \ \to \ M$ | -            | $M \rightarrow I$ | -            | -                 |
| READ 011        | Р3                 | I → S         | _            | _                 | _            | _                 |

### I $\rightarrow$ M (Invalid $\rightarrow$ Modified) - WRITE 101, 7F94 by P0:

CPU 0 writes to address 101; it doesn't have the line, so it moves from Invalid to Modified.

CPU 1 had the line in Exclusive (E), which is invalidated (E  $\rightarrow$  I).

### I $\rightarrow$ M - WRITE 100, 7F2B by P1:

CPU 1 writes a line it doesn't have, so I  $\rightarrow$  M.

CPU 3 had the line in Modified, which is now invalidated (M  $\rightarrow$  I).

#### I $\rightarrow$ M - WRITE 100, 802B by P2:

CPU 2 writes a line it didn't have; its state goes  $I \rightarrow M$ .

CPU 1 had the line in Modified, now invalidated  $(M \rightarrow I)$ .

## I $\rightarrow$ S - READ 011 by P3:

CPU 3 reads a line it doesn't have, so it becomes Shared (S).

Other processors may retain their states or adjust (e.g., from M/E  $\rightarrow$  O or I).

| MOESI Si            | mulator by J.A | A Ibarra      |      |              |      |            |      |              |       |                                         |
|---------------------|----------------|---------------|------|--------------|------|------------|------|--------------|-------|-----------------------------------------|
| CPU 0               |                |               |      |              |      |            |      |              |       | Actions                                 |
| Instructions:       |                | L1 - Block 0. | t:   | L1 - Block 1 |      | L1 - Block | 2:   | L1 - Block 3 | 3:    | 5 - 1 5 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 |
| Executing:          | READ 011       | State:        | 1    | State:       | 1    | State:     | 1    | State:       | М     | Start Simulation                        |
| Last executed:      | WRITE 101;7f94 | Address:      | 000  | Address:     | 000  | Address:   | 011  | Address:     | 101   | Stop Simulation                         |
|                     |                | Data:         | 0000 | Data:        | 0000 | Data:      | 7f09 | Data:        | 7f94  |                                         |
|                     |                |               |      |              |      |            |      |              |       | Single Step                             |
| CPU 1               |                |               |      |              |      |            |      |              |       | Custom instruction:                     |
| Instructions:       |                | L1 - Block 0  | :    | L1 - Block 1 |      | L1 - Block | 2:   | L1 - Block 3 | 3:    | [20]                                    |
| Executing:          | WRITE 011;8106 | State:        | 1    | State:       | 1    | State:     | 1    | State:       | М     | P0 ~                                    |
| Last executed:      | WRITE 100;7f2b | Address:      | 100  | Address:     | 000  | Address:   | 101  | Address:     | 011   | Add                                     |
|                     |                | Data:         | 7f2b | Data:        | 0000 | Data:      | 0000 | Data:        | 8106  |                                         |
| CPU 2               |                |               |      |              |      |            |      |              |       | Debug Options                           |
| Instructions:       |                | L1 - Block 0  | t    | L1 - Block 1 |      | L1 - Block | 2:   | L1 - Block 3 | 3:    | Instruction time (s): 1                 |
| Executing:          | WRITE 001;7fe5 | State:        | М    | State:       | 1    | State:     | 1    | State:       | М     | instruction time (s).                   |
| Last executed:      | WRITE 100;802b | Address:      | 100  | Address:     | 000  | Address:   | 011  | Address:     | 001   | Change instruction time (s):            |
|                     |                | Data:         | 802b | Data:        | 0000 | Data:      | 7f09 | Data:        | 7fe5  |                                         |
| 05110               |                |               |      |              |      |            |      |              |       | Change                                  |
| CPU 3 Instructions: |                | L1 - Block 0  | ·    | L1 - Block 1 |      | L1 - Block | 2.   | L1 - Block 3 | 3.    |                                         |
| Executing:          | READ 011       | State:        |      | State:       |      | State:     |      | State:       | ~<br> | Generate Instruction                    |
|                     | READ 011       | Address:      | 100  | Address:     | 000  | Address:   | 011  | Address:     | 000   |                                         |
| Last executed:      |                |               |      |              |      | rical coo. |      |              |       |                                         |

CPU 0: DECODED READ 3

CPU 0: READ HIT

CPU 1: DECODED WRITE 3 33030

CPU 2: DECODED WRITE 1 32741

BUS: REQ FROM CPU 1: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7EF80>,

<CPU.CPU object at 0x0000021F84B7F4C0>, <CPU.CPU object at

0x0000021F84B7F730>]

CPU 3: DECODED READ 3

BUS: REQ FROM CPU 1: UPDATING CPU: 0 ADDRESS: 3 TYPE: WM

CPU 3: READ HIT

BUS: REQ FROM CPU 1: UPDATING CPU: 2 ADDRESS: 3 TYPE: WM

BUS: REQ FROM CPU 1: UPDATING CPU: 3 ADDRESS: 3 TYPE: WM

BUS: REQ FROM CPU 2: FOUND COPIES []

| Instruction     | Cause<br>Processor | Transition        | P0<br>Effect      | P1<br>Effect | P2<br>Effect | P3<br>Effect |  |
|-----------------|--------------------|-------------------|-------------------|--------------|--------------|--------------|--|
| READ 011        | PO                 | -                 | _                 | -            | -            | -            |  |
| WRITE 011, 8106 | P1                 | $I \rightarrow M$ | S $\rightarrow$ I | -            | O → I        | S - I        |  |
| WRITE 001, 7FE5 | P2                 | $I \rightarrow M$ | -                 | -            | -            | -            |  |

### READ 011 by P0:

No transition occurs (it's a read hit), so the data is already in P0's cache in a valid state like S or O.

## WRITE 011, 8106 by P1:

CPU 1 performs a write, but others already have the line (PO, P2, P3).

P1: I  $\rightarrow$  M - It didn't have the line, now becomes the owner with updated data.

P0:  $S \rightarrow I - Had$  a shared copy, now invalidated.

P2:  $O \rightarrow I - Had$  an owned copy, now invalidated.

P3:  $S \rightarrow I - Also invalidated$ .

## WRITE 001, 7FE5 by P2:

Simple case of a write miss with no existing copies.

P2: I  $\rightarrow$  M - Becomes the only holder of the modified data.

Others remain unaffected.



CPU 0: DECODED READ 4

CPU 0: READ MISS. PROBING...

BUS: REQ FROM CPU 0: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F4C0>]

CPU 0: READ HIT ON PROBE

CPU 1: DECODED CALC

CPU 2: DECODED READ 3

BUS: REQ FROM CPU 0: UPDATING CPU: 2 ADDRESS: 4 TYPE: RHP

CPU 2: READ MISS. PROBING...

CPU 3: DECODED READ 3

CPU 3: READ MISS. PROBING...

BUS: REQ FROM CPU 2: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F010>]

CPU 2: READ HIT ON PROBE

BUS: REQ FROM CPU 2: UPDATING CPU: 1 ADDRESS: 3 TYPE: RHP

BUS: REQ FROM CPU 3: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F010>,

<CPU.CPU object at 0x0000021F84B7F4C0>]

CPU 3: READ HIT ON PROBE

BUS: REQ FROM CPU 3: UPDATING CPU: 1 ADDRESS: 3 TYPE: RHP

BUS: REQ FROM CPU 3: UPDATING CPU: 2 ADDRESS: 3 TYPE: RHP

| Instruction | Cause<br>Processor | Transition        | P0<br>Effect | P1<br>Effect      | P2<br>Effect | P3<br>Effect |
|-------------|--------------------|-------------------|--------------|-------------------|--------------|--------------|
| READ 100    | PO                 | $I \rightarrow S$ | -            | -                 | M → O        | _            |
| READ 011    | P2                 | $I \rightarrow S$ | -            | $M \rightarrow O$ | -            | _            |
| READ 011    | Р3                 | I → S             | -            | -                 | -            | _            |

| PU 0                         |                |            |      |            |      |            |      |            |      | Actions                      |
|------------------------------|----------------|------------|------|------------|------|------------|------|------------|------|------------------------------|
| nstructions:                 |                | L1 - Block | 0:   | L1 - Block | 1:   | L1 - Block | 2:   | L1 - Block | 3:   |                              |
| Executing:                   | WRITE 011;7f63 | State:     | S    | State:     | 1    | State:     | М    | State:     | 1    | Start Simulation             |
| Last executed:               | READ 100       | Address:   | 100  | Address:   | 000  | Address:   | 011  | Address:   | 101  | Stop Simulation              |
|                              |                | Data:      | 802b | Data:      | 0000 | Data:      | 7f63 | Data:      | 7f94 | Single Step                  |
| CPU 1                        |                |            |      |            |      |            |      |            |      | Custom instruction:          |
| Instructions:                |                | L1 - Block | 0:   | L1 - Block | 1:   | L1 - Block | 2:   | L1 - Block | 3:   |                              |
| Executing:                   | CALC           | State:     | 1    | State:     | 1    | State:     | 1    | State:     | 1    | P0 ~                         |
| Last executed:               | CALC           | Address:   | 100  | Address:   | 000  | Address:   | 101  | Address:   | 011  | Add                          |
|                              |                | Data:      | 7f2b | Data:      | 0000 | Data:      | 0000 | Data:      | 8106 | 35 <u></u>                   |
| CPU 2                        |                |            |      |            |      |            |      |            |      | Debug Options                |
| Instructions:                |                | L1 - Block | 0:   | L1 - Block | 1:   | L1 - Block | 2:   | L1 - Block | 3:   | Instruction time (s): 1      |
|                              | READ 100       | State:     | 0    | State:     | 1    | State:     | 1    | State:     | м    | instruction time (s).        |
| Executing:                   |                | Address:   | 100  | Address:   | 000  | Address:   | 011  | Address:   | 001  | Change instruction time (s): |
| Executing:<br>Last executed: | READ 011       | Address.   |      |            |      |            |      |            |      |                              |
|                              | READ 011       | Data:      | 802b | Data       | 0000 | Data:      | 8106 | Data:      | 7fe5 |                              |
| Last executed:               | READ 011       |            | 802b | Data:      | 0000 | Data:      | 8106 | Data       | 7fe5 | Change                       |
| Last executed:               | READ 011       | Data:      |      |            |      |            |      |            |      | Change                       |
| CPU 3                        |                | Data:      |      | L1 - Block |      | L1 - Block | 2:   | L1 - Block | 3:   | Change Generate Instruction  |
| Last executed:               | WRITE 101;7fc0 | Data:      |      |            |      |            |      |            |      |                              |

CPU 0: DECODED WRITE 3 32611

BUS: REQ FROM CPU 0: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F010>, <CPU.CPU object at 0x0000021F84B7F4C0>, <CPU.CPU object at 0x0000021F84B7F730>]

BUS: REQ FROM CPU 0: UPDATING CPU: 1 ADDRESS: 3 TYPE: WM
BUS: REQ FROM CPU 0: UPDATING CPU: 2 ADDRESS: 3 TYPE: WM
BUS: REO FROM CPU 0: UPDATING CPU: 3 ADDRESS: 3 TYPE: WM

CPU 1: DECODED CALC

CPU 2: DECODED READ 4

CPU 2: READ HIT

CPU 3: DECODED WRITE 5 32704

BUS: REQ FROM CPU 3: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7EF80>]

BUS: REQ FROM CPU 3: UPDATING CPU: 0 ADDRESS: 5 TYPE: WM

| Instruction     | Cause<br>Processor | Transition        | P0<br>Effect      | P1<br>Effect      | P2<br>Effect      | P3<br>Effect      |
|-----------------|--------------------|-------------------|-------------------|-------------------|-------------------|-------------------|
| WRITE 011, 7F63 | PO                 | $I \ \to \ M$     | -                 | O $\rightarrow$ I | S $\rightarrow$ I | S $\rightarrow$ I |
| READ 100        | P2                 | -                 | -                 | -                 | -                 | -                 |
| WRITE 101, 7FC0 | Р3                 | $I \rightarrow M$ | $M \rightarrow I$ | -                 | -                 | _                 |

## I $\rightarrow$ S - READ 100 by P0:

CPU 0 reads a line it doesn't have → becomes Shared (S).

CPU 2 had the line in Modified (M)  $\rightarrow$  it downgrades to Owned (O), since it supplies the data.

# I $\rightarrow$ S - READ 011 by P2:

CPU 2 reads a line it doesn't have  $\rightarrow$  moves to Shared (S).

CPU 1 had the line in Modified  $(M) \rightarrow \text{downgrades to Owned } (0)$ .

## I $\rightarrow$ S - READ 011 by P3:

CPU 3 also reads the same line and transitions to Shared.

No further state changes for others, as they already had the line in  $\ensuremath{\textsc{0}}$  or  $\ensuremath{\textsc{S}}\xspace.$ 



CPU 0: DECODED READ 5

CPU 0: READ MISS. PROBING...

BUS: REQ FROM CPU 0: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F730>]

CPU 1: DECODED WRITE 4 32697

CPU 2: DECODED READ 2

CPU 0: READ HIT ON PROBE

CPU 2: READ MISS. PROBING...

BUS: REQ FROM CPU 0: UPDATING CPU: 3 ADDRESS: 5 TYPE: RHP

CPU 3: DECODED READ 3

BUS: REQ FROM CPU 1: FOUND COPIES [<CPU.CPU object at  $0\times0000021F84B7EF80>$ , <CPU.CPU object at  $0\times0000021F84B7F4C0>$ ]

CPU 3: READ MISS. PROBING...

BUS: REQ FROM CPU 1: UPDATING CPU: 0 ADDRESS: 4 TYPE: WM

BUS: REO FROM CPU 1: UPDATING CPU: 2 ADDRESS: 4 TYPE: WM

BUS: REQ FROM CPU 2: FOUND COPIES []

CPU 2: READ MISS ON PROBE. READ FROM MEMORY.

BUS: REQ FROM CPU 2: READING 2 FROM MEMORY

BUS: REQ FROM CPU 3: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7EF80>]

CPU 3: READ HIT ON PROBE

BUS: REQ FROM CPU 3: UPDATING CPU: 0 ADDRESS: 3 TYPE: RHP

| Instruction     | Cause<br>Processor | Transition        | P0<br>Effect      | P1<br>Effect | P2<br>Effect | P3<br>Effect      |
|-----------------|--------------------|-------------------|-------------------|--------------|--------------|-------------------|
| READ 101        | PO                 | $I \rightarrow S$ | -                 | -            | -            | $M \rightarrow O$ |
| WRITE 100, 7FB9 | P1                 | $I \to M$         | -                 | -            | O → I        | _                 |
| READ 010        | P2                 | $I \to E$         | -                 | -            | -            | _                 |
| READ 011        | Р3                 | $I \rightarrow S$ | $M \rightarrow O$ | _            | _            | _                 |

## I $\rightarrow$ S - READ 101 by P0:

CPU 0 reads a line it doesn't have  $\rightarrow$  transitions to Shared (S).

CPU 3 had the line in Modified (M)  $\rightarrow$  downgrades to Owned (O) to supply the data.

## I $\rightarrow$ M - WRITE 100, 7FB9 by P1:

CPU 1 writes a line it doesn't have  $\rightarrow$  transitions to Modified (M).

CPU 2 had it in Owned (O)  $\rightarrow$  now invalidated (O  $\rightarrow$  I).

### I $\rightarrow$ E - READ 010 by P2:

CPU 2 reads a line with no other copies  $\rightarrow$  moves to Exclusive (E).

Since no one else has it, it can read and potentially write without notification.

## I $\rightarrow$ S - READ 011 by P3:

CPU 3 reads a line it doesn't have  $\rightarrow$  moves to Shared (S).

CPU 0 had it in Modified  $(M) \rightarrow becomes Owned (O)$  to provide the data.

| MOESI S             |                |            |         |            |      |            |         |            |      | Actions                     |
|---------------------|----------------|------------|---------|------------|------|------------|---------|------------|------|-----------------------------|
| Instructions:       |                | L1 - Block | 0:      | L1 - Block | 1:   | L1 - Block | 2:      | L1 - Block | 3:   |                             |
| Executing:          | WRITE 011;80d8 | State:     | 1       | State:     | 1    | State:     | М       | State:     | 1    | Start Simulation            |
| Last executed:      | READ 101       | Address:   | 100     | Address:   | 000  | Address:   | 011     | Address:   | 101  | Stop Simulation             |
|                     |                | Data:      | 802b    | Data:      | 0000 | Data:      | 80d8    | Data:      | 7fc0 |                             |
|                     |                |            |         |            |      |            |         |            |      | Single Step                 |
| CPU 1               |                |            |         |            |      |            |         |            |      | Custom instruction:         |
| Instructions:       |                | L1 - Block | 0:      | L1 - Block | 1:   | L1 - Block | 2:      | L1 - Block | 3:   | PO V                        |
| Executing:          | READ 010       | State:     | М       | State:     | 1    | State:     | 1       | State:     | T.   | PU                          |
| Last executed:      | WRITE 100;7fb9 | Address:   | 100     | Address:   | 010  | Address:   | 101     | Address:   | 011  | Add                         |
|                     |                | Data:      | 7fb9    | Data:      | 0000 | Data:      | 0000    | Data:      | 8106 |                             |
| CPU 2               |                |            |         |            |      |            |         |            |      | Debug Options               |
| Instructions:       |                | L1 - Block | 0:      | L1 - Block | 1:   | L1 - Block | 2:      | L1 - Block | 3:   | Instruction time (s): 1     |
| Executing:          | WRITE 101;8027 | State:     | 1       | State:     | 1    | State:     | 1       | State:     | М    |                             |
| Last executed:      | READ 010       | Address:   | 100     | Address:   | 010  | Address:   | 011     | Address:   | 101  | Change instruction time (s) |
|                     |                | Data:      | 802b    | Data:      | 0000 | Data:      | 8106    | Data:      | 8027 |                             |
|                     |                |            |         |            |      |            |         |            |      | Change                      |
| CPU 3 Instructions: |                | L1 - Block | n·      | L1 - Block | 1.   | L1 - Block | g.      | L1 - Block | g.   |                             |
| Executing:          | WRITE 010:7fd9 | State:     | v.<br>I | State:     | м    | State:     | z.<br>I | State:     | J.   | Generate Instruction        |
| Last executed:      | READ 011       | Address:   | 100     | Address:   | 010  | Address:   | 011     | Address:   | 101  |                             |
| Last onbodica.      |                | Data:      | 7f55    | Data:      | 7fd9 | Data:      | 7f63    | Data:      | 7fc0 |                             |

CPU 0: DECODED WRITE 3 32984

WRITE MISS

BUS: REQ FROM CPU 0: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F730>]

CPU 1: DECODED READ 2

CPU 1: READ MISS. PROBING...

BUS: REQ FROM CPU 0: UPDATING CPU: 3 ADDRESS: 3 TYPE: WM

CPU 3: DECODED WRITE 2 32729

CPU 2: DECODED WRITE 5 32807

BUS: REQ FROM CPU 1: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F4C0>]

CPU 1: READ HIT ON PROBE

BUS: REQ FROM CPU 1: UPDATING CPU: 2 ADDRESS: 2 TYPE: RHP

BUS: REQ FROM CPU 3: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F010>,

<CPU.CPU object at 0x0000021F84B7F4C0>]

BUS: REQ FROM CPU 3: UPDATING CPU: 1 ADDRESS: 2 TYPE: WM

BUS: REQ FROM CPU 3: UPDATING CPU: 2 ADDRESS: 2 TYPE: WM

BUS: REQ FROM CPU 2: WRITING 32741 TO ADDRESS 1

BUS: REQ FROM CPU 2: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7EF80>,

<CPU.CPU object at 0x0000021F84B7F730>]

BUS: REQ FROM CPU 2: UPDATING CPU: 0 ADDRESS: 5 TYPE: WM

BUS: REQ FROM CPU 2: UPDATING CPU: 3 ADDRESS: 5 TYPE: WM

| Instruction     | Cause<br>Processor | Transition        | P0<br>Effect      | P1<br>Effect | P2<br>Effect | P3<br>Effect      |
|-----------------|--------------------|-------------------|-------------------|--------------|--------------|-------------------|
| WRITE 011, 80D8 | PO                 | $O \rightarrow M$ | -                 | -            | -            | S $\rightarrow$ I |
| READ 010        | P1                 | $I \rightarrow S$ | -                 | -            | E → O        | -                 |
| WRITE 101, 8027 | P2                 | $I \rightarrow M$ | S $\rightarrow$ I | -            | -            | O → I             |
| WRITE 010, 7FD9 | Р3                 | $I \rightarrow M$ | _                 | S -> I       | O → I        | _                 |

## $O \rightarrow M - WRITE 011 by P0:$

CPU 0 has the line in Owned (O) and writes  $\rightarrow$  transitions to Modified (M).

CPU 3 had a Shared (S) copy  $\rightarrow$  must invalidate (S  $\rightarrow$  I).

# $I \rightarrow S - READ 010 by P1:$

CPU 1 reads a line it doesn't have  $\rightarrow$  moves to Shared (S).

CPU 2 had it in Exclusive (E)  $\rightarrow$  downgrades to Owned (O) to share it.

### I $\rightarrow$ M - WRITE 101 by P2:

CPU 2 writes a line it doesn't have  $\rightarrow$  transitions to Modified (M).

CPU 0 had it in Shared (S)  $\rightarrow$  invalidated.

CPU 3 had it in Owned (0)  $\rightarrow$  invalidated.

### I $\rightarrow$ M - WRITE 010 by P3:

CPU 3 writes a line it doesn't have  $\rightarrow$  transitions to Modified (M).

CPU 1 had it in Shared (S)  $\rightarrow$  invalidated.

CPU 2 had it in Owned (0)  $\rightarrow$  invalidated.

| CPU 0          |                |              |      |              |      |            |      |               |      | Actions                      |
|----------------|----------------|--------------|------|--------------|------|------------|------|---------------|------|------------------------------|
| Instructions:  |                | L1 - Block ( | ):   | L1 - Block 1 |      | L1 - Block | 2:   | L1 - Block 3: |      | Start Simulation             |
| Executing:     | WRITE 101;80ad | State:       | 1    | State:       | 1    | State:     | М    | State:        | М    | Start Simulation             |
| Last executed: | WRITE 011;80d8 | Address:     | 100  | Address:     | 000  | Address:   | 011  | Address:      | 101  | Stop Simulation              |
|                |                | Data:        | 802b | Data:        | 0000 | Data:      | 80d8 | Data:         | 80ad | Single Step                  |
| CPU 1          |                |              |      |              |      |            |      |               |      | Custom instruction:          |
| Instructions:  |                | L1 - Block ( | ):   | L1 - Block 1 |      | L1 - Block | 2:   | L1 - Block 3: |      | Custom instruction.          |
| Executing:     | READ 010       | State:       | 1    | State:       | s    | State:     | 1    | State:        | 1    | P0 ~                         |
| Last executed: | READ 010       | Address:     | 100  | Address:     | 010  | Address:   | 101  | Address:      | 011  | Add                          |
|                |                | Data:        | 7fb9 | Data:        | 7fd9 | Data:      | 0000 | Data:         | 8106 |                              |
| CPU 2          |                |              |      |              |      |            |      |               |      | Debug Options                |
| Instructions:  |                | L1 - Block ( | ):   | L1 - Block 1 |      | L1 - Block | 2:   | L1 - Block 3: |      | Instruction time (s): 1      |
| Executing:     | WRITE 100;7fdc | State:       | М    | State:       | 1    | State:     | 1    | State:        | 1    | instruction time (s).        |
| Last executed: | WRITE 101;8027 | Address:     | 100  | Address:     | 010  | Address:   | 011  | Address:      | 101  | Change instruction time (s): |
|                |                | Data:        | 7fdc | Data:        | 0000 | Data:      | 8106 | Data:         | 8027 |                              |
|                |                |              |      |              |      |            |      |               |      | Change                       |
| CPU 3          |                |              |      |              |      |            |      |               |      |                              |
| Instructions:  |                | L1 - Block ( | ):   | L1 - Block 1 |      | L1 - Block |      | L1 - Block 3: |      | Generate Instruction         |
| Executing:     | CALC           | State:       | 1    | State:       | 0    | State:     | 1    | State:        | 1    |                              |
| Last executed: | WRITE 010;7fd9 | Address:     | 100  | Address:     | 010  | Address:   | 011  | Address:      | 101  |                              |
|                |                | Data:        | 7f55 | Data:        | 7fd9 | Data:      | 7f63 | Data:         | 7fc0 |                              |

CPU 0: DECODED WRITE 5 32941

BUS: REQ FROM CPU 0: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F4C0>]

CPU 1: DECODED READ 2

BUS: REQ FROM CPU 0: UPDATING CPU: 2 ADDRESS: 5 TYPE: WM

CPU 1: READ MISS. PROBING...

BUS: REQ FROM CPU 1: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F730>]

CPU 1: READ HIT ON PROBE

BUS: REQ FROM CPU 1: UPDATING CPU: 3 ADDRESS: 2 TYPE: RHP

CPU 2: DECODED WRITE 4 32732

BUS: REQ FROM CPU 2: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F010>]

CPU 3: DECODED CALC

BUS: REQ FROM CPU 2: UPDATING CPU: 1 ADDRESS: 4 TYPE: WM

| Instruction     | Cause<br>Processor | Transition        | P0<br>Effect | P1<br>Effect      | P2<br>Effect      | P3<br>Effect      |
|-----------------|--------------------|-------------------|--------------|-------------------|-------------------|-------------------|
| WRITE 101, 80AD | PO                 | $I \to M$         | -            | -                 | $M \rightarrow I$ | -                 |
| READ 010        | P1                 | $I \rightarrow S$ | -            | -                 | -                 | $M \rightarrow O$ |
| WRITE 100, 7FDC | P2                 | $I \rightarrow M$ | -            | $M \rightarrow I$ | -                 | -                 |

```
WRITE 101, 80AD - by P0 (CPU 0):
   CPU 0 writes to address 5.
   Its local state was Invalid (I) \rightarrow moves to Modified (M).
   CPU 2 had the line in Modified (M) \rightarrow must invalidate (M \rightarrow I).
READ 010 - by P1 (CPU 1):
   CPU 1 reads address 2, and didn't have the line \rightarrow I \rightarrow Shared (S).
   CPU 3 had it in Modified (M) \rightarrow must downgrade to Owned (M \rightarrow O) to allow
   sharing.
WRITE 100, 7FDC - by P2 (CPU 2):
   CPU 2 writes to address 4.
   It had the line as Invalid (I) \rightarrow becomes Modified (M).
   CPU 1 had it in Modified (M) \rightarrow must invalidate (M \rightarrow I).
```

| CPU 0          |                |            |            |            |      |            |      |              |      | Actions                      |
|----------------|----------------|------------|------------|------------|------|------------|------|--------------|------|------------------------------|
| Instructions:  |                | L1 - Block | <b>0</b> : | L1 - Block | 1:   | L1 - Block | 2:   | L1 - Block   | 3:   | Start Simulation             |
| Executing:     | READ 011       | State:     | 1          | State:     | 1    | State:     | 1    | State:       | М    | Start Simulation             |
| Last executed: | WRITE 101;80ad | Address:   | 100        | Address:   | 000  | Address:   | 011  | Address:     | 101  | Stop Simulation              |
|                |                | Data:      | 802b       | Data:      | 0000 | Data:      | 80d8 | Data:        | 80ad | Single Step                  |
| CPU 1          |                |            |            |            |      |            |      |              |      | Custom instruction:          |
| Instructions:  |                | L1 - Block | 0:         | L1 - Block | 1:   | L1 - Block | 2:   | L1 - Block 3 | 3:   |                              |
| Executing:     | WRITE 011;8001 | State:     | 1          | State:     | 1    | State:     | 1    | State:       | М    | P0 V                         |
| Last executed: | READ 010       | Address:   | 100        | Address:   | 010  | Address:   | 101  | Address:     | 011  | Add                          |
|                |                | Data:      | 7fb9       | Data:      | 7fd9 | Data:      | 0000 | Data:        | 8001 |                              |
| CPU 2          |                |            |            |            |      |            |      |              |      | Debug Options                |
| Instructions:  |                | L1 - Block | 0:         | L1 - Block | 1:   | L1 - Block | 2:   | L1 - Block 3 | 3:   | Instruction time (s): 1      |
| Executing:     | WRITE 010;8015 | State:     | M          | State:     | M    | State:     | 1    | State:       | 1    |                              |
| Last executed: | WRITE 100;7fdc | Address:   | 100        | Address:   | 010  | Address:   | 011  | Address:     | 101  | Change instruction time (s): |
|                |                | Data:      | 7fdc       | Data:      | 8015 | Data:      | 8106 | Data:        | 8027 |                              |
|                |                |            |            |            |      |            |      |              |      | Change                       |
| CPU 3          |                |            | _          |            |      |            | _    |              | _    |                              |
| Instructions:  |                | L1 - Block |            | L1 - Block |      | L1 - Block | 2:   | L1 - Block 3 |      | Generate Instruction         |
| Executing:     | WRITE 110;800a | State:     | M          | State:     | I    | State:     | 1    | State:       | 1    |                              |
| Last executed: | CALC           | Address:   | 110        | Address:   | 010  | Address:   | 011  | Address:     | 101  |                              |

CPU 0: DECODED READ 3

CPU 0: READ HIT

CPU 1: DECODED WRITE 3 32769

BUS: REQ FROM CPU 1: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7EF80>]

BUS: REQ FROM CPU 1: UPDATING CPU: 0 ADDRESS: 3 TYPE: WM

CPU 2: DECODED WRITE 2 32789

BUS: REQ FROM CPU 2: FOUND COPIES [<CPU.CPU object at 0x0000021F84B7F010>,

<CPU.CPU object at 0x0000021F84B7F730>]

BUS: REQ FROM CPU 2: UPDATING CPU: 1 ADDRESS: 2 TYPE: WM

BUS: REQ FROM CPU 2: UPDATING CPU: 3 ADDRESS: 2 TYPE: WM

CPU 3: DECODED WRITE 6 32778

BUS: REQ FROM CPU 3: FOUND COPIES []

| Instruction     | Cause<br>Processor | Transition                      | P0<br>Effect      | P1<br>Effect      | P2<br>Effect | P3<br>Effect |
|-----------------|--------------------|---------------------------------|-------------------|-------------------|--------------|--------------|
| WRITE 011, 8001 | P1                 | $I \to M$                       | $M \rightarrow I$ | -                 | -            | -            |
| WRITE 010, 8015 | P2                 | $I \ \to \ M$                   | _                 | $S \rightarrow I$ | -            | O → I        |
| WRITE 110, 800A | Р3                 | $\mathbb{I} \ \to \ \mathbb{M}$ | _                 | -                 | -            | -            |

```
WRITE 011, 8001 - by P1 (CPU 1):
  CPU 1 writes to address 3.
   Its local cache line was Invalid (I) \rightarrow becomes Modified (M).
  CPU 0 had the line in Modified (M) \rightarrow must invalidate (M \rightarrow I) to ensure
  only one valid copy exists.
WRITE 010, 8015 - by P2 (CPU 2):
  CPU 2 writes to address 2.
  Its local state was Invalid (I) \rightarrow moves to Modified (M).
  CPU 1 had the line in Shared (S) \rightarrow must invalidate (S \rightarrow I).
  CPU 3 had it in Owned (0) \rightarrow must also invalidate (0 \rightarrow I).
WRITE 110, 800A - by P3 (CPU 3):
  CPU 3 writes to address 6.
  No other processors have this address cached.
  So it transitions from Invalid (I) to Modified (M) with no other effects.
```

## 1. Simulation Results:

The simulation results demonstrate how cache coherence is maintained across different processors using the MOESI (Modified, Owned, Exclusive, Shared, Invalid) protocol. Each state plays a crucial role in ensuring data consistency:

- **M Modified**: The most updated version of the line is in this cache. No copies exist in any other cache. This data is different from the memory version.
- **O Owned**: This cache line can exist in more than one cache. However, it contains the most updated version of the data. Similar to M, only the line in one core can be in this state; the others must be in state S.
- **E Exclusive**: This cache line has the same data as the main memory. No copies exist in any other cache. This occurs when it is read directly.
- **S Shared**: The line in this state is coherent with main memory. Copies may exist in other caches. The data is shared with other processors.
- **I Invalid**: The line is invalid. It follows several rules:
  - 1. If a write is made and the line is not in state M or E, all copies must be invalidated, as they do not have the most updated data.
  - **2.** The system may discard an invalid line to make space for one that contains another address, depending on the replacement policy for **cache lines**.
  - **3.** If a line is in **M** state, reads from other caches obtain the updated data, and the one that shared it changes to **O** (Owned).
  - **4.** If a line is in **E** state, reads from other caches change to **S**, and the one that shared it changes to **S** (Shared).



State machine of a cache line for the MOESI protocol



Bus state machine for the MOESI protocol.

# 2. Tools and Environment:

MOESI simulator was used for simulation.

# 3. Conclusion:

Through this lab, we gained practical insight into how the MOESI cache coherence protocol ensures memory consistency across multiple processor cores. By simulating various memory operations across four CPUs, we observed how each instruction can cause a series of state transitions across the L1 caches, such as transitions from Invalid to Shared, Shared to Invalid, or Owned to Modified, depending on the access type and current cache states.

The step-by-step simulation highlighted the importance of bus-based communication in enforcing coherence and the impact of one processor's actions on the cache states of others. For instance, a write by one processor often resulted in invalidation of shared copies held by others, demonstrating the trade-offs between performance and consistency in cache coherence protocols.

Overall, this exercise solidified our theoretical understanding of the MOESI protocol by visualizing real-time state transitions and provided valuable experience in interpreting system-level cache interactions in multiprocessor systems.