## EXERCISE 5

Assume that a computer system is equipped with L1 and L2 caches:



- L1 data cache is a 2-way set associative cache with 2 sets and block size of 32 bytes.
- L1 instruction cache is a 2-way set associative cache with 2 sets and block size of 32 bytes.
- L2 unified cache is a 4-way set associative cache with 4 sets and block size of 32 bytes.

The caches use the Least Recently Used algorithm for the block replacement, and are initially empty.

| L1 I             | nstruction cache | L1              | L1 Data cache |  |  |  |
|------------------|------------------|-----------------|---------------|--|--|--|
| 0                | 1                | 0               | 1             |  |  |  |
| 0x20             | 0×21             | Ox 1000 0x 1006 | 0×1001        |  |  |  |
| 0x 27.           | 0×23             | 0×100 2         | 0x1003        |  |  |  |
| L2 Unified cache |                  |                 |               |  |  |  |
| 0                | 1                | 2               | 3             |  |  |  |
| 0×20 0×1001      |                  | 0×100Z          | 0×1003        |  |  |  |
| 0x1000           | 0x21             | 0× 22.          | 0×23          |  |  |  |
|                  |                  | 0× 100 b        |               |  |  |  |
|                  |                  |                 |               |  |  |  |

Complete the table below to show how the following memory addresses can be accessed. Write L1 when the specified memory address is found in one of the L1 caches, L2 when the memory address is found in the L2 cache, and Miss when the memory address is not available in both levels of caches.

| Time | Instr Addr                     | L1/L2/Miss | Data Addr              | L1/L2/Miss |
|------|--------------------------------|------------|------------------------|------------|
| 1    | 0x410 <b>0<sub>x</sub> 2</b> 0 | M          | 0x20000 <b>0x100</b> 7 | M          |
| 2    | 0x414 <b>0x20</b>              | L1         | 0x20024 0x1001         | М          |
| 3    | 0x430 <b>0x21</b>              | М          | 0x20060 0x1003         | М          |
| 4    | 0x434 <b>0x21</b>              | L1         |                        |            |
| 5    | 0x418 <b>0×2</b> 0             | <u>L</u> 1 |                        |            |
| 6    | 0x400 <b>0×2</b> 0             | L1         | 0x20004 0x 1002        | М          |
| 7    | 0x440 <b>0x22</b>              | М          | 0x200C8 <i>0x</i> 1006 | M          |
| 8    | 0x460 <b>0x23</b>              | М          | 0x20028 0x1001         | L1         |
| 9    | 0x450 <b>Ux22</b>              | L1         | 0x200C4 0x100          | <b>L</b> 1 |
| 10   | 0x430 <b>0</b> x21             | L1         |                        |            |
| 11   | 0x400 <b>0x20</b>              | L1         |                        |            |
| 12   | 0x460 <b>0x23</b>              | <b>L1</b>  |                        |            |
| 13   | 0x440 <b>0x22</b>              | 4          |                        |            |

Mem address 0x410 = 0b010000010000 ---> Block address = 0x20

```
#define N 1000

int array_compare(int A[N], int B[N]) {
   int i;

   for(i=0; i<N, i++) {
      if (A[i] != B[i]) {
        return 0;
      }
   }

   return 1;
}</pre>
```

Suppose our computer uses a 2-way set associative cache with 4 sets, and cache line size is 16 bytes (4 integers). Here, we assume that the starting address of the arrays A and B are address O and address O and O000, respectively.

1. What is the hit rate when both arrays are identical?

$$\frac{750}{1000} = 757$$

2. What is the hit rate after we make the cache line size be 64 bytes? We still assume that both arrays are still identical.

$$\frac{937}{1000} = 93.7\%$$