**Homework 4: Memory**

**Due date: 11:59PM Nov. 8**

1. Consider an array declared in C as "double a[100];". How many 64-byte cache lines are required to hold the complete array? [4pts]
2. Consider the byte address 0x002468ac. What is the value modulo 64? (That is, what is the offset of this address within a 64-byte block?) [4pts]
3. Consider the byte address 0x002468ac. What is the value shifted to the right by 6 bits? (That is, what is the block address corresponding to this byte address when using 64-byte blocks?) [4pts]
4. Consider matrix transpose written in C. Which array is exhibiting spatial locality: array "a", "b", or both? (Note that NROWS and NCOLS could each be relatively large compared to the size of the cache.) [4pts]

for(i=0; i<NROWS; i++){

for(j=0; j<NCOLS; j++){

b[i][j] = a[j][i];

}

}

1. Consider a 4 GB byte-addressable main memory (32-bit address) with a level-1 data cache that is eight-way set-associative, 32 KB in size, with 64-byte block size. [12pts]

a) How many total blocks are there in cache?

b) How many sets are there?

c) Show how the main memory address is partitioned into fields for the cache access and give the bit lengths of these fields.

1. Consider a direct-mapped data cache design in which a 32-bit address is divided into these three fields: 20-bit tag, 9-bit index, and 3-bit offset. [16pts]

a) How large is a line in number of bytes?

b) How many lines are in the cache?

c) How large is the cache in number of bytes?

d) For the following segment of code written in C, where "sum" and the array "a" are typed as 4-byte integers, what is the miss rate?

(Assume the variable "sum" and the loop index "i" are register-allocated by the compiler within the body of the loop and thus do not cause data cache accesses within the loop.)

for(i=0;i<4096;i++){

sum = sum + a[i];

}

1. Assume a 256-byte main memory and a four-line cache with four bytes Per line. The cache is initially empty. For the byte address reference stream (reads) given below circle which of the references are hits for the different cache placement schemes. Also, show the final contents of the cache. (The byte addresses are in decimal.) [16pts]

a) direct-mapped

0, 16, 1, 31, 2, 32, 3, 17, 4, 18

b) fully-associative with first-in-first-out replacement

0, 16, 1, 31, 2, 32, 3, 17, 4, 18

1. Consider a computer with a paged logical address space with 16 pages and each page is 2 Kbytes. The logical address space is mapped into a 512-Kbyte of physical memory space. (12pts)
   1. Draw the fields in the logical and physical address and show the number of bits of each field.
   2. Draw the page table of a process and show the number of entries in the table and number of bits per entry.
   3. Populate the page table for process, namely A, which is currently running on the CPU. Several pages of process A is in the physical memory as follows:

|  |  |
| --- | --- |
|  | … |
| #frame 10 | Page 2 of Process A |
| #frame 11 | Page 4 of Process A |
| #frame 12 | Page 1 of Process A |
| #frame 13 | Page 7 of Process A |
|  | … |

1. Consider paged virtual memory systems. Assume a page size of 256 bytes (28), and that processes in this system can have a maximum virtual address space of 64K bytes (216). The system is currently configured with 8K (213) bytes of physical memory. (12pts)
2. How many pages are in the virtual address space?
3. How many pages are in the physical address space?
4. A user process generates the virtual address 12,345 (0011000000111001 in binary). Explain how the system establishes the corresponding physical address assuming that the hardware memory management unit and transfer lookaside buffer (TLB) is used.
5. Consider a paged virtual memory system with a physical memory that can only contain 4 pages. Assume the execution of a program generates the following address trace

*a b c d d e f f b e*

where *a*, *b*, *c*, *d*, *e*, and *f* are the pages referenced and the page frames are initially empty. (16pts)

1. How many page faults occur with first-in-first-out Page Replacement?

|  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| *Time* | 1 2 | | 3 4 | | 5 6 | | 7 8 | | 9 10 | |
| *Request* | *a* | *b* | *c* | *d* | *d* | *e* | *f* | *f* | *b* | *e* |
|  |  |  |  |  |  |  |  |  |  |  |
|  |  |  |  |  |  |  |  |  |  |  |
|  |  |  |  |  |  |  |  |  |  |  |
|  |  |  |  |  |  |  |  |  |  |  |
| Fault? |  |  |  |  |  |  |  |  |  |  |

1. How many page faults occur with LRU Page Replacement?

|  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| *Time* | 1 2 | | 3 4 | | 5 6 | | 7 8 | | 9 10 | |
| *Request* | *a* | *b* | *c* | *d* | *d* | *e* | *f* | *f* | *b* | *e* |
|  |  |  |  |  |  |  |  |  |  |  |
|  |  |  |  |  |  |  |  |  |  |  |
|  |  |  |  |  |  |  |  |  |  |  |
|  |  |  |  |  |  |  |  |  |  |  |
| Fault? |  |  |  |  |  |  |  |  |  |  |