# 上 海 交 通 大 学 试 卷(<u>B</u>卷)

( 2017 至 2018 学年 第 2 学期 )

|    | 班级号            |               | 学号 |      | 姓名 |  |
|----|----------------|---------------|----|------|----|--|
|    |                | 计算机系统基础(2)    |    |      |    |  |
|    |                |               |    |      |    |  |
| Pr | oblem 1: CPU S | Scheduling    |    |      |    |  |
| 1. | [1]            | [2]           |    |      |    |  |
|    | [3]            | [4]           |    |      |    |  |
|    | [5]            | [6]           |    |      |    |  |
| 2. | 1)             |               |    |      |    |  |
|    |                |               |    |      |    |  |
|    | 2)             |               |    |      |    |  |
|    | ,              |               |    |      |    |  |
| Pr | oblem 2: Repla | cement Policy |    |      |    |  |
| 1. | [1]            |               | 2. | [6]  |    |  |
|    | [2]            |               |    | [7]  |    |  |
|    | [3]            |               |    | [8]  |    |  |
|    | [4]            |               |    | [9]  |    |  |
|    | [5]            |               |    | [10] |    |  |
| 2  |                |               |    |      |    |  |

3.

我承诺,我将严 格遵守考试纪律。

承诺人:\_\_\_\_\_

| 题号                 | 1 | 2 | 3 | 4 | 5 |  |  |
|--------------------|---|---|---|---|---|--|--|
| 得分                 |   |   |   |   |   |  |  |
| 批阅人(流水阅<br>卷教师签名处) |   |   |   |   |   |  |  |

### **Problem 3: Address Translation**

1 [1]

[2]

[3]

[4]

[5]

[6]

- 2 [1]
- [2] [3] [4]
- [5]

- [6]
- [7]
- [8] [9]
- [10]

### **Problem 4: Concurrency**

1

2

1.

2.

3.

4

### Problem 6: Lock

# Problem 1: Scheduling (20 points)

1. We have following jobs in the workload. No I/O issues are involved.

| Job | Arrival Time | Run time |
|-----|--------------|----------|
| A   | 0ms          | 6ms      |
| В   | 2ms          | 2ms      |
| С   | 3ms          | 3ms      |
| D   | 6ms          | 3ms      |

- ♦ When a job arrives, it is added to the tail of the work queue.
- ♦ CPU picks job to run after all queue operations.
- → The MLFQ policy has 2 priority queues, higher one with time-slice of 1ms and lower one with time-slice of 2ms. We use RR in each queue. Priority boost isn't supported.
- ♦ No preemption in MLFQ.
- ♦ We do RR by moving the recently executed task to the end of the queue.
- ♦ The priority of operations is RR movement > accepting new job.

Please calculate the **average** response time and **average** turnaround time for different scheduling policies. (2' \* 6)

| Scheduling Policy | Turnaround Time | Response time |
|-------------------|-----------------|---------------|
| FIFO              | [1]             | [2]           |
| STCF              | [3]             | [4]           |
| MLFQ              | [5]             | [6]           |

- 2. We decide to use **MQMS** on a machine with CPU 0 and 1. Each CPU has a scheduling queue.
  - ♦ There are no I/O issues involved.
  - ♦ Each queue uses RR policy with time-slice of 1ms.
  - ♦ We do RR by moving the recently executed task to the end of the queue.
  - ♦ New job will be added to the tail of the queue with less jobs. (If equal, to queue 0)
  - ♦ During work stealing, each queue peek at another. If that queue has more than 2 jobs, it will steal the last job and put it at the end of itself.
  - ♦ The priority of operations is RR movement > accepting new job.
  - ♦ CPU picks job to run after all queue operations.

Given the execution of CPUs, time 0 means the task running during [0ms,1ms)

| Time | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 |
|------|---|---|---|---|---|---|---|---|---|---|----|----|----|----|
| CPU0 | D | D | E | D | E | F | D | E | D | D | D  | D  | D  | C  |
| CPU1 | A | В | A | В | A | В | A | В | F | С | F  | С  | G  | F  |

- 1) Please tell the arrival time of job A, B, F. (2' \* 3)
- 2) What's the frequency of work stealing? (2')

# Problem 2: Replacement Policy (18 points)

Assume we have a primary device with 3 physical blocks, please complete the following questions.

1. Suppose we are using LRU replacement policy, please complete the following table. (**NOTE:** you do not need to consider the order of primary device contents) (1.5'\*5)

| Reference<br>String | 4 | 3 | 2 | 1 | 3   | 4   | 2   | 4   | 1   |
|---------------------|---|---|---|---|-----|-----|-----|-----|-----|
| Primary             | 4 | 4 | 4 | 3 | [1] | [2] | [3] | [4] | [5] |
| Device              |   | 3 | 3 | 2 |     |     |     |     |     |
| Contents            |   |   | 2 | 1 |     |     |     |     |     |

- 2. Suppose we are using clock algorithm with rules:
  - ♦ The clock pointer points to position 0 initially.
  - ♦ We do not reset the clock pointer after we have found an evict page, so next time we start from the position after previous victim.
  - ♦ Each page brought in were set as accessed initially.

Please complete the following table. (**NOTE:** use "\*" to represent current clock pointer) (1.5'\*5).

| Reference<br>String           | 4   | 3           | 2            | 1            | 3   | 4   | 2   | 4   | 1    |
|-------------------------------|-----|-------------|--------------|--------------|-----|-----|-----|-----|------|
| Primary<br>Device<br>Contents | 4 * | 4<br>3<br>* | 4*<br>3<br>2 | 1<br>3*<br>2 | [6] | [7] | [8] | [9] | [10] |

3. Why do we use clock algorithm instead of directly implement LRU policy in most realistic systems? Please give your reason. (3')

# **Problem 3: Address Translation (21 points)**

Assume we have a machine with the following specifications:

- ♦ The memory is byte-addressable
- ♦ 64KB physical memory space
- ♦ 1MB virtual memory space
- ♦ Each page is 256B
- ♦ The size of one page table equals to the size of page
- ♦ length of each PTE is 16B
- ♦ 8 entries, 2-way associative TLB
- ♦ LRU replacement policy in TLB
- ♦ Each L1 cache line is 4B
- ♦ 8 entries, 2-way associative L1 cache
- 1. Please fill the following table. (1' \* 6)

| The VPO bits                             | [1] |
|------------------------------------------|-----|
| The VPN-1 bits                           | [2] |
| The TLB tag bits                         | [3] |
| The number of PTE in one page table      | [4] |
| The number of page table level           | [5] |
| The maximum size of the whole page table | [6] |

2. Given the following page table contents and cache/TLB state, finish the following address translation. (1.5'\*10)

NOTE: Accesses are independent, which means they won't affect the TLB and cache state in the next access.

| VPN | Addr  | Valid |
|-----|-------|-------|
| 0   | fc00  | 1     |
|     | • • • | • • • |
| a   | 3d00  | 1     |

Part of L1 page table

VPN 2

| VPN | Addr          | Valid |
|-----|---------------|-------|
| 3   | 9 <b>f</b> 00 | 1     |
| f   | 3d00          | 1     |

| РΨ | @0xfc00 | ) |
|----|---------|---|
|    | COALCO  | , |

| Addr | Valid |
|------|-------|
| 3e00 | 1     |
| d900 | 1     |

PT @0x9f00

| VPN | Addr          | Valid |
|-----|---------------|-------|
| 0   | 9400          | 1     |
| 1   | 9500          | 1     |
|     | • • •         | • • • |
| b   | 9 <b>f</b> 00 | 0     |
|     | • • •         | • • • |
| f   | a300          | 1     |
|     |               |       |

PT @0x3d00

| Set       | Valid | Tag  | PPN | Valid | Tag  | PPN |
|-----------|-------|------|-----|-------|------|-----|
| 0         | 1     | 0039 | 08  | 1     | 020a | ea  |
| 1         | 1     | 00b6 | 20  | 0     | 02f3 | a7  |
| 2         | 0     | 02ac | 3e  | 1     | 01b2 | 37  |
| 3         | 0     | 000e | d9  | 0     | 00de | 2b  |
| TLB state |       |      |     |       |      |     |

| Set         | Valid | Tag | bytes | Valid | Tag | bytes |
|-------------|-------|-----|-------|-------|-----|-------|
| 0           | 0     | 2c4 |       | 1     | fb7 |       |
| 1           | 1     | ebf |       | 1     | f2b |       |
| 2           | 1     | a52 |       | 0     | 663 |       |
| 3           | 0     | d91 |       | 1     | 3b1 |       |
| cache state |       |     |       |       |     |       |

| Parameter                               | Value  |
|-----------------------------------------|--------|
| Virtual Address                         | 0x3b1c |
| TLB Hit? (Y/N)                          | [1]    |
| Number of Memory Accesses to Page Table | [2]    |
| Page Fault? (Y/N)                       | [3]    |
| Physical Address                        | [4]    |
| Cache Hit? (Y/N)                        | [5]    |

| Parameter                               | Value   |
|-----------------------------------------|---------|
| Virtual Address                         | 0xab235 |
| TLB Hit? (Y/N)                          | [6]     |
| Number of Memory Accesses to Page Table | [7]     |
| Page Fault? (Y/N)                       | [8]     |
| Physical Address                        | [9]     |
| Cache Hit? (Y/N)                        | [10]    |

# **Problem 4: Concurrency (12 points)**

Deadlock is a problem in concurrent programs. Please consider the below execution flow.

| Initially: a=1, b=1, c=1 |          |          |  |  |
|--------------------------|----------|----------|--|--|
| Thread                   | Thread 1 | Thread 2 |  |  |
| Step1                    | P(B)     | P(C)     |  |  |
| Step2                    | P (A)    | P(B)     |  |  |
| Step3                    | P(C)     | V(C)     |  |  |
| Step4                    | V(C)     | P (A)    |  |  |
| Step5                    | V (B)    | V (A)    |  |  |
| Step6                    | V(A)     | V(B)     |  |  |

- 1. Does it cause deadlock? (2') Please draw progress graph and **explain why** base on the graph. (6')
- 2. If we change the order of **two steps with operation P** in Thread 2, the deadlock will be erased. What are the two steps? (4')

# **Problem 5: Memory Mapping (16 points)**

```
... // include headers
 2
   int main() {
       int fd1 = open("a.txt", O_RDWR);
 3
       int fd2 = open("b.txt", O RDWR);
 4
       char *a = mmap(NULL, 4096, PROT READ | PROT WRITE,
 5
                       MAP SHARED, fd1, 0);
 6
 7
       a[0] = 4;
 8
       char *b = mmap(NULL, 4096, PROT READ | PROT WRITE |
 9
                       PROT EXEC, MAP PRIVATE, fd2, 0);
10
11
12
      b[0] += 4;
13
       if (fork() == 0) {
14
15
          a[0] = 1;
          b[0]--;
16
17
       }
18
19
       printf("a[0] = %d\n", a[0]);
20
       printf("b[0] = %d\n", b[0]);
       return 0;
21
22 }
```

#### Assumption:

- ♦ The program runs on an Intel x86 64 Linux system.
- ♦ Before the program executes, a.txt and b.txt are both 4096B long and filled with zero.
- ♦ After fork(), the child process is always executed first.
- ♦ No preemption, so only after the child process exits, the parent process will run.
- → mmap () in line 5 returns 0x1ffc0000000, and before line 5 the L3/L4 page table
  of this address doesn't exist.
- → mmap() in line 9 returns 0x1ffc0200000.
- 1. Given the page table of the process after line 8, please draw the page table of **CHILD** process before it is executed. (6')

NOTE: You should mark the permissions of the physical page as the given figure does.



- 2. Show **ALL** the code location where the private CoW page is copied. (2')
- 3. Please give the output of the program. (4')
- 4. After all the processes exit, what's the content of file a.txt and b.txt. (4')

# Problem 6: Lock (13 points)

```
typedef struct   node t {
1.
    node t* next;
2.
tid t tid;
4. } node t;
5.
6. typedef struct _ lock_t {
7.
         node_t* head;
8. } lock t;
9.
10. void lock init(lock t* lock) {
11.
         lock->head = NULL;
12. }
13. /* each caller should alloc and hold a node by itself */
14. void lock(lock t* lock, node t* node) {
         if (lock == NULL || node == NULL) {
16.
            /* output error message... */
17.
            exit(-1);
18.
        }
       node->next = NULL;
19.
       node->tid = gettid();
node_t* old = test_and_set(&lock->head, node);
if (old == NULL) return;
20.
21.
22.
        old->next = node;
23.
24.
        park();
25. }
26.
27. void unlock(lock t* lock, node t* node) {
         if (lock == NULL | | node == NULL) {
            /* output error message... */
29.
30.
            exit(-1);
31.
32.
        if (node->next == NULL) {
33.
            if (compare and swap([1]_, [2]_, [3]_)) {
34.
                return;
35.
           while(node->next == NULL)
36.
37.
                continue;
38.
        }
39.
        unpark(node->next->tid);
40.}
```

Above code is an implementation of a kind of lock called MCS. Each <code>lock\_t</code> structure represents a lock. Each thread who wants a lock will maintain a <code>node\_t</code> structure. Nodes are connected through their arriving order. The head of lock will always point to the newest node. <code>Test\_and\_set</code> and <code>compare\_and\_swap</code> operations are <code>atomic</code>. <code>DO NOT</code> consider the CPU may reorder the instructions.

- compare\_and\_swap(a, b, c) will set a's value to c and return 1 if and only if a's old value equals b, otherwise return 0 and do nothing. Fill the blanks in the code.
   Hint: consider the situation that another thread acquires the same lock between line 32 and 33. Your code is used to keep the lock running correctly under this situation.
   (3')
- 2. There are two problems in this implementation. Figure them out and try to fix them **if possible**. **Hint:** Consider about **correctness** and **security**. (4')
- 3. Why do we need line **36** and **37**? (2')
- 4. Analyze the lock about its **fairness** and **performance**. (4')