# $\begin{array}{c} \text{Kernel 3} \\ \text{CS452 - Spring 2014} \end{array}$

Real-Time Programming

### Team

Max Chen - mqchen mqchen@uwaterloo.ca

Ford Peprah - hkpeprah ford.peprah@uwaterloo.ca

 $\begin{array}{c} \text{Bill Cowan} \\ \text{University of Waterloo} \\ \textbf{Due Date: Monday}, \ 9^{th}, \ \text{June}, \ 2014 \end{array}$ 

## Table of Contents

| 1 | $\mathbf{Pro}$ | ogram Description         | 3 |
|---|----------------|---------------------------|---|
|   | 1.1            | Getting the Program       | 3 |
|   | 1.2            | Running the Program       | 3 |
|   | 1.3            | Command Prompt            | 4 |
| 2 | Ker            | rnel Structure            | 4 |
|   | 2.1            | Refactoring               | 4 |
|   | 2.2            | Hardware Interrupts       | 4 |
|   |                | 2.2.1 Context Switch      | 4 |
|   |                | 2.2.2 Handling Interrupts | 5 |
|   | 2.3            | ClockServer               | 6 |
|   |                | 2.3.1 Implementation      | 6 |
|   |                | 2.3.2 ClockNotifier       | 6 |
|   |                | 2.3.3 API                 | 6 |
|   | 2.4            | System Calls              | 7 |
|   |                | 2.4.1 AwaitEvent          | 7 |
|   |                | 2.4.2 WaitTid             | 7 |
| 3 | Use            | er Tasks and Output       | 7 |
|   | 3.1            | Program Output            | 7 |
|   | 3.2            | Explanation of Output     | 8 |
| 1 | МГ             | )5 Hashes                 | q |

## 1 Program Description

## 1.1 Getting the Program

To run the program, one must have read/write access to the source code, as well as the ability to make and run the program. Before attempting to run the pogram ensure that the following three conditions are met:

- You are currently logged in as one of cs452, mqchen, or hkpeprah.
- You have a directory in which to store the source code,
   e.g. ~/cs452\_microkern\_mqchen\_hkpeprah.
- You have a folder on the FTP server with your username, e.g. /u/cs452/tftp/ARM/cs452.

First, you must get a copy of the code. To to this, log into one of the aforementioned accounts and change directories to the directory you created above (using cd), then run one of

You will now have a working instance of our kernel2 source code in your current directory. To make the application and upload it to the FTP server at the location listed above (/u/cs452/tftp/ARM/YOUR\_USERNAME), run make upload.

## 1.2 Running the Program

To run the application, you need to load it into the RedBoot terminal. Ensure you've followed the steps listed above in the "Getting the Program" settings to ensure you have the correct directories and account set up. Navigate to the directory in which you cloned the source code and run make upload. The uploaded code should now be located at

```
/u/cs452/tftp/ARM/YOUR_USERNAME/assn3.elf
```

To run the application, go to the RedBoot terminal and run the command

```
load -b 0x00218000 -h 10.15.167.4 ''ARM/YOUR_USERNAME/assn3.elf''; go
```

The application should now begin by running through the game tasks before reaching a prompt. The generated files will be located in DIR/build where DIR is the directory you created in the earlier steps. To access and download an existing version of the code, those can be found at /u/cs452/tftp/ARM/mqchen/assn3.elf and /u/cs452/tftp/ARM/hkpeprah/assn3.elf.

## 1.3 Command Prompt

After the startup tasks have finished running, the user will reach a command prompt, where they will be able to enter commands. The following commands are supported:

| Command  | Description                               |  |
|----------|-------------------------------------------|--|
| q / quit | Exit the prompt and shut down the kernel. |  |
| rps      | Start a Rock-Paper-Scissors game.         |  |
| sl       | Steam locomotive (train).                 |  |

## 2 Kernel Structure

## 2.1 Refactoring

Send-Receive-Reply was modified to better handle the Receive-Before-Send case. In the last kernel submission, our Receive implementation did not store the user supplied parameters when no message was available, and simply had a second call if the first were to fail. This incurred an overhead of an extra context switch. The issue was fixed in this version of the kernel, and the receive parameters are now stored and Send directly copies into them if they are present.

In addition, the way a user task's result is stored was changed. Since hardware interrupts require scratch registers to be saved, the user task result must now be saved on to its stack. This is done by writing the result into the location where r0 is stored on the user stack based on its SP. This method also saves us a word in our task descriptor since the result is no longer stored there.

## 2.2 Hardware Interrupts

The clock interrupt is enabled by:

- 1. Loading 50800 into the clock load register (interrupt every 1/10 second), then starting the clock in 508 KHz, pre-loaded mode
- 2. Enable the clock interrupt interrupt 51 by setting bit 19 of the VIC2 enable register (32-63 in VIC2, 51-32 = 19)

#### 2.2.1 Context Switch

The updated context switch which supports the handling of hardware interrupts now saves the scratch registers. That is, r0-r12, lr (r13 = sp, r15 = pc) are now pushed and popped from the user stack. There is a separate handler for IRQ and SWI that do some slightly different things, but share a good portion of code that is re-used through assembly macros. The first step in both handlers is to save the user state:

- 1. Switch to SYS mode with interrupts disabled (write 0x9F to CPSR).
- 2. Save r0-r12, lr to the user stack, move the SP to r0.
- 3. Switch back to the service mode (SWI/IRQ).
- 4. Push SPSR, LR of the respective service mode to r0 (user stack).

IRQ Only: The IRQ handler subtracts 4 from the LR when storing it to the user stack (interrupted instruction instead of next instruction).

SWI Only: THe SWI handler pops the top of the kernel stack as an address and writes r1 into it. This is the method of passing the arguments (a struct on the user stack) of the sys call to the kernel, as described in K1.

After the user state is saved, the processor is returned to the kernel by popping off the stored kernel registers from its stack. Since the kernel starts in SVC mode, the IRQ handler must first switch to SVC mode before making the pop from the stack. Both handlers will enter the kernel in the exact same way, returning the task's SP (since it's in r0) as the result of the swi\_exit call.

swi\_exit has the same function as it did in K1, switching from the kernel back to the user task. The main difference is that now all registers are restored, and the result is no longer passed in as a parameter to be assigned to r0 (it should already be stored as r0 on the user stack). In addition, the LR received from SVC/IRQ is now stored on the user stack and returned to, instead of the short-cut taken in K1 where the kernel would immediate return to the user's LR. While the shortcut was okay for SWI (the swi\_call function stub meant that the LR for each SWI call would point to the same instruction - mov pc, lr), it would not work for interrupts since the interrupted task could have been executing at any point.

#### 2.2.2 Handling Interrupts

We differentiate between a hardware and software interrupt by the Arg\_t structure passed to the output parameter of the swi\_exit. The default value has request code "INTERRUPT", so in the case of a swi the request code will be overwritten whereas a hardware interrupt will not modify it. In addition, in the case of hardware interrupts, the result is not saved to the stack of the interrupted task.

An interrupt is handled by reading the VIC status registers and the interrupt with the highest priority is determined (done in software by the order in which the interrupts are checked). If a task is blocked on the interrupt, the result of the interrupt is written to its stack and it is then rescheduled.

#### 2.3 ClockServer

#### 2.3.1 Implementation

The ClockServer registers with the NameServer then creates the ClockNotifier which handles processing of timer generated interrupts. We use the 508kHz 32-bit timer. It then writes 5080 to the timer load register; using the 508kHz timer and with 10 milliseconds, this corresponds to 5080 in the timer load register. It then writes to the timer control register to enable it, to set the mode to periodic so that the count resumes back at 5080 after counting down, and to set the timer to the 508kHz timer. The ClockServer then blocks on Receive to handle messages from other tasks. When the ClockServer recieves a message of type Delay or DelayUntil, it adds the task to its delay queue sorted by the number of ticks that the task is waiting; the array is sorted in ascending order. When it receives a message of type Tick it increments its internal counter, replies to the sending task immediately, then iterates through its delay queue waking up every task with a delay less than the current count by replying to them, and stopping as soon as it reaches a task with a delay greater than its current tick count. This ensures that we do not needlessly check tasks that will not wake up as the queue is sorted.

#### 2.3.2 ClockNotifier

The ClockNotifier is responsible for notifying the ClockServer when a tick occurs; a tick is defined as ten milliseconds passing on the 32-bit hardware timer. Since the 508kHz clock is used, ten milliseconds is equivalent to setting the value of the clock to  $508 \cdot 10 = 5080$  upon which it will countdown to 0 then generate an interrupt. The ClockNotifier blocks on the timer interrupt and on return it sends a message to the ClockServer indicating a tick took place, which the ClockServer immediately replies to, and then the ClockNotifier blocks on AwaitEvent again. This task never exits unless the system is shutting down.

#### 2.3.3 API

| Method     | Prototype                            | Description                                    |
|------------|--------------------------------------|------------------------------------------------|
| Time       | int Time()                           | Sends a message to the ClockServer to get      |
|            |                                      | the current tick count.                        |
| Delay      | int Delay(int ticks)                 | Sends a message to the ClockServer to block    |
|            |                                      | the current task until number of ticks passed. |
| DelayUntil | <pre>int DelayUntil(int ticks)</pre> | Sends a message to the ClockServer to block    |
|            |                                      | the current task until the time ticks has been |
|            |                                      | reached.                                       |

## 2.4 System Calls

| System Call | Prototype                                | Description                           |
|-------------|------------------------------------------|---------------------------------------|
| AwaitEvent  | <pre>int AwaitEvent(int eventType)</pre> | Blocks until the event identified oc- |
|             |                                          | curs then returns.                    |
| WaitTid     | <pre>int WaitTid(unsigned int tid)</pre> | Waits until the task specified by the |
|             |                                          | tid exits, then returns.              |

#### 2.4.1 AwaitEvent

AwaitEvent blocks until the event identified by the passed integer, eventType, occurs as an interrupt then returns with the value generated by the interrupt. The value is non-zero. In the event that the passed integer is not a valid event, it returns -1 or if the queues are full it returns -2. Since we do not use event buffers, the previous correspondence for 0, -2 and -3 are irrelevant to our implementation.

#### 2.4.2 WaitTid

WaitTid blocks on the wait queue of the specified task and returns when that task exists with the status of the exit. Returns -1 if the task does not exist.

## 3 User Tasks and Output

## 3.1 Program Output

Our kernel implements larger number as being a higher priority, but gives the same values to the requesting tasks based on their priority.

| Priority (larger is higher) | Delay Times (tics) | Number of Delays | Task ID |
|-----------------------------|--------------------|------------------|---------|
| 3                           | 10                 | 20               | 4       |
| 4                           | 23                 | 9                | 5       |
| 5                           | 33                 | 6                | 6       |
| 6                           | 71                 | 3                | 7       |

where the last column gives the id of the task that was created with those parameters for reference in the output. Then, the output is:

```
Tid: 4 Delay Interval: 10 Delays Complete: 1
Tid: 4 Delay Interval: 10 Delays Complete: 2
Tid: 5 Delay Interval: 23 Delays Complete: 1
Tid: 4 Delay Interval: 10 Delays Complete: 3
Tid: 6 Delay Interval: 33 Delays Complete: 1
```

```
Tid: 4
            Delay Interval: 10
                                     Delays Complete: 4
Tid: 5
            Delay Interval: 23
                                     Delays Complete:
Tid: 4
            Delay
                   Interval: 10
                                     Delays
                                            Complete:
Tid:\ 4
            Delay Interval: 10
                                     Delays
                                            Complete:
Tid: 6
            Delay Interval: 33
                                     Delays Complete:
Tid: 5
            Delay Interval: 23
                                     Delays Complete: 3
Tid: 4
            Delay Interval: 10
                                     Delays Complete:
Tid: 7
                                     Delays
            Delay
                   Interval:
                             71
                                            Complete:
Tid: 4
            Delay
                  Interval: 10
                                     Delays
                                            Complete:
Tid: 4
            Delay Interval: 10
                                     Delays Complete: 9
Tid: 5
            Delay Interval: 23
                                     Delays Complete: 4
                                     Delays
                                            Complete: 3
Tid: 6
            Delay Interval: 33
Tid: 4
            Delay
                   Interval:
                                     Delays
                                            Complete:
Tid: 4
                                     Delays
            Delay Interval: 10
                                            Complete: 11
Tid: 5
            Delay Interval: 23
                                     Delays
                                            Complete: 5
Tid: 4
            Delay Interval: 10
                                     Delays Complete: 12
Tid: 4
            Delay Interval: 10
                                     Delays
                                            Complete: 13
Tid: 6
            Delay
                   Interval:
                                     Delays
                                             Complete:
                                            Complete: 6
Tid: 5
            Delay Interval:
                                     Delays
                             23
                                            Complete: 14
Tid: 4
            Delay Interval: 10
                                     Delays
Tid: 7
                                            Complete: 2
            Delay Interval: 71
                                     Delays
Tid: 4
            Delay
                   Interval: 10
                                     Delays
                                            Complete: 15
Tid: 4
            Delay
                   Interval:
                             10
                                     Delays
                                            Complete:
Tid: 5
            Delay Interval: 23
                                     Delays Complete:
Tid: 6
            Delay Interval: 33
                                     Delays
                                            Complete: 5
Tid: 4
            Delay Interval: 10
                                     Delays
                                            Complete: 17
Tid: 4
            Delay
                  Interval: 10
                                     Delays
                                            Complete: 18
Tid: 5
            Delay Interval:
                             23
                                     Delays Complete: 8
Tid: 4
            Delay Interval: 10
                                     Delays Complete: 19
Tid: 6
            Delay Interval: 33
                                     Delays Complete: 6
Tid: 4
            Delay Interval: 10
                                     Delays Complete: 20
Tid: 5
                   Interval:
                                            Complete:
            Delay
                                     Delays
Tid: 7
            Delay Interval: 71
                                     Delays Complete: 3
```

## 3.2 Explanation of Output

In the context of this description, we define "waking up" as "unblocking and printing." We get this output because task 4 (priority 3) has the lowest delay interval, so it wakes up twice before the next task with the lowest delay interval (task 5) wakes up as 10 \* 2 = 20 < 23. Now, after that task wakes up, task 4 wakes up a third time before task 6 wakes up as 10 \* 3 = 30 < 33. This process repeats a second time before task 4 wakes up for a seventh time, followed by task 7 waking up for the first time as it delays for 7 ticks. Task 6 is the first to finish as it has the lowest total delay time;  $6 \cdot 33 = 198$  ticks. Then task 4 with  $10 \cdot 20 = 200$  ticks, then task 5 with  $23 \cdot 9 = 207$  ticks followed lastly by task 7 with  $3 \cdot 71 = 213$  ticks. The output corresponds to each task delaying the number of ticks it was given before waking up and delaying again until its completed the required number of delays it was passed. When a task delayed, it could only be woken up by a subsequent tick, which for tasks with small delays, meant they would wake up faster as they would be waiting for a smaller amount of time.

## 4 MD5 Hashes

Source files can be accessed at either /u7/mqchen/cs452/cs452-microkern or /u8/hkpeprah/cs452-microkern on the kernel3 branch. The listing of all the fields being submitted alongisde their MD5 hashes and locations are as follows:

```
8\,afa\,0\,4fd\,4ff1\,2\,bc\,4\,8\,3\,2\,7\,1\,2\,8\,6\,d\,5\,2\,dfa\,0\,0
                                      /u8/hkpeprah/cs452-microkern/bin/cs452-upload.sh
de 6700 ffc 18 bb 2 c8 f15 a491 fc 2929 d13\\
                                      /u8/hkpeprah/cs452-microkern/bin/md5.sh
7d3d938f3360ca46d07b07d6fed3711c
                                      /u8/hkpeprah/cs452-microkern/bin/profiler.sh
40e6f5862869392d9733ea2d6defbb68
                                      /u8/hkpeprah/cs452-microkern/include/bwio.h
045\,d5\,b074001cbda021d83b6a76a0a84
                                      /u8/hkpeprah/cs452-microkern/include/mem.h
19fbec18bdcd2bf2169e51577153fbcf
                                      /u8/hkpeprah/cs452-microkern/include/string.h
1 \, f \, 456 \, f \, b \, 5 \, d \, 8193 \, d \, 296733616383 \, ad \, a5f6
                                      /u8/hkpeprah/cs452-microkern/include/syscall.h
                                      /u8/hkpeprah/cs452-microkern/include/task.h
3f3a66bb146ae1bcdcd7e194ecd0f5a9
                                      /u8/hkpeprah/cs452-microkern/include/ts7200.h
a6ab9325f4bb9e615962ff8e3422098b\\
                                      /u8/hkpeprah/cs452-microkern/include/types.h
fb 245 f779 f7a 1935b 87d86 ee 8e 1061e 5\\
                                      /u8/hkpeprah/cs452-microkern/include/vargs.h
d898edf77661ac9a98e2a0c6b9d9b9a6
65e88dcab3d6f99999831e5be89f596
                                      /u8/hkpeprah/cs452-microkern/include/utasks.h
                                      /u8/hkpeprah/cs452-microkern/include/k_syscall.h
f8390c0c32480d90c8e3479c7045674b
0\,d4f70ea95299ed99ce1169d4d19fdda
                                      /u8/hkpeprah/cs452-microkern/include/stdio.h
                                      /u8/hkpeprah/cs452-microkern/include/stdlib.h
c1b25aa7cdcd789673ce0166ed95bd92\\
169383\,b706f2cff669bc8d9390f0bb8e
                                      /u8/hkpeprah/cs452-microkern/include/term.h
c529 af6 c9 c63118404 a561 a9048 f1 bd7\\
                                      /u8/hkpeprah/cs452-microkern/include/syscall_types.h
                                      /u8/hkpeprah/cs452-microkern/include/kernel.h
1624fa508a85025bed31a50a05048f6d
02 da5c5ed64ddf5a5ff08090e1d407cf
                                      /u8/hkpeprah/cs452-microkern/include/hash.h
717\,b31cadb3b90f022dc0690530d1aab
                                      /u8/hkpeprah/cs452-microkern/include/perf_test.h
                                      /u8/hkpeprah/cs452-microkern/include/random.h
65\,b51124e5ee634ebdbba3664aa22a63
133\,db 29a 949a 80cbe 5438515640bd 6e 6
                                      /u8/hkpeprah/cs452-microkern/include/server.h
c5503502f0497c9931e111a27b1ab610\\
                                      /u8/hkpeprah/cs452-microkern/include/shell.h
973c15973ba0d9c7d687b936cc1010da
                                      /u8/hkpeprah/cs452-microkern/include/clock.h
                                      /u8/hkpeprah/cs452-microkern/include/util.h
5\,ab\,49\,ee\,ad\,b\,9ffc\,8f6\,f5\,7\,4d\,7\,3\,0\,4\,4\,4\,cc\,9\,e
                                      /u8/hkpeprah/cs452-microkern/include/interrupt.h
6c6274cf00b0ee960923ebb1d2090fc4
7ff8faa9d929453fa8cd82d0e7c32b19
                                      /u8/hkpeprah/cs452-microkern/include/rps.h
54\,d2cc2d24f83d270517040f70bac478
                                      /u8/hkpeprah/cs452-microkern/include/sl.h
8402\,c682ff31\,b15549689\,baa00\,ea45\,b6
                                      /u8/hkpeprah/cs452-microkern/lib/libbwio.a
8ea45f8c76ecbbe71b0308879535c3fc
                                      /u8/hkpeprah/cs452-microkern/Makefile
0\,f6\,0\,a\,5\,9\,fd\,d\,1\,e\,9\,a\,6\,c\,b\,4\,e\,c\,4\,a\,a\,0\,1\,a\,c\,c\,5\,4\,a\,d
                                      /u8/hkpeprah/cs452-microkern/src/mem.c
e7bf3de21aa43bee5557e8f9bdb31017
                                      /u8/hkpeprah/cs452-microkern/src/orex.ld
4a4a162d7de498e89ab976f72601e7ae
                                      /u8/hkpeprah/cs452-microkern/src/string.c
                                      /u8/hkpeprah/cs452-microkern/src/syscall.c
ff61eb944892f1b117601b42d04d6c83
e840a4d99efecc572eb49c0ab0542c7a
                                      /u8/hkpeprah/cs452-microkern/src/task.c
9\,df1fc9eb3ba0decc600443d441c2210
                                      /u8/hkpeprah/cs452-microkern/src/main.c
033f303c456708c4a5b037f10028e819
                                      /u8/hkpeprah/cs452-microkern/src/k_syscall.c
                                      /u8/hkpeprah/cs452-microkern/src/stdio.c
b 59c 60300 d76a7443e5e4260f70a8232\\
                                      /u8/hkpeprah/cs452-microkern/src/stdlib.c
5e4ef8a626c50e440f460ac0e76d9289
                                      /u8/hkpeprah/cs452-microkern/src/kernel.c
64 \, \mathrm{bab} \, 36 \, \mathrm{f} \, 05082 \, \mathrm{c} \, 56462555 \, \mathrm{db} \, 07 \, \mathrm{e} \, 54 \, \mathrm{c} \, 54
dca68802d19d0962895e0b075cada755
                                      /u8/hkpeprah/cs452-microkern/src/term.c
                                      /u8/hkpeprah/cs452-microkern/src/hash.c
fb02e0ba097afaabae2927e17634270b\\
6719\, f648 f05 c9756 ea0 fe244 a3 f95 e7 e
                                      /u8/hkpeprah/cs452-microkern/src/random.c
                                      /u8/hkpeprah/cs452-microkern/src/server.c
290 ebc756eabc7f201aea21263bd619d
d30a863be1a6f1d2508ee6b5aa239cb3\\
                                      /u8/hkpeprah/cs452-microkern/src/tasks/perf_test.c
59768\,ce3a6a9849fc294c291e7b7de3d
                                      /u8/hkpeprah/cs452-microkern/src/tasks/shell.c
80f459cfef61c3c7f8c078234260c00b
                                      /u8/hkpeprah/cs452-microkern/src/tasks/utasks.c
26e3d424666d0757760ce42c5756d744
                                      /u8/hkpeprah/cs452-microkern/src/tasks/rps.c
                                      /u8/hkpeprah/cs452-microkern/src/tasks/sl.c
afddb68a12c7ba57a26799498c0a22e9
                                      /u8/hkpeprah/cs452-microkern/src/clock.c
28\,ab7fdf77fe41a096c02058981bc246
aa2fafa50629f75cdd3ed0045c90bae3\\
                                      /u8/hkpeprah/cs452-microkern/src/interrupt.c
0c9c7e1968f6fa419d4342138e197149
                                      /u8/hkpeprah/cs452-microkern/tests/tasks1.c
f1f8c6fd425032444162c32949728fee\\
                                      /u8/hkpeprah/cs452-microkern/tests/tasks2.c
                                      /u8/hkpeprah/cs452-microkern/tests/tasks3.c
8ee655142dc66971e1479e506155655b
```