# Computer Architecture (ECS 154A) Study Guide

# 1 Boolean Algebra

### 1.1 Operations

| Operation | Symbol   | Example      |
|-----------|----------|--------------|
| NOT       | _        | $\bar{A}$    |
|           | !        | !A           |
|           | _ ¬      | $\neg A$     |
|           | $\sim$   | $\sim A$     |
| AND       | Λ        | $A \wedge B$ |
|           | *        | A * B        |
|           |          | AB           |
| OR        | V        | $A \lor B$   |
|           | +        | A + B        |
| XOR       | $\oplus$ | $A \oplus B$ |

### 1.2 Equivalence Laws

| Name         | OR Version                             | AND Version                            |
|--------------|----------------------------------------|----------------------------------------|
| Commutative  | A + B = B + A                          | A * B = B * A                          |
| Associative  | (A+B) + C = A + (B+C)                  | (A*B)*C = A*(B*C)                      |
| Distributive | A + (B * C) = (A + B) * (A + C)        | A * (B + C) = (A * B) + (A * C)        |
| Identity     | A + 0 = A                              | A*1=A                                  |
| Annulment    | A+1=1                                  | A*0=0                                  |
| Idempotent   | A + A = A                              | A * A = A                              |
| Complement   | $A + \bar{A} = 1$                      | $A*\bar{A}=0$                          |
| De Morgan's  | $\overline{(A+B)} = \bar{A} * \bar{B}$ | $\overline{(A*B)} = \bar{A} + \bar{B}$ |

# 1.3 Truth Table

Truth tables are mathematical tables composing of every combination of inputs and the resulting function. The number of combinations or rows in the truth table is  $2^N$ , where N is the number of inputs.

## 1.3.1 NOT

$$\begin{array}{c|cccc}
A & f(A) = !A \\
\hline
0 & 1 \\
1 & 0
\end{array}$$

## 1.3.2 AND

| A | В | f(A,B) = A * B |
|---|---|----------------|
| 0 | 0 | 0              |
| 0 | 1 | 0              |
| 1 | 0 | 0              |
| 1 | 1 | 1              |

## 1.3.3 OR

| A | В | f(A,B) = A + B |
|---|---|----------------|
| 0 | 0 | 0              |
| 0 | 1 | 1              |
| 1 | 0 | 1              |
| 1 | 1 | 1              |

# 1.3.4 XOR

- 1.4 Canonical Normal Form
- 1.5 Karnaugh Maps
- 1.6 Quine-McClucksey Algorithm
- 2 Combinational Logic Circuits
- 2.1 Gates
- 2.2 Timing Diagrams
- 2.3 Multiplexers, Decoders, Shifters
- 2.4 Adders and Subtracters
- 2.5 Designing Combinational Logic Circuits
- 3 Finite State Automata
- 3.1 Moore Model
- 3.2 Mealy Model
- 4 Sequential Logic Circuit
- 4.1 Latches
- 4.2 Flip Flops
- 4.3 Registers and Counters
- 4.4 Designing Sequential Logic Circuits
- 5 Single Cycle CPU Design
- 6 Cache

Effectiveness is based on the concept of information reuse: temporal locality and spatial locality.

### 6.1 Memory Hierarchy

Goal: Make memory perform as if it was made of the most expensive and fastest type, but cost as if made of the cheapest type.

- 1. Fast, Hot, Expensive
- 2. Static RAM
- 3. Dynamic RAM
- 4. Disk

A cache is smaller than main memory and is composed of numerous cache lines. Cache lines consist of a dirty bit, a tag, and block(s) of data. If the dirty bit is on, then it signals the CPU to write the data from this cache line into main memory when this cache line is freed. Caches also contain a valid bit which signifies whether there is loaded data into the cache — imagine starting up the computer for the first time, the cache is going to be empty. The issue is that even when a cache is empty (bits are all 0), it still holds some signal. As we continue, when we mention the size of a cache line, we refer to the size of the data blocks in the cache line only (excluding flags and tag).

| ı | T21 .  | TD .  | D1 1/\   |
|---|--------|-------|----------|
| ı | r iags | l lag | Block(S) |
| ı | - 1000 |       | Dioon(b) |

### 6.2 Direct Mapped Cache

A given address is partitioned into three components: Tag, line number, and offset. The line number directly accesses a specific cache line. It then compares the tag partitioned from the address to the tag stored in the cache line. If the tags match, then it is a cache hit. Otherwise, it becomes a cache miss. Assuming that it was a cache hit, it then proceeds to use the offset to select which block to read or write. The block of data in the cache line can be thought of as an array and the offset as an index into this "array".

#### **6.2.1** Format

| Tag   Line Number | Offset |
|-------------------|--------|
|-------------------|--------|

### 6.2.2 Example

A CPU is using 24-bit addresses and is byte-addressable. Each line in cache holds 16 bytes of data and each block is 1 byte. Assuming that the tag is 12 bits wide, find the following: number of lines in cache, size of cache, size of tag, and sizes of each partition in the format.

```
Size(Cache\ Line) = 16\ bytes
Bit\_Width(Offset) = Number\ of\ bits\ needed\ to\ address\ 16\ blocks
= \log_2 16
= 4\ bits
Bit\_Width(Line\ Number) = Size(Address) - Bit\_Width(Tag) - Bit\_Width(Offset)
= 24\ bits - 12\ bits - 4\ bits
= 8\ bits
Number\ of\ Cache\ Lines = 2^{Bit\_Width(Line\ Number)}
= 2^8
= 256
```

Size of Cache = Number of Cache Lines 
$$*$$
 Size of Cache Lines =  $2^8 * 2^4$  = 2048 bytes = 2 KB

### Format Partition Sizes and Bit Range

|           | Tag        | Line Number | Offset   |
|-----------|------------|-------------|----------|
| Size      | 12 bits    | 8 bits      | 4 bits   |
| Bit Range | Bits 12-23 | Bits 4-11   | Bits 0-3 |

## 6.3 Fully Associative

A given address is partitioned into two components: Tag and offset. The reason why a line number is unused is because the CPU will perform a linear search through all cache lines looking for either a cache hit, an unused cache line, or a cache line to replace. Because of this, using full association has the lowest miss rates. The tag and offset performs the same as the tag and offset in direct mapping.

#### **6.3.1** Format

| Tag | Offset |
|-----|--------|
|-----|--------|

### 6.3.2 Example

A CPU is using 11-bit addresses and is byte-addressable. The cache size is 128 bytes and the size of each cache line is 8 bytes. Find the following: Bit width of offset, bit width of tag, number of cache lines, and sizes and bit ranges of the partitions.

$$\begin{array}{lll} Bit\_Width(Offset) & = & Number \ of \ bits \ needed \ to \ address \ 8 \ blocks \\ & = & \log_2 8 \\ & = & 3 \ bits \end{array}$$

Format Partition Sizes and Bit Range

|           | Tag       | Offset   |
|-----------|-----------|----------|
| Size      | 8 bits    | 3 bits   |
| Bit Range | Bits 3-10 | Bits 0-2 |

#### 6.4 Set Associative

Set associative caching is a hybrid between direct-mapped and fully associative. A given address is partitioned into three components: Tag, set number, and offset. What makes set associative different from direct-mapped caching is that a given address will access a specific set of cache lines instead of a single cache line. Within that set of cache lines, it will perform a linear search like the fully associative cache. The tag and offset perform the same as the direct-mapped cache and fully associative cache. **Important terminology:** N-way associative cache means that there are N cache lines per set.

#### **6.4.1** Format

| Tag | Set Number | Offset |
|-----|------------|--------|
|-----|------------|--------|

### 6.4.2 Example

A CPU is using 64-bit addresses and is byte-addressable. It also uses a 3-way set associative cache with a cache size of 98,304 bytes and 32 sets. Find the following: Number of lines per set, size of each set, number of cache lines, size of cache line, bit width of offset, bit width of set number, bit width of tag, and partition sizes and bit ranges.

Number of Lines per Set = 3 (Given)

Size(Set) = Size(Cache)/Number of Sets = 98304/32= 3072 bytes

Number of sets =  $Size(Cache)/Bit_Width(Offset)$ = 11 bits - 3 bits= 8 bits

Number of Cache Lines = Size(Cache)/Size(Cache Line) = 128 bytes/8 bytes

= 16 Cache Lines

# Format Partition Sizes and Bit Range

|           | Tag       | Offset   |
|-----------|-----------|----------|
| Size      | 8 bits    | 3 bits   |
| Bit Range | Bits 3-10 | Bits 0-2 |

- 7 Virtual Memory
- 8 Multi-Cycle CPU Design
- 9 Pipeline CPU Design