

# **UESTC4019: Real-Time Computer Systems and Architecture**

Lecture 14
Instruction Sets – Addressing Modes and Formats (Part-3)

#### Addressing Modes

- The most common addressing techniques or modes are:
  - Immediate
  - Direct
  - Indirect
  - Register
  - Register indirect
  - Displacement
  - Stack

## Addressing Modes

• Following notation is used:

A = contents of an address field in the instruction

R = contents of an address field in the instruction that refers to a register

EA = actual (effective) address of the location containing the referenced operand

(X) = contents of memory location X or register X



## Basic Addressing Modes

| Mode              | Algorithm         | Principal Advantage | Principal Disadvantage     |
|-------------------|-------------------|---------------------|----------------------------|
| Immediate         | Operand = A       | No memory reference | Limited operand magnitude  |
| Direct            | EA = A            | Simple              | Limited address space      |
| Indirect          | EA = (A)          | Large address space | Multiple memory references |
| Register          | EA = R            | No memory reference | Limited address space      |
| Register indirect | EA = (R)          | Large address space | Extra memory reference     |
| Displacement      | EA = A + (R)      | Flexibility         | Complexity                 |
| Stack             | EA = top of stack | No memory reference | Limited applicability      |

#### Immediate Addressing (1 of 2)

- Simplest form of addressing
- Operand = A
- This mode can be used to define and use constants or set initial values of variables
  - Typically the number will be stored in twos complement form
  - The leftmost bit of the operand field is used as a sign bit



#### Immediate Addressing (2 of 2)

#### Advantage:

 No memory reference other than the instruction fetch is required to obtain the operand, thus saving one memory or cache cycle in the instruction cycle

#### Disadvantage:

 The size of the number is restricted to the size of the address field, which, in most instruction sets, is small compared with the word length

#### Direct Addressing

- Address field contains the effective address of the operand
- Effective address = address field
- EA = A
- Was common in earlier generations of computers
- Requires only one memory reference and no special calculation
- Limitation is that it provides only a limited address space



#### Indirect Addressing (1 of 2)

- Reference to the address of a word in memory which contains a full-length address of the operand
- EA = (A)
  - Parentheses are to be interpreted as meaning contents of
- Advantage:
  - For a word length of N an address space of 2<sup>N</sup> is now available



## Indirect Addressing (2 of 2)

- Disadvantage:
  - Instruction execution requires two memory references to fetch the operand
    - One to get its address and a second to get its value
    - A rarely used variant of indirect addressing is multilevel or cascaded indirect addressing
  - EA = ( ... (A) ... )
  - Disadvantage is that three or more memory references could be required to fetch an operand

#### Register Addressing

- Address field refers to a register rather than a main memory address
- EA = R
- Advantages:
  - Only a small address field is needed in the instruction
  - No time-consuming memory references are required
- Disadvantage:
  - The address space is very limited



## Register Indirect Addressing

- Analogous to indirect addressing
  - The only difference is whether the address field refers to a memory location or a register
- EA = (R)
- Address space limitation of the address field is overcome by having that field refer to a word-length location containing an address
- Uses one less memory reference than indirect addressing



#### Displacement Addressing (1 of 2)

- Combines the capabilities of direct addressing and register indirect addressing
- EA = A + (R)
- Requires that the instruction have two address fields, at least one of which is explicit
  - The value contained in one address field (value = A) is used directly
  - The other address field refers to a register whose contents are added to A to produce the effective address



## Displacement Addressing (2 of 2)

- Most common uses:
  - Relative addressing
  - Base-register addressing
  - Indexing

#### **Example Question**

- What facts go into determining the use of the addressing bits of an instruction?
  - Number of addressing modes
  - Number of operands
  - Register versus memory
  - Number of register sets
  - Address range
  - Address granularity

## Relative Addressing

- The implicitly referenced register is the program counter (PC)
  - The next instruction address is added to the address field to produce the EA
  - Typically the address field is treated as a twos complement number for this operation
  - Thus the effective address is a displacement relative to the address of the instruction
- Exploits the concept of locality
- Saves address bits in the instruction if most memory references are relatively near to the instruction being executed

## Base-Register Addressing

- The referenced register contains a main memory address and the address field contains a displacement from that address
- The register reference may be explicit or implicit
- Exploits the locality of memory references
- Convenient means of implementing segmentation
- In some implementations a single segment base register is employed and is used implicitly
- In others the programmer may choose a register to hold the base address of a segment and the instruction must reference it explicitly

## Indexing (1 of 2)

- The address field references a main memory address and the referenced register contains a positive displacement from that address
- The method of calculating the EA is the same as for baseregister addressing
- An important use is to provide an efficient mechanism for performing iterative operations

## Indexing (2 of 2)

#### Autoindexing

- Automatically increment or decrement the index register after each reference to it
- -EA = A + (R)
- $-(R)\leftarrow(R)+1$

#### Postindexing

- Indexing is performed after the indirection
- EA = (A) + (R)

#### Preindexing

- Indexing is performed before the indirection
- EA = (A + (R))

## ARM Indexing Methods



#### Stack Addressing (1 of 2)

- A stack is a linear array of locations
  - Sometimes referred to as a pushdown list or last-infirst-out queue
- A stack is a reserved block of locations
  - Items are appended to the top of the stack so that the block is partially filled
- Associated with the stack is a pointer whose value is the address of the top of the stack
  - The stack pointer is maintained in a register

#### Stack Addressing (2 of 2)

- Thus references to stack locations in memory are in fact register indirect addresses
- Is a form of implied addressing
- The machine instructions need not include a memory reference but implicitly operate on the top of the stack



## ARM Data Processing Instruction Addressing and Branch Instructions

- Data processing instructions
  - Use either register addressing or a mixture of register and immediate addressing
  - For register addressing the value in one of the register operands may be scaled using one of the five shift operators
- Branch instructions
  - The only form of addressing for branch instructions is immediate
  - Instruction contains 24 bit value
    - Shifted 2 bits left so that the address is on a word boundary
    - Effective range ± 32MB from the program counter

## ARM Load/Store Multiple Addressing



#### **Instruction Formats**

- Define the layout of the bits of an instruction, in terms of its constituent fields
- Must include an opcode and, implicitly or explicitly, indicate the addressing mode for each operand
- For most instruction sets more than one instruction format is used

#### Instruction Length

- Most basic design issue
- Affects, and is affected by:
  - Memory size
  - Memory organization
  - Bus structure
  - Processor complexity
  - Processor speed
  - Should be equal to the memory-transfer length or one should be a multiple of the other
  - Should be a multiple of the character length, which is usually 8 bits, and of the length of fixed-point numbers

#### Allocation of Bits

- Number of addressing modes
- Number of operands
- Register versus memory
- Number of register sets
- Address range
- Address granularity

#### PDP-8 Instruction Formats

BSW = Byte SWap

One of the simplest instruction designs for a general-purpose computer was for the PDP-8 [BELL78b].

The PDP-8 uses 12-bit instructions and operates on 12-bit words.

There is a single general-purpose register, the accumulator.

|                              | 0 1                       |           |          |          | y Refere | nce Inst | ructions  |            |          |         |     |
|------------------------------|---------------------------|-----------|----------|----------|----------|----------|-----------|------------|----------|---------|-----|
|                              | Opcode                    |           | D/I      | Z/C      | _        |          | Di        | splacem    | ent      |         |     |
| 0                            |                           | 2         | 3        | 4        | 5        |          |           |            |          |         | 11  |
|                              |                           |           |          | Innu     | t/Outpu  | t Instru | ctions    |            |          |         |     |
| 1                            | 1                         | 0         |          | три      |          | vice     | ctions    |            |          | Opcode  |     |
| 0                            | 1                         | 2         | 3        |          |          | ,100     |           | 8          | 9        | ореоце  | 11  |
| _                            |                           | _         | _        |          |          |          |           | _          | -        |         |     |
|                              |                           |           |          | Register | r Refere | nce Inst | ructions  | :          |          |         |     |
| Group                        | 1 Microi                  | nstructio | ons      | _        |          |          |           |            |          |         |     |
| 1                            | 1                         | 1         | 0        | CLA      | CLL      | CMA      | CML       | RAR        | RAL      | BSW     | IAC |
| 0                            | 1                         | 2         | 3        | 4        | 5        | 6        | 7         | 8          | 9        | 10      | 11  |
|                              |                           |           |          |          |          |          |           |            |          |         |     |
| Group                        | 2 Microi                  | nstructio | ons      |          |          |          |           |            |          |         |     |
| 1                            | 1                         | 1         | 1        | CLA      | SMA      | SZA      | SNL       | RSS        | OSR      | HLT     | 0   |
| 0                            | 1                         | 2         | 3        | 4        | 5        | 6        | 7         | 8          | 9        | 10      | 11  |
|                              |                           |           |          |          |          |          |           |            |          |         |     |
| Group                        | 3 Microi                  | nstructio | ons      |          |          |          |           |            |          |         |     |
| 1                            | 1                         | 1         | 1        | CLA      | MQA      | 0        | MQL       | 0          | 0        | 0       | 1   |
| 0                            | 1                         | 2         | 3        | 4        | 5        | 6        | 7         | 8          | 9        | 10      | 11  |
|                              |                           |           |          |          |          |          |           |            |          |         |     |
|                              | = Direct                  |           |          |          |          |          | = Incren  |            |          |         |     |
| Z/C = Page 0 or Current page |                           |           |          |          |          |          | = Skip c  |            |          |         |     |
| CLA = Clear Accumulator      |                           |           |          |          |          |          | = Skip c  |            |          |         |     |
|                              | <ul><li>Clear l</li></ul> |           |          |          |          |          | = Skip c  |            |          |         |     |
|                              | = CoMp                    |           |          | lator    |          |          | = Rever   |            |          |         |     |
|                              | = CoMp                    |           |          |          |          |          | = Or wit  | h Switc    | h Regist | er      |     |
|                              | = Rotate                  |           |          | _        |          |          | = HaLT    |            |          |         |     |
| RAL                          | = Rotate                  | Accum     | ulator L | eft      | MQA =    | = Multip | olier Que | otient int | o Accun  | nulator |     |

MQL = Multiplier Quotient Load

#### PDP-10 Instruction Format



I = indirect bit

A sharp contrast to the instruction set of the PDP-8 is that of the PDP-10. The PDP-10 was designed to be a large-scale time-shared system, with an emphasis on making the system easy to program, even if additional hardware expense was involved.

#### Variable-Length Instructions

- Variations can be provided efficiently and compactly
- Increases the complexity of the processor
- Does not remove the desirability of making all of the instruction lengths integrally related to word length
  - Because the processor does not know the length of the next instruction to be fetched a typical strategy is to fetch a number of bytes or words equal to at least the longest possible instruction
  - Sometimes multiple instructions are fetched

#### Instruction Formats for the PDP-11

The PDP-11 was designed to provide a powerful and flexible instruction set within the constraints of a 16-bit minicomputer [BELL70].

The PDP-11 employs a set of eight 16-bit general-purpose registers.



Numbers below fields indicate bit length

Source and Destination each contain a 3-bit addressing mode field and a 3-bit register number

FP indicates one of four floating-point registers

R indicates one of the general-purpose registers

CC is the condition code field

## Examples of VAX Instructions

| Hexadecimal<br>Format                  | Explanation                                                                                                                                                                | Assembler Notation and Description                                                                                                             |
|----------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------|
| 8 bits → <b>0 5</b>                    | Opcode for RSB                                                                                                                                                             | RSB<br>Return from subroutine                                                                                                                  |
| D 4 5 9                                | Opcode for CLRL<br>Register R9                                                                                                                                             | CLRL R9<br>Clear register R9                                                                                                                   |
| B 0<br>C 4<br>6 4<br>0 1<br>A B<br>1 9 | Opcode for MOVW Word displacement mode, Register R4 356 in hexadecimal Byte displacement mode, Register R11 25 in hexadecimal                                              | MOVW 356(R4), 25(R11)  Move a word from address that is 356 plus contents of R4 to address that is 25 plus contents of R11                     |
| C 1<br>0 5<br>5 0<br>4 2<br>D F        | Opcode for ADDL3  Short literal 5  Register mode R0  Index prefix R2  Indirect word relative (displacement from PC)  Amount of displacement from PC relative to location A | ADDL3 #5, R0, @A[R2] Add 5 to a 32-bit integer in R0 and store the result in location whose address is sum of A and 4 times the contents of R2 |

#### ARM Instruction Formats (1 of 2)

|                                 | 31 30 29 28 | 27 2 | 6 25 | 24 | 23 | 22  | 21 | 20 | 19 18 17 16 | 15 14 13 12 | 11 10 9 8  | 7  | 6 5    | 4        | 3    | 2 1 | 0 |
|---------------------------------|-------------|------|------|----|----|-----|----|----|-------------|-------------|------------|----|--------|----------|------|-----|---|
| data processing immediate shift | cond        | 0 (  | 0 0  | C  | рс | ode | e  | S  | Rn          | Rd          | shift amou | nt | shift  | 0        |      | Rm  |   |
| data processing register shift  | cond        | 0 (  | 0 0  | C  | pc | ode | e  | S  | Rn          | Rd          | Rs         | 0  | shift  | 1        |      | Rm  |   |
| data processing immediate       | cond        | 0 (  | ) 1  | C  | рс | ode | e  | S  | Rn          | Rd          | rotate     |    | im     | me       | edia | ite |   |
| load/store immediate offset     | cond        | 0    | 1 0  | Р  | U  | В   | W  | L  | Rn          | Rd          | im         | nm | ediate | <u>.</u> |      |     |   |
| load/store<br>register offset   | cond        | 0 '  | 1 1  | Р  | U  | В   | W  | L  | Rn          | Rd          | shift amou | nt | shift  | 0        |      | Rm  |   |
| load/store<br>multiple          | cond        | 1 (  | 0 0  | Р  | U  | S   | W  | L  | Rn          |             | regist     | er | list   |          |      |     |   |
| branch/branch<br>with link      | cond        | 1 (  | ) 1  | L  |    |     |    |    |             | 24-bi       | t offset   |    |        |          |      |     |   |

- S = For data processing instructions, signifies that the instruction updates the condition codes
- S = For load/store multiple instructions, signifies whether instruction execution is restricted to supervisor mode
- P, U, W = bits that distinguish among different types of addressing\_mode

- B = Distinguishes between an unsigned byte (B==1) and a word (B==0) access
- L = For load/store instructions, distinguishes between a Load (L==1) and a Store (L==0)
- L = For branch instructions, determines whether a return address is stored in the link register

#### ARM Instruction Formats (2 of 2)

- All instructions in the ARM architecture are 32 bits long and follow a regular format
- The first four bits of an instruction are the condition code.

  Virtually all ARM instructions can be conditionally executed.
- The next three bits specify the general type of instruction.
- For most instructions other than branch instructions, the next five bits constitute an opcode and/or modifier bits for the operation.
- The remaining 20 bits are for operand addressing. The regular structure of the instruction formats eases the job of the instruction decode units.

## Examples of Use of ARM Immediate Constants



ror #0 - range 0 through 0x000000FF - step 0x00000001



ror #8 - range 0 through 0xFF000000 - step 0x01000000



ror #30 - range 0 through 0x000003FC - step 0x00000004

# Expanding a Thumb ADD Instruction into its ARM Equivalent



The Thumb instruction set is a re-encoded subset of the ARM instruction set. Thumb is designed to increase the performance of ARM implementations that use a 16-bit or narrower memory data bus and to allow better code density than provided by the ARM instruction set. The Thumb instruction set contains a subset of the ARM 32-bit instruction set recoded into 16-bit instructions.

#### Thumb-2 Instruction Set (1 of 2)

- The only instruction set available on the Cortex-M microcontroller products
- Is a major enhancement to the Thumb instruction set architecture (ISA)
  - Introduces 32-bit instructions that can be intermixed freely with the older 16-bit Thumb instructions
  - Most 32-bit Thumb instructions are unconditional, whereas almost all ARM instructions can be conditional

#### Thumb-2 Instruction Set (2 of 2)

- Introduces a new If-Then (IT) instruction that delivers much of the functionality of the condition field in ARM instructions
- Delivers overall code density comparable with Thumb, together with the performance levels associated with the ARMISA
- Before Thumb-2 developers had to choose between Thumb for size and ARM for performance

## Thumb-2 Encoding



| Halfword 1 [15:13] | Halfword1 [12:11] | Length                | Functionality                                 |
|--------------------|-------------------|-----------------------|-----------------------------------------------|
| Not 111            | xx                | 16 bits (1 halfword)  | 16-bit Thumb instruction                      |
| 111                | 00                | 16 bits (1 halfword)  | 16-bit Thumb unconditional branch instruction |
| 111                | Not 00            | 32 bits (2 halfwords) | 32-bit Thumb-2 instruction                    |

The encoding is compatible with the existing Thumb unconditional branch instructions, which has the bit pattern 11100 in the five leftmost bits of the instruction. No other 16-bit instruction begins with the pattern 111 in the three leftmost bits, so the bit patterns 11101, 11110, and 11111 indicate that this is a 32-bit Thumb instruction.

## Computation of the Formula N=I+J+K

| Address |      | Cont | tents |      |
|---------|------|------|-------|------|
| 101     | 0010 | 0010 | 101   | 2201 |
| 102     | 0001 | 0010 | 102   | 1202 |
| 103     | 0001 | 0010 | 103   | 1203 |
| 104     | 0011 | 0010 | 104   | 3204 |
|         |      |      |       |      |
| 201     | 0000 | 0000 | 201   | 0002 |
| 202     | 0000 | 0000 | 202   | 0003 |
| 203     | 0000 | 0000 | 203   | 0004 |
| 204     | 0000 | 0000 | 204   | 0000 |

| Address | Contents |
|---------|----------|
| 101     | 2201     |
| 102     | 1202     |
| 103     | 1203     |
| 104     | 3204     |
|         |          |
| 201     | 0002     |
| 202     | 0003     |
| 203     | 0004     |
| 204     | 0000     |

(a) Binary program

(b) Hexadecimal program

| Address | Instru | ıction |
|---------|--------|--------|
| 101     | LDA    | 201    |
| 102     | ADD    | 202    |
| 103     | ADD    | 203    |
| 104     | STA    | 204    |
|         |        |        |
| 201     | DAT    | 2      |
| 202     | DAT    | 3      |
| 203     | DAT    | 4      |
| 204     | DAT    | 0      |

| (a) Sumbolia progr |    |
|--------------------|----|
|                    | am |
| (c) Symbolic progr | am |

| Label   | Operation | Operand |
|---------|-----------|---------|
| FORMUL  | LDA       | I       |
| TOTATOL | ADD       | J       |
|         | ADD       | K       |
|         | STA       | N       |
|         |           |         |
| I       | DATA      | 2       |
| J       | DATA      | 3       |
| K       | DATA      | 4       |
| N       | DATA      | 0       |

(d) Assembly program