# Volt64 CPU

Andrew Clark May 19, 2023

## **Table of Contents**

| Table of Contents Registers, Main Widths, etc. |                       |  |  |  |  |  |  |  |  | 1 |   |  |  |  |      |  |  |    |
|------------------------------------------------|-----------------------|--|--|--|--|--|--|--|--|---|---|--|--|--|------|--|--|----|
|                                                |                       |  |  |  |  |  |  |  |  |   | 2 |  |  |  |      |  |  |    |
| Vector 7                                       | Type Casting          |  |  |  |  |  |  |  |  |   |   |  |  |  |      |  |  | 5  |
| Exception                                      | ons                   |  |  |  |  |  |  |  |  |   |   |  |  |  |      |  |  | 6  |
| Instruct                                       | ions                  |  |  |  |  |  |  |  |  |   |   |  |  |  |      |  |  | 8  |
| 5.1                                            | Group 0 Instructions  |  |  |  |  |  |  |  |  |   |   |  |  |  | <br> |  |  | 8  |
| 5.2                                            | Group 1 Instructions  |  |  |  |  |  |  |  |  |   |   |  |  |  | <br> |  |  | 10 |
| 5.3                                            | Group 2 Instructions  |  |  |  |  |  |  |  |  |   |   |  |  |  | <br> |  |  | 11 |
| 5.4                                            | Group 3 Instructions  |  |  |  |  |  |  |  |  |   |   |  |  |  |      |  |  | 12 |
| 5.5                                            | Group 4 Instructions  |  |  |  |  |  |  |  |  |   |   |  |  |  |      |  |  | 13 |
| 5.6                                            | Group 5 Instructions  |  |  |  |  |  |  |  |  |   |   |  |  |  |      |  |  | 14 |
| 5.7                                            | Group 6 Instructions  |  |  |  |  |  |  |  |  |   |   |  |  |  |      |  |  | 15 |
| 5.8                                            | Group 7 Instructions  |  |  |  |  |  |  |  |  |   |   |  |  |  | <br> |  |  | 17 |
| 5.9                                            | Group 8 Instructions  |  |  |  |  |  |  |  |  |   |   |  |  |  | <br> |  |  | 18 |
| 5.10                                           | Group 9 Instructions  |  |  |  |  |  |  |  |  |   |   |  |  |  |      |  |  | 20 |
| 5.11                                           | Group 10 Instructions |  |  |  |  |  |  |  |  |   |   |  |  |  |      |  |  | 23 |
|                                                | Group 11 Instructions |  |  |  |  |  |  |  |  |   |   |  |  |  |      |  |  | 25 |
| 5 13                                           | Group 12 Instructions |  |  |  |  |  |  |  |  |   |   |  |  |  |      |  |  | 28 |

## Registers, Main Widths, etc.

- The machine is an implementation of Line Associative Registers (LARs). Both instruction LARs (ILARs) and data LARs (DLARs) are included in the design. There are a grand total of 128 ILARs and 128 DLARs, but they are split between the LARs owned by the supervisor mode and the LARs owned by the user mode. There are 64 supervisor mode ILARs, 64 supervisor mode DLARs, 64 user mode ILARs, and 64 user mode DLARs.
- · Addresses are 64-bit.
- 8 types of integer operations are supported, specifically u8, s8, u16, s16, u32, s32, u64, and s64.
- The machine boots in supervisor mode. On that note, the processor jumps to address 0x0 whenever it enters supervisor mode.

#### ILARs

- In user mode, ILARs 0 to 63 are referred to as i0, i1, i2, ..., i61, i62, ipc.
- In supervisor mode, ILARs 64 to 127 are referred to as i0, i1, i2, ..., i61, i62, ipc. Note that supervisor mode ILARs are encoded into instructions without the most significant bit, i.e. supervisor mode i0 is encoded as 0b000000.
- The two ILARs called "i0" have all their fields set to zero, and when written to, the contents of the ILAR does not change.
- The two ILARs called "ipc" are the program counters for the two operating modes of the processor.
- An ILAR's data field is 128 bytes long. It is composed of 32-bit instructions aligned to 32 bits.
- An ILAR's scalar offset field is (7 2 = 5)-bit due to instructions being 32-bit and the data field being 128 bytes long.
- The base\_address field of an ILAR is (64 6 = 58)-bit.
- An ILAR's tag field is 7-bit because there are 128 total ILARs.

#### • DLARs

- In user mode, DLARs 0 to 63 are referred to as d0, d1, d2, ..., d57, d58, dt0 (aka d59, assembler temporary 0), dt1 (aka d60, assembler temporary 1), dcp (aka d61, intended to be used as the constant pools pointer), dfp (aka d62, intended to be used as the frame pointer), dsp (aka d63, intended to be used as the stack pointer).
- In supervisor mode, DLARs 64 to 127 are referred to as d0, d1, d2, ..., d57, d58, dt0 (aka d59, assembler temporary 0), dt1 (aka d60, assembler temporary 1), dcp (aka d61, intended to be used as the constant pools

pointer), dfp (aka d62, intended to be used as the frame pointer), dsp (aka d63, intended to be used as the stack pointer). Note that supervisor mode DLARs are encoded into instructions without the most significant bit, i.e. supervisor mode d0 is encoded as 0b000000.

- The two DLARs called "d0" have all their fields set to zero, and when written to, the contents of the particular d0 DLAR do not change.
- A DLAR's data field is 128 bytes long. It is composed of the scalar data elements of the 128 byte vectors, where the type of the scalar data elements is determined by the type tag field of the DLAR.
- A DLAR's scalar offset field is 7-bit due to the data field being 128 bytes long. However, the address of the scalar data is forcibly aligned to the width of the DLAR's type when used as source operands in instructions that read from the full address field of the instruction.
- The base address field of a DLAR is (64 6 = 58)-bit.
- Note that the full address of a scalar located in a DLAR, dA, is determined by the following formula: cat (dA.base\_address, dA.scalar\_offset).
   If dA.type is a size other than that of u8 and s8, dA.scalar\_offset will be aligned to the size of dA.type for this calculation.
- DLARs can take on the following types (3-bit enum):
  - \* 8-bit, unsigned (u8)
  - \* 8-bit, signed (\$8)
  - \* 16-bit, unsigned (u16)
  - \* 16-bit, signed (s16)
  - \* 32-bit, unsigned (u32)
  - \* 32-bit, signed (s32)
  - \* 64-bit, unsigned (u64)
  - \* 64-bit, signed (s64)
- A DLAR's tag field is 7-bit because there are 128 total DLARs.
- Similarly, a DLAR's reference count field is 7-bit because there are 128 total DLARs.
- A DLAR's dirty field is 1-bit.
- The ie register
  - "ie" is short for "interrupt enable".
  - This register is 1-bit.
  - This register is a flag indicating whether or not an interrupt can be serviced.
     It can be read from/written to using cpy instructions.
  - The retiz and retinz instructions set ie to 0b1 and returns to user mode from supervisor mode.
- The xct register

- "exception type"
- This register is 32-bit.
- This register is set to the numerical value of an exception's type upon the machine entering supervisor mode. It can be read from/written to using cpy instructions.

#### • The swiarg0 register

- This register is 128 bytes long.
- This register indicates argument 0 to swi. In supervisor mode, it can be read from/written to with cpy instructions.

#### • The swiarg1 register

- This register is 128 bytes long.
- This register indicates argument 1 to swi. In supervisor mode, it can be read from/written to with cpy instructions.

#### • The swiarg2 register

- This register is 128 bytes long.
- This register indicates argument 2 to swi. In supervisor mode, it can be read from/written to with cpy instructions.

#### • The swiarg3 register

- This register is 128 bytes long.
- This register indicates argument 3 to swi. In supervisor mode, it can be read from/written to with cpy instructions.

## **Vector Type Casting**

- The scalar offset field determines which sub-vectors of a DLAR to use for vector operations when DLAR types do not match, with the following behavior:
  - 1. When up-casting any source DLAR, dsrc, to the type of any destination DLAR, ddst, the offset into dsrc.data of element 0x0 of the subvector of dsrc.data that will be used will be calculated by aligning dsrc.scalar offset to ddst.type.
  - 2. When down-casting any source DLAR, dsrc, to the type of any destination DLAR, ddst, the offset into ddst.data of element 0x0 of the sub-vector of ddst.data that will be used will be calculated by aligning ddst.scalar offset to dsrc.type.
  - 3. If either of the following conditions is met:
    - dB.type and dC.type are both smaller in size than the type of a destination DLAR, ddst, and dB.type and dC.type are the same size
    - dB.type and dC.type are both smaller in size than the type of a destination DLAR, ddst, and dB.type and dC.type are not the same size

then up-casting will be performed on both dB and dC as in item 1. of this list.

- 4. If the following conditions are all met:
  - dB.type and dC.type are both the same size
  - the shared size of dB.type and dC.type is larger than the size of the type of a destination DLAR, ddst

then the down-casting will be performed on both dB and dC as in item 2. of this list.

- 5. If the following conditions are all met:
  - exactly one of dB.type and dC.type is smaller than the type of a destination DLAR, ddst
  - there also exists a source DLAR, dB or dC, whose type is the same size as ddst.type

then only the source DLAR whose type is smaller than ddst.type will be up-casted, as in item 1. of this list. The other source DLAR will be used as is, without casting.

- 6. If the following conditions are all met:
  - exactly one of dB.type and dC.type is larger than the type of a destination DLAR, ddst,
  - there also exists a source DLAR (dB or dC) whose type is the same size as ddst.type

only the source DLAR with a type larger than ddst.type will be downcasted, as in item 2. of this list.

Also, the sub-vector of the source DLAR whose type is the same size as ddst.type will be selected as if item 2. of this list were applied with the two source DLARs, where the "destination" DLAR for that operation is the source DLAR whose type is same size as ddst.type.

Note that the source DLAR whose type is the same size as ddst.type will not have any of its fields changed by this second application of item 2. of this list.

- 7. If the following conditions are all met:
  - dB.type and dC.type do not have the same size
  - neither dB.type nor dC.type is the same size as a destination DLAR, ddst
  - Either of the following conditions is met:
    - \* exactly one of dB.type and dC.type is has a larger size than ddst.type, and the type of the other source DLAR has a smaller size than ddst.type
    - \* both of dB. type and dC. type is have larger sizes than ddst.type

then dC.type will be instead interpreted as dB.type, and the specific item in this list which applies to that situation will be performed.

## **Exceptions**

Some instructions may cause an exception to occur, putting the processor in supervisor mode.

The following exceptions may occur during normal execution of a program. xct is set to a numerical value representing these upon the processor encountering an exception.

- When xct == 0x0: Nothing.
- When xct == 0x1: Taking a non-software interrupt (which would also set ie to 0b0).
- When xct == 0x2: swi.
- When xct == 0x3: retiz or retinz when in user mode.
- When xct == 0x4: retxz or retxnz when in user mode.
- When xct == 0x5: Undefined instruction.
- When xct == 0x6: Division by zero.
- When xct == 0x7: Reserved for future expansion.

- Note: This value of xct might be used for floating point exceptions in a future revision of this document.
- When xct == 0x8: cpy.s that reads from ie in user mode.
- When xct == 0x9: cpy.s that writes to ie in user mode.
- When xct == 0xa: cpy.s that reads from xct in user mode.
- When xct == 0xb: cpy.s that writes to xct in user mode.
- When xct == 0xc: cpy.v that reads from swiarg0 when in user mode.
- When xct == 0xd: cpy.v that writes to swiarg0 when in user mode.
- When xct == 0xe: cpy.v that reads from swiarg1 when in user mode.
- When xct == 0xf: cpy.v that writes to swiarg1 when in user mode.
- When xct == 0x10: cpy.v that reads from swiarg2 when in user mode.
- When xct == 0x11: cpy.v that writes to swiarg2 when in user mode.
- When xct == 0x12: cpy.v that reads from swiarg3 when in user mode.
- When xct == 0x13: cpy.v that writes to swiarg3 when in user mode.
- When xct == 0x14: Instructions that read from supervisor mode ILARs or DLARs when in user mode.
- When xct == 0x15: Instructions that write to supervisor mode ILARs or DLARs when in user mode.
- When xct == 0x16: When user mode ipc's next destination is not in any ILARs.

#### **Instructions**

#### 5.1 Group 0 Instructions

- Encoding 0: 0000 aaaa aabb bbbb cccc cc00 000v oooo
- Encoding 1: 0000 aaaa aabb bbbb cccc ccdd dddd oooo
  - a: DLAR a
  - b: DLAR b or ILAR b
  - c: DLAR c
  - d: DLAR d
  - 17.
    - \* when 0b0: scalar operation. The assembly syntax indicating a scalar operation simply adds ".s" to the instruction's name.
    - \* when 0b1: vector operation. The assembly syntax indicating a vector operation simply adds ".v" to the instruction's name.
  - o: Opcode
- Most instructions in this group use Encoding 0. The instructions div.s and div.v use Encoding 1. These two instructions have opcodes of 0b1110 and 0b1111, respectively, such that bit 0 of the instruction specifies the same information as the v bit does for other instructions in Encoding 0.
- Instruction List:
  - 1. add dA, dB, dC
  - 2. sub dA, dB, dC
  - 3. slt dA, dB, dC
  - 4. mul dA, dB, dC
  - 5. and dA, dB, dC
  - 6. or dA, dB, dC
  - 7. xor dA, dB, dC
  - 8. shl dA, dB, dC
    - Logical shift left.
    - This instruction casts a temporary copy of dC to the unsigned type that
      is the same size as dA's type and uses that instead of dC.
  - 9. shr dA, dB, dC
    - Logical shift right if dA is unsigned, or arithmetic shift right if dA is signed.
    - This instruction casts a temporary copy of dC to the unsigned type that
      is the same size as dA's type and uses that instead of dC.

- 10. rol dA, dB, dC
  - Rotate left.
  - This instruction casts a temporary copy of dC to the unsigned type that
    is the same size as dA's type and uses that instead of dC.
- 11. ror dA, dB, dC
  - Rotate right.
  - This instruction casts a temporary copy of dC to the unsigned type that
    is the same size as dA's type and uses that instead of dC.
- 12. add dA, dB.addr, dC
  - For add.s dA, dB.addr, dC, this instruction uses the value of cast(dA.type, align(dB.type, dB.addr)) as the second argument of the add.
  - For add.v dA, dB.addr, dC, this instruction duplicates the value of cast (dA.type, align (dB.type, dB.addr)) (a scalar) into a temporary (DLAR data's length number of bytes long) vector of element type dA.type for the purposes of this calculation.
- 13. add dA, iB.addr, dC
  - For add.s dA, iB.addr, dC, this instruction uses the value of cast (dA.type, iB.addr) as the second argument of the add.
  - For add.v dA, iB.addr, dC, this instruction duplicates the value of cast (dA.type, iB.addr) (a scalar) into a temporary (DLAR data's length number of bytes long) vector of element type dA.type for the purposes of this calculation.
- 14. shl dA, dB.addr, dC
  - For shl.s dA, dB.addr, dC, this instruction uses the value of cast(dA.type, align(dB.type, dB.addr)) as the second argument of the shl.
  - For shl.v dA, dB.addr, dC, this instruction duplicates the value of cast (dA.type, align (dB.type, dB.addr)) (a scalar) into a temporary (DLAR data's length number of bytes long) vector of element type dA.type for the purposes of this calculation.
  - This instruction casts a temporary copy of dC to the unsigned type that
    is the same size as dA's type and uses that instead of dC.
- 15. div.s dA, dB, dC, dD
  - This instruction writes the quotient into dA, and the remainder into dD.
- 16. div.v dA, dB, dC, dD
  - This instruction writes the quotient into dA, and the remainder into dD.

#### 5.2 Group 1 Instructions

- Encoding: 0001 aaaa aabb bbbb cccc cc00 0000 0000
  - a: DLAR a
  - b: DLAR b
  - c: DLAR c
  - o: Opcode
- Instruction List:
  - 1. add.r dA, dB
    - This instruction casts a temporary copy of dB to the dA's type and performs a sum of all the scalar data elements of the temporary copy of dB, then stores the result in dA's scalar data.
  - 2. mul.r dA, dB
    - This instruction casts a temporary copy of dB to the dA's type and performs a product of all the scalar data elements of the temporary copy of dB, then stores the result in dA's scalar data.
  - 3. max.r dA, dB
    - This instruction casts a temporary copy of dB to the dA's type and finds the scalar data element of the temporary copy of dB that is the maximum, then stores the result in dA's scalar data.
  - 4. min.r dA, dB
    - This instruction casts a temporary copy of dB to the dA's type and finds the scalar data element of the temporary copy of dB that is the minimum, then stores the result in dA's scalar data.
  - 5. and.r dA, dB
    - This instruction casts a temporary copy of dB to the dA's type and performs a bitwise AND reduction of all the scalar data elements of the temporary copy of dB, then stores the result in dA's scalar data.
  - 6. or.r dA, dB
    - This instruction casts a temporary copy of dB to the dA's type and performs a bitwise OR reduction of all the scalar data elements of the temporary copy of dB, then stores the result in dA's scalar data.
  - 7. xor.r dA, dB
    - This instruction casts a temporary copy of dB to the dA's type and performs a bitwise XOR reduction of all the scalar data elements of the temporary copy of dB, then stores the result in dA's scalar data.
  - 8. Reserved for future expansion.

## 5.3 Group 2 Instructions

- Encoding: 0010 aaaa aabb bbbb iiii iiii iiii oooo
  - a: DLAR a
  - b: DLAR b
  - i: simm12 (sign-extended 12-bit immediate)
  - o: Opcode
- For these instructions, the value of cast (u64, dB.scalar\_data) + cast (s64, simm12) is used as the address to load from.
- Also, the type dA is set to is indicated in the instruction name, with, for example, ldu8i setting dA's type to u8.
- Instruction List:
  - 1. ldu8i dA, dB, simm12
  - 2. lds8i dA, dB, simm12
  - 3. ldu16i dA, dB, simm12
  - 4. lds16i dA, dB, simm12
  - 5. ldu32i dA, dB, simm12
  - 6. lds32i dA, dB, simm12
  - 7. ldu64i dA, dB, simm12
  - 8. lds64i dA, dB, simm12
  - 9. Reserved for future expansion.
  - 10. Reserved for future expansion.
  - 11. Reserved for future expansion.
  - 12. Reserved for future expansion.
  - 13. Reserved for future expansion.
  - 14. Reserved for future expansion.
  - 15. Reserved for future expansion.
  - 16. Reserved for future expansion.

#### 5.4 Group 3 Instructions

- Encoding: 0011 aaaa aabb bbbb iiii iiii iiii oooo
  - a: DLAR a
  - b: DLAR b
  - i: simm12 (sign-extended 12-bit immediate)
  - o: Opcode
- For these instructions, the value of cast (u64, align (dB.type, dB.addr))
  - + cast(s64, simm12) is used as the address to load from.
- Also, the type dA is set to is indicated in the instruction name, with, for example, ldu8 setting dA's type to u8.
- Instruction List:
  - 1. ldu8i dA, dB.addr, simm12
  - 2. lds8i dA, dB.addr, simm12
  - 3. ldu16i dA, dB.addr, simm12
  - 4. lds16i dA, dB.addr, simm12
  - 5. ldu32i dA, dB.addr, simm12
  - 6. lds32i dA, dB.addr, simm12
  - 7. ldu64i dA, dB.addr, simm12
  - 8. lds64i dA, dB.addr, simm12
  - 9. Reserved for future expansion.
  - 10. Reserved for future expansion.
  - 11. Reserved for future expansion.
  - 12. Reserved for future expansion.
  - 13. Reserved for future expansion.
  - 14. Reserved for future expansion.
  - 15. Reserved for future expansion.
  - 16. Reserved for future expansion.

## 5.5 Group 4 Instructions

- Encoding: 0100 aaaa aabb bbbb cccc cc00 0000 oooo
  - a: DLAR a
  - b: DLAR b
  - c: DLAR c
  - o: Opcode
- For these instructions, the value of cast (u64, dB.scalar\_data) + cast (u64, dC.scalar\_data) is used as the address to load from.
- Also, the type dA is set to is indicated in the instruction name, with, for example, ldu8 setting dA's type to u8.
- Instruction List:
  - 1. ldu8 dA, dB, dC
  - 2. lds8 dA, dB, dC
  - 3. ldu16 dA, dB, dC
  - 4. lds16 dA, dB, dC
  - 5. ldu32 dA, dB, dC
  - 6. lds32 dA, dB, dC
  - 7. ldu64 dA, dB, dC
  - 8. lds64 dA, dB, dC
  - 9. Reserved for future expansion.
  - 10. Reserved for future expansion.
  - 11. Reserved for future expansion.
  - 12. Reserved for future expansion.
  - 13. Reserved for future expansion.
  - 14. Reserved for future expansion.
  - 15. Reserved for future expansion.
  - 16. Reserved for future expansion.

## 5.6 Group 5 Instructions

- Encoding: 0101 aaaa aabb bbbb iiii iiii iiii oooo
  - a: DLAR a
  - b: DLAR b
  - i: simm12 (sign-extended 12-bit immediate)
  - o: Opcode
- For these instructions, the value of cast (u64, dB.scalar\_data) + cast (s64, simm12) is used as the address being stored to.
- Also, the type dA is set to is indicated in the instruction name, with, for example, stu8 setting dA's type to u8.
- Instruction List:
  - 1. stu8i dA, dB, simm12
  - 2. sts8i dA, dB, simm12
  - 3. stu16i dA, dB, simm12
  - 4. sts16i dA, dB, simm12
  - 5. stu32i dA, dB, simm12
  - 6. sts32i dA, dB, simm12
  - 7. stu64i dA, dB, simm12
  - 8. sts64i dA, dB, simm12
  - 9. Reserved for future expansion.
  - 10. Reserved for future expansion.
  - 11. Reserved for future expansion.
  - 12. Reserved for future expansion.
  - 13. Reserved for future expansion.
  - 14. Reserved for future expansion.
  - 15. Reserved for future expansion.
  - 16. Reserved for future expansion.

## 5.7 Group 6 Instructions

- Encoding: 0110 aaaa aabb bbbb 0000 0000 0000 oooo
  - a: DLAR a
  - b: DLAR b
  - o: Opcode
- For these instructions, the dB register's scalar data field is used.
- Instruction List:
  - 1. dpu8 dA, dB
    - This instruction casts (a temporary copy of) the scalar data of dB to the u8 type. The casted scalar data is then stored into every u8 vector element of dA. The type of dA is then changed to u8.
  - 2. dps8 dA, dB
    - This instruction casts (a temporary copy of) the scalar data of dB to the s8 type. The casted scalar data is then stored into every s8 vector element of dA. The type of dA is then changed to s8.
  - 3. dpu16 dA, dB
    - This instruction casts (a temporary copy of) the scalar data of dB to the u16 type. The casted scalar data is then stored into every u16 vector element of dA. The type of dA is then changed to u16.
  - 4. dps16 dA, dB
    - This instruction casts (a temporary copy of) the scalar data of dB to the s16 type. The casted scalar data is then stored into every s16 vector element of dA. The type of dA is then changed to s16.
  - 5. dpu32 dA, dB
    - This instruction casts (a temporary copy of) the scalar data of dB to the u64 type. The casted scalar data is then stored into every u32 vector element of dA. The type of dA is then changed to u32.
  - 6. dps32 dA, dB
    - This instruction casts (a temporary copy of) the scalar data of dB to the s64 type. The casted scalar data is then stored into every s32 vector element of dA. The type of dA is then changed to s32.
  - 7. dpu64 dA, dB
    - This instruction casts (a temporary copy of) the scalar data of dB to the u64 type. The casted scalar data is then stored into every u64 vector element of dA. The type of dA is then changed to u64.
  - 8. dps64 dA, dB

- This instruction casts (a temporary copy of) the scalar data of dB to the s64 type. The casted scalar data is then stored into every s64 vector element of dA. The type of dA is then changed to s64.
- 9. Reserved for future expansion.
- 10. Reserved for future expansion.
- 11. Reserved for future expansion.
- 12. Reserved for future expansion.
- 13. Reserved for future expansion.
- 14. Reserved for future expansion.
- 15. Reserved for future expansion.
- 16. Reserved for future expansion.

#### 5.8 Group 7 Instructions

- Encoding 0: 0111 aaaa aabb bbbb cccc cc00 000j jjjo
- Encoding 1: 0111 aaaa aabb bbbb iiii iiii iiij jjjo
  - a: ILAR a
  - b: ILAR b
  - c: DLAR c
  - i: isimm11 (11-bit immediate, left shifted by 2, then sign-extended)
  - j: jimm4, the number of consecutive ILARs past iA to fetch into.
  - o: Opcode
- For instructions using Encoding 0, the address to fetch from is computed with the following formula: cast(u64, align(u64, dC.scalar\_data)).
- For instructions using Encoding 1, the address to fetchi from is computed with the following formula: iB.addr + cast(u64, (isimm11 << 2)).
- Additionally, this and any other instructions with "fetch" in their names are the only way to fetch instructions from memory (or other ILARs) in this CPU's design. The instruction pipeline only automatically fetches instructions from ipc, with straight line code that overflows outside of ipc causing ipc to set its data field to that of the other ILARs that have the data from the destination. If no ILARs have the data from the destination, an exception is thrown.
- Instruction List:
  - 1. fetch iA, iB, dC, jimm4
    - This instruction uses Encoding 0.
  - 2. fetchi iA, iB, isimm11, jimm4
    - This instruction uses Encoding 1.

#### 5.9 Group 8 Instructions

- Encoding: 1000 aaaa aabb bbbb 0iii iijj jjjv oooo
  - a: DLAR a
  - b: ILAR b
  - i: iimm5 (5-bit immediate, left shifted by 2, then zero-extended)
  - j: jimm5 (5-bit immediate, left shifted by 2, then zero-extended)
  - v:
    - \* when 0b0: scalar operation (uses the scalar data of dA). The assembly syntax indicating a scalar operation simply adds ".s" to the instruction's name.
    - \* when 0b1: vector operation (uses the vector data of dA). The assembly syntax indicating a vector operation simply adds ".v" to the instruction's name.
  - o: Opcode
- These instructions use the scalar or vector data field of dA.
- Instruction List:
  - 1. jz dA, iB, iimm5
    - This instruction jumps to iB[iimm5 << 2] if the particular data field of dA is zero.
  - 2. jnz dA, iB, iimm5
    - This instruction jumps to iB[iimm5 << 2] if the particular data field of dA is non-zero.
  - 3. sel dA, iB, iimm5, jimm5
    - This instruction jumps to iB[iimm5 << 2] if the particular data field of dA is non-zero, otherwise to the address iB[jimm5 << 2].</li>
  - 4. Reserved for future expansion.
  - 5. retiz dA
    - This instruction returns from an interrupt if dA is zero, setting ie to 0b1.
    - This instruction causes an exception if the processor is in user mode.
  - 6. retinz dA
    - This instruction returns from an interrupt if dA is non-zero, setting ie to 0b1.
    - This instruction causes an exception if the processor is in user mode.
  - 7. retxz dA
    - This instruction returns from supervisor mode to user mode if dA is zero.

- This instruction causes an exception if the processor is in user mode.

#### 8. retxnz dA

- This instruction returns from supervisor mode to user mode if dA is non-zero.
- This instruction causes an exception if the processor is in user mode.
- 9. Reserved for future expansion.
- 10. Reserved for future expansion.
- 11. Reserved for future expansion.
- 12. Reserved for future expansion.
- 13. Reserved for future expansion.
- 14. Reserved for future expansion.
- 15. Reserved for future expansion.
- 16. Reserved for future expansion.

#### 5.10 Group 9 Instructions

- Encoding: 1001 aaaa aabb bbbb cccc ccii iiii Sooo
  - a: ILAR a or DLAR a
  - b: ILAR b or DLAR b
  - c: DLAR c
  - i: imm6 (zero-extended 6-bit immediate), amount of consecutive LARs to use
  - S:
    - \* when <code>ObO</code>: Destination LARs (the ones starting with <code>iA</code> or <code>dA</code>) or source LARs (the ones starting with <code>iB/dB</code> and, if the instruction uses it, <code>dC</code>) are supervisor mode LARs. An example of the syntax for <code>getaddrs</code> in this case is <code>getaddrs.U</code>. The ".U" suffix indicates this instruction will have the S bit set to <code>ObO</code>.
    - \* when <code>Obl</code>: Destination LARs (the ones starting with <code>iA</code> or <code>dA</code>) or source LARs (the ones starting with <code>iB/dB</code> and, if the instruction uses it, <code>dC</code>) are supervisor mode LARs. An example of the syntax for <code>getaddrs</code> in this case is <code>getaddrs</code>.S. The ".S" suffix indicates this instruction will have the S bit set to <code>Obl</code>.
    - \* Note: whether S applies to destination LARs or source LARs is indicated in the description of the instruction.
  - o: Opcode
- Instruction List:
  - 1. getaddrs dA, dB, imm6
    - This instruction uses the S bit to indicate which mode the source DLARs belong to.
    - This instruction grabs the addresses of source DLARs starting with dB and then also the following imm6 1 source DLARs. The grabbed addresses are then written into consecutive scalar data elements of destination DLARs (starting with dA and continuing into the following destination DLARs as necessary).
    - When supervisor mode LARs are used for the source(s), this instruction causes an exception if used in user mode.
  - 2. getaddrs dA, iB, imm6
    - This instruction uses the S bit to indicate which mode the source ILARs belong to.
    - This instruction grabs the addresses of source ILARs starting with dB and then also the following imm6 1 source DLARs. The grabbed addresses are then written into consecutive scalar data elements of destination DLARs (starting with dA and continuing into the following destination DLARs as necessary).

 When supervisor mode LARs are used for the source(s), this instruction causes an exception if used in user mode.

#### 3. gettypes dA, dB, imm6

- This instruction uses the S bit to indicate which mode the source DLARs belong to.
- This instruction grabs the types of source DLARs starting with dB and then also the following imm6 1 source DLARs. The grabbed types are then written into consecutive scalar data elements of destination DLARs (starting with dA and continuing into the following destination DLARs as necessary).
- When supervisor mode LARs are used for the source(s), this instruction causes an exception if used in user mode.
- 4. ldm dA, dB, dC, imm6
  - This instruction's name is short for "load multiple".
  - This instruction uses the S bit to indicate to which mode the destination DLARs belong.
  - This instruction uses addresses stored in the imm6 scalar data elements of consecutive DLARs (starting with dB) and types stored in the imm6 scalar data elements of consecutive DLARs (starting with dC).
     Multiple loads from memory are performed into the imm6 destination DLARs (starting with dA).
  - When supervisor mode LARs are used for the destination(s), this instruction causes an exception if used in user mode.
- 5. fetchm iA, dB, imm6
  - This instruction's name is short for "fetch multiple".
  - This instruction uses the S bit to indicate to which mode the destination DLARs belong.
  - This instruction uses addresses stored in the imm6 scalar data elements
    of consecutive DLARs (starting with dB). Multiple fetches from memory are performed into the imm6 destination ILARs (starting with iA).
  - When supervisor mode LARs are used for the destination(s), this instruction causes an exception if used in user mode.
- 6. reload dA, imm6
  - This instruction uses the S bit to indicate to which mode the operand DLARs belong.
  - This instruction forcibly re-loads the imm6 consecutive DLARs, starting with dA, from memory. The dirty flags of each of these DLARs' shared data elements are cleared.
  - When supervisor mode LARs are used for the operands, this instruction causes an exception if used in user mode.
- 7. reload iA, imm6

- This instruction uses the S bit to indicate to which mode the operand DLARs belong.
- This instruction forcibly re-loads the imm6 consecutive ILARs, starting with iA, from memory.
- When supervisor mode LARs are used for the operands, this instruction causes an exception if used in user mode.
- Note: This instruction is potentially useful for self-modifying code.

#### 8. flush dA, imm6

- This instruction uses the S bit to indicate to which mode the operand DLARs belong.
- This instruction forcibly flushes the imm6 consecutive DLARs, starting with dA, to memory. The dirty flags of each of these DLARs' shared data elements are cleared.
- When supervisor mode LARs are used for the operands, this instruction causes an exception if used in user mode.

## 5.11 Group 10 Instructions

- Encoding: 1010 aaaa aabb bbbb cccc ccdd dddd oooo
  - a: DLAR a
  - b: DLAR b
  - c: DLAR c
  - d: DLAR d
  - o: Opcode
- Instruction List:
  - 1. cpy.s dA, ie
    - This instruction copies ie to dA.scalar\_data.
  - 2. cpy.s ie, dA
    - This instruction copies dA.scalar data to ie.
  - 3. cpy.s dA, xct
    - This instruction copies xct to dA.scalar data.
  - 4. cpy.s xct, dA
    - This instruction copies dA.scalar data to xct.
  - 5. cpy.v dA, swiarg0
    - This instruction copies swiarg0 to dA.vector data.
  - 6. cpy.v swiarg0, dA
    - This instruction copies dA. vector data to swiarg0.
  - 7. cpy.v dA, swiarg1
    - This instruction copies swiarg1 to dA.vector data.
  - 8. cpy.v swiarg1, dA
    - This instruction copies dA. vector data to swiarg1.
  - 9. cpy.v dA, swiarg2
    - This instruction copies swiarg2 to dA.vector\_data.
  - 10. cpy.v swiarg2, dA
    - This instruction copies dA. vector data to swiarg2.
  - 11. cpy.v dA, swiarg3
    - This instruction copies swiarg3 to dA.vector data.
  - 12. cpy.v swiarg3, dA
    - This instruction copies dA. vector data to swiarg3.
  - 13. swi dA, dB, dC, dD
    - This instruction copies dA.vector\_data to to swiarg0.

- This instruction copies dB.vector\_data to to swiarg1.
- This instruction copies dC.vector\_data to to swiarg2.
- This instruction copies dD.vector data to to swiarg3.
- Lastly, this instruction performs a software interrupt, switching the processor to supervisor mode.
- Note that this instruction always causes an exception to occur.
- 14. Reserved for future expansion.
- 15. Reserved for future expansion.
- 16. Reserved for future expansion.

## 5.12 Group 11 Instructions

- Encoding: 1011 aaaa aabb bbbb cccc cc00 00vo oooo
  - a: DLAR a
  - b: DLAR b
  - c: DLAR c
  - v:
    - \* when 0b0: scalar operation. The assembly syntax indicating a scalar operation simply adds ".s" to the instruction's name.
    - \* when 0b1: vector operation. The assembly syntax indicating a vector operation simply adds ".v" to the instruction's name.
  - o: Opcode
- These instructions perform a read from/write to the IO address calculated by the following formula: cast(u64, dB.scalar\_data) + cast(u64, dC.scalar\_data).
- When a scalar operation is being performed, only the scalar data of dA is read into from/written out to IO space.
- When a vector operation is being performed, the entire vector data of dA is read into from/written out to IO space.
- Instruction List:
  - 1. inu8 dA, dB, dC
    - This instruction sets the type of dA to u8 before performing anything else of the operation.
    - This instruction reads from IO space and writes to dA.
  - 2. ins8 dA, dB, dC
    - This instruction sets the type of dA to s8 before performing anything else of the operation.
    - This instruction reads from IO space and writes to dA.
  - 3. inu16 dA, dB, dC
    - This instruction sets the type of dA to u16 before performing anything else of the operation.
    - This instruction reads from IO space and writes to dA.
  - 4. ins16 dA, dB, dC
    - This instruction sets the type of dA to s16 before performing anything else of the operation.
    - This instruction reads from IO space and writes to dA.
  - 5. inu32 dA, dB, dC

- This instruction sets the type of dA to u32 before performing anything else of the operation.
- This instruction reads from IO space and writes to dA.
- 6. ins32 dA, dB, dC
  - This instruction sets the type of dA to s32 before performing anything else of the operation.
  - This instruction reads from IO space and writes to dA.
- 7. inu64 dA, dB, dC
  - This instruction sets the type of dA to u64 before performing anything else of the operation.
  - This instruction reads from IO space and writes to dA.
- 8. ins64 dA, dB, dC
  - This instruction sets the type of dA to s64 before performing anything else of the operation.
  - This instruction reads from IO space and writes to dA.
- 9. Reserved for future expansion.
- 10. Reserved for future expansion.
- 11. Reserved for future expansion.
- 12. Reserved for future expansion.
- 13. Reserved for future expansion.
- 14. Reserved for future expansion.
- 15. Reserved for future expansion.
- 16. Reserved for future expansion.
- 17. outu8 dA, dB, dC
  - This instruction sets the type of dA to u8 before performing anything else of the operation.
  - This instruction writes to IO space and reads from dA.
- 18. outs8 dA, dB, dC
  - This instruction sets the type of dA to s8 before performing anything else of the operation.
  - This instruction writes to IO space and reads from dA.
- 19. outu16 dA, dB, dC
  - This instruction sets the type of dA to u16 before performing anything else of the operation.
  - This instruction writes to IO space and reads from dA.
- 20. outs16 dA, dB, dC
  - This instruction sets the type of dA to s16 before performing anything else of the operation.

- This instruction writes to IO space and reads from dA.
- 21. outu32 dA, dB, dC
  - This instruction sets the type of dA to u32 before performing anything else of the operation.
  - This instruction writes to IO space and reads from dA.
- 22. outs32 dA, dB, dC
  - This instruction sets the type of dA to s32 before performing anything else of the operation.
  - This instruction writes to IO space and reads from dA.
- 23. outu64 dA, dB, dC
  - This instruction sets the type of dA to u64 before performing anything else of the operation.
  - This instruction writes to IO space and reads from dA.
- 24. outs64 dA, dB, dC
  - This instruction sets the type of dA to s64 before performing anything else of the operation.
  - This instruction writes to IO space and reads from dA.
- 25. Reserved for future expansion.
- 26. Reserved for future expansion.
- 27. Reserved for future expansion.
- 28. Reserved for future expansion.
- 29. Reserved for future expansion.
- 30. Reserved for future expansion.
- 31. Reserved for future expansion.
- 32. Reserved for future expansion.

## 5.13 Group 12 Instructions

- Encoding: 1100 aaaa aaii iiii iiii iiii iiioo
  - a: DLAR a
  - i: simm20: (sign-extended 20-bit immediate)
  - o: Opcode
- Instruction List:
  - cpyi.s dA, simm20
    - \* This instruction copies the value of cast (dA.type, cast (s64, simm20)) to dA.scalar data.
  - addi.s dA, ipc.addr, simm20
    - \* This instruction copies the value of cast (dA.type, ipc.addr) + cast(dA.type, cast(s64, simm20)) to dA.scalar\_data.
  - Reserved for future expansion.
  - Reserved for future expansion.