tiny-asm: an assembler for riscv

jacob navia

# Contents

| Co | ontent | S              |                                      | 3  |
|----|--------|----------------|--------------------------------------|----|
| Li | st of  | $\Gamma$ ables |                                      | 5  |
| Li | st of  | Figures        |                                      | 7  |
| 1  | The    | RISC-          | V assembler                          | 9  |
|    | 1.1    | Introd         | $\operatorname{luction}$             | 9  |
|    |        | 1.1.1          | Requirements                         | 10 |
|    | 1.2    |                | ng tiny-asm                          | 11 |
|    | 1.3    | What           | is an assembler?                     | 11 |
|    | 1.4    |                | iew                                  | 12 |
|    | 1.5    | Gener          | al concepts and data structures      | 17 |
|    |        | 1.5.1          | Binary File Descriptor (bfd)         | 17 |
|    |        | 1.5.2          | Symbols                              | 18 |
|    |        |                | asymbol                              | 18 |
|    |        |                | symbolS                              | 19 |
|    |        |                | The symbol_flags structure           | 20 |
|    |        |                | local_symbol                         | 21 |
|    |        |                | Symbol table                         | 21 |
|    |        | 1.5.3          | Fragments                            | 22 |
|    |        | 1.5.4          | Fixups                               | 23 |
|    |        |                | Constructors                         | 23 |
|    |        |                | Applying a fixup                     | 23 |
|    |        | 1.5.5          | Relocations                          | 24 |
|    |        |                | Data structures for relocations      | 24 |
|    |        | 1.5.6          | Sections and subsections             | 27 |
|    | 1.6    |                | ction formats and encoding           | 27 |
|    | 1.7    |                | nstruction formats                   | 28 |
|    |        | 1.7.1          | The "R" format                       | 29 |
|    |        |                | Software handling                    | 30 |
|    |        | 1.7.2          | The "I" format                       | 30 |
|    |        |                | Software handling                    | 31 |
|    |        | 1.7.3          | The "U" format                       | 31 |
|    |        |                | Software handling                    | 32 |
|    |        | 1.7.4          | The "S" format                       | 32 |
|    |        | 1.7.5          | The "B" format                       | 33 |
|    |        | 1.7.6          | The "J" format                       | 34 |
|    | 1.8    |                | ompressed instructions               | 35 |
|    |        | 1.8.1          | The compressed register (CR) format  | 36 |
|    |        |                | The software side                    | 36 |
|    |        | 1.8.2          | The compressed immediate (CI) format | 37 |

4 Contents

|       | 1.8.3   | The stack relative store (CSS) format                                  |
|-------|---------|------------------------------------------------------------------------|
|       | 1.8.4   | The wide immediate (CIW) format                                        |
|       | 1.8.5   | The compressed load (CL) format                                        |
| 1.9   | The or  | ocode table                                                            |
|       |         | e detailed view of instruction encoding                                |
|       |         | g the object file                                                      |
| 1.11  |         |                                                                        |
| 1 10  |         | Write the object file                                                  |
| 1.12  |         | bler directives                                                        |
|       |         | .align, .p2align, p2alignw, p2alignl                                   |
|       |         | .ascii, .asciiz, .string, .string8, .string16, .string32, .string64 52 |
|       | 1.12.3  | .bss                                                                   |
|       | 1.12.4  | .byte, .dc, .dc.a, .dc.b, .dc.d, .dc.l, .dc.s, .dc.w, etc              |
|       | 1.12.5  | data                                                                   |
|       | 1.12.6  | Other data directives                                                  |
|       |         | debug, extern, format, lflags, name, noformat, spc, xref               |
|       |         | equ, equiv, eqv, set                                                   |
|       |         | globl                                                                  |
|       |         | Oattach to group                                                       |
|       |         | 0 1                                                                    |
|       |         | , , , , , , , , , , , , , , , , , , ,                                  |
|       |         | Phidden                                                                |
|       |         | m Bident                                                               |
|       |         | $4 \mathrm{insn}$                                                      |
|       | 1.12.15 | $6\mathrm{internal}$                                                   |
|       | 1.12.16 | Bloc                                                                   |
|       | 1.12.17 | 7 local                                                                |
|       | 1.12.18 | 3.option                                                               |
|       | 1.12.19 | Oorg                                                                   |
|       |         | ) protected                                                            |
|       |         | reloc                                                                  |
|       |         | Rtext                                                                  |
|       |         | 3 uleb $128$ , sleb $128$                                              |
|       |         | Other directives                                                       |
| 1 10  |         |                                                                        |
| 1.13  |         | i directives                                                           |
|       | 1.13.1  | Concepts                                                               |
|       |         | The CIE                                                                |
|       |         | The FDE                                                                |
|       |         | Software representation                                                |
|       | 1.13.2  | An example                                                             |
|       | 1.13.3  | cfi_sections                                                           |
|       | 1.13.4  | cfi_startproc                                                          |
|       |         | cfi_def_cfa_offset                                                     |
|       |         | cfi_offset                                                             |
|       |         | cfi_restore                                                            |
|       |         | cfi_def_cfa                                                            |
|       | 1.13.9  |                                                                        |
|       |         | — <u> </u>                                                             |
| 1 1 4 |         | cfi_remember_state and .cfi_restore_state                              |
| 1.14  |         | instructions                                                           |
|       | 1.14.1  | Loads, stores and addition                                             |
|       |         | Load and store instructions in short                                   |
|       |         | Addressing modes                                                       |
|       |         | Recognizing addressing modes                                           |
|       | 1.14.2  | Digression: assembler macros                                           |
|       |         | Subtraction                                                            |

| 1.14.4 Comparisons 1.14.5 Multiplication and Division  Multiplication  XuanTie-OpenC910  Division  1.14.6 Shifts 1.14.7 Control flow  Inconditional Jumps                                                                                                      | 88<br>89<br>89<br>90<br>91<br>91              |
|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------|
| 1.14.8 Conditional expressions         1.14.9 And, Or, Xor         1.14.10 Reading timers         Reading standard counters         1.14.11 CSR instructions         1.14.12 Boolean instructions         The Zbkb extension         1.14.13 Pause instruction | 92<br>92<br>92<br>93<br>93<br>93<br>95<br>96  |
| Comparisons  1.14.15 Atomic instructions  The aq and rl bits  Encodings                                                                                                                                                                                        |                                               |
| 1.16 Instructions specific to the Thead processor 1.17 Pseudo instructions                                                                                                                                                                                     | 104<br>106<br>110<br>112<br>113<br>146<br>147 |
| List of Tables                                                                                                                                                                                                                                                 | 153                                           |
| 1.1 RISCV symbolic register names 1.2 The different instruction formats 1.3 Encoding of conditional branches 1.4 Compressed register numbers 1.5 Compressed formats 1.6 Compressed store instructions with the CSS format 1.7 Opcode flags                     | 28<br>29<br>33<br>34<br>35<br>36<br>38<br>41  |

| 1.8  | Opcode arguments letters                          | 42       |
|------|---------------------------------------------------|----------|
| 1.8  | Opcode arguments letters                          | 43       |
| 1.9  | Accepted rounding modes for the 'm' parameter     | 43       |
| 1.10 | Compressed instruction types                      | 43       |
| 1.10 | Compressed instruction types                      | 44       |
|      | Common Information Entry fields                   | 70       |
|      | Common Information Entry fields                   | 71       |
|      | FDE fields                                        | 71       |
|      | FDE fields                                        | 72       |
|      | Standard load and store operations                | 84       |
|      | Macro letter arguments                            | 88       |
|      | Thead Multiplication extensions                   | 89       |
|      | Thead Multiplication extensions                   | 90       |
|      | Standard shift operations                         | 91       |
|      | Standard inconditional jumps                      | 91       |
|      | Standard conditional expressions                  | 92       |
|      | Standard boolean instructions                     | 92       |
|      | Counter reading                                   | 93       |
|      | Zbb boolean extension instructions                | 93<br>93 |
|      |                                                   |          |
|      | Zbb boolean extension instructions                | 94<br>95 |
|      | Zbb boolean extension instructions                |          |
|      | Zbkb instructions                                 | 95       |
|      | Zbkb instructions                                 | 96       |
|      | Format bits (Bits 25-26)                          | 97       |
|      | Rounding mode bits (Bits 12-14)                   | 97       |
|      | Rounding mode bits (Bits 12-14)                   | 98       |
|      | Floating point load/store instructions            | 98       |
|      | Floating point arithmetic instructions            | 98       |
|      | Floating point arithmetic instructions            | 99       |
|      | Floating point square root, min, max instructions | 99       |
| 1.30 | fclass results                                    | 99       |
| 1.30 | fclass results                                    | 100      |
| 1.31 | Floating point conversion instructions            | 100      |
| 1.32 | Move to/from integer registers                    | 100      |
| 1.33 | Floating point comparison instructions            | 101      |
| 1.34 | load reserved/store conditional instructions      | 101      |
| 1.35 | Atomic Memory Operation instructions              | 102      |
| 1.35 | Atomic Memory Operation instructions              | 103      |
| 1.36 | Atomic Memory Operation instructions              | 104      |
| 1.36 | Atomic Memory Operation instructions              | 105      |
| 1.36 | Atomic Memory Operation instructions              | 106      |
|      | Thead instructions                                | 106      |
|      | Thead instructions                                | 107      |
| 1.37 |                                                   | 108      |
| 1.37 | Thead instructions                                | 109      |
| 1.37 |                                                   | 110      |
|      | Pseudo instructions                               | 110      |
|      |                                                   | 111      |
| 2.50 |                                                   |          |

| List | f Figures | 7 |
|------|-----------|---|
|      | 9         |   |

| 1.38 Pseudo instructions                             | 112      |
|------------------------------------------------------|----------|
| List of Figures                                      |          |
| 1.1. Occasion of the consultant control flow         | 10       |
| 1.1 Overview of the assembler control flow           | 13<br>15 |
| 1.3 read_a_source_file function                      | 16       |
| 1.4 R Instruction layout                             | 30       |
| 1.5 I Instruction layout                             | 30       |
| 1.6 U Instruction layout                             | 32       |
| 1.7 S Instruction layout                             | 33       |
| 1.8 <b>B</b> Instruction layout                      | 33       |
| 1.9 J Instruction layout                             | 34       |
| 1.10 Compressed CR Instruction layout                | 36       |
| 1.11 Compressed immediate CI Instruction layout      | 37       |
| 1.12 Store to stack offset (CSS) instructions layout | 38       |

 1.17 Modified C910 R Instruction layout
 90

 1.18 Modified R Instruction layout
 97

 1.19 Atomic instructions layout
 102

40

 $\frac{45}{86}$ 

## 1 The RISC-V assembler

### 1.1 Introduction

The tiny assembler is a "digest" of the GNU gas assembler. I have extracted from the 1.3Gb of binutils-gdb source code<sup>1</sup> two files: asm.c and asm.h.

There are two goals here:

- 1. To produce a small and fast assembler to be used as a compiler back-end. The elimination of features proceeds according to this goal: assemble machine generated output, without consideration for any human user, since all input to the assembler is supposed to be machine generated.
- 2. To produce a minimal set of sources that is *easy to read and understand* so that people can hack away without a lengthy learning curve. This documentation also, contributes to this objective.

In this version of the tiny-assembler there isn't:

- No input pre-processing. No include files, nor any fancy macro processing.
- No fancy error messages, messages will be emitted only in english. If you want other language error output you are welcome to do it yourself. The rationale behind this is obviously that a high level language user, programming in C++ or C, will be completely unable to understand the assembler messages even if they are translated into his/her native language.
- This assembler is geared to the riscv CPU. All support for any other machine has been dropped, specially support for machines that have ceased to exist for more than 20 years: the Motorola 68000 family, the Sparc, the Z80, etc. I think that even gas could drop support for those machines also.
- The code has been cleaned up from all cruft like this:

```
/* The magic number BSD_FILL_SIZE_CROCK_4 is from BSD 4.2 VAX
* flavoured AS. The following bizarre behaviour is to be
* compatible with above. I guess they tried to take up to 8
* bytes from a 4-byte expression and they forgot to sign
* extend. */
#define BSD_FILL_SIZE_CROCK_4 (4)
```

So, we are still in 2023 keeping bug compatibility with an assembler for a machine that ceased production in 2000?

<sup>&</sup>lt;sup>1</sup>I have just done a du -b ./binutils-gdb Probably is a bit less since I didn't do an extensive search for only .c and .h files.

- All the indirection through macros that are expanded into members of function tables that makes the code impossible to follow are eliminated. Now, if you see code like foo(bar); it is highly likely that you are calling function foo with argument bar...
- All libraries are eliminated. Tiny-asm doesn't use BFD nor libiberty nor libopcodes. The only library used is zlib.
- There are only two files: asm.c and asm.h. No other include files are there, as far as I remember, excepting system includes like stdio.h of course.

I have avoided to put much code samples here. There only two source files, and if you want to see the exact code sequences you are free to look them up, it is not very difficult. I see no interest in filling pages with code.

## 111 Requirements

I have concentrated in explaining how things work, and that includes talking about specifications and the standards used. You should have:

1. Source code: If you want the official sources of the GNU assembler you should download the binutils-gdb package. It is available in many places, for instance in github binutils.

You can download the sources of the tiny-asm from: https://github.com/jacob-navia/tiny-asm.

- 2. Assembler user documentation in "Using as".

  https://sourceware.org/binutils/docs/as/ This is the official documentation for the Gnu Assembler. Tiny-asm has kept most of it, and the algorithms, names of functions and variables are almost always the same. Knowing what the user specifications are will help you understand what the different assembler directives are doing.
- 3. The RISC-V Instruction Set Manual Volume I: Unprivileged ISA. There are a lot of versions of this document in the internet. Please try the most recent that you can find, of course. The official sources of the documentation are in riscv-specs, but there are apparently more recent ones. There is a depository in github at github, but they are in a strange format called "adoc" that is difficult to find a translator for, in non-windows systems.
- 4. DWARF debug information standard, the most recent being DWARF5 (2017) at the DWARF standard. This will enable you to better understand the debug information (cfi) directives of the assembler.
- 5. The ELF (Executable and Link Format) standard has an official page in the linux foundation at

https://refspecs.linuxfoundation.org/elf/elf.pdf.

ELF is the object format standard followed by the assembler. This will help you understand the write\_object\_file better.

6. You should obviously have a riscv machine. If you don't use a simulator (slow) buy a cheap board that can run linux. The chinese propose several machines, like pine64. This is the machine I am using, for around 110 US\$. You can buy similar ones directly from the chinese, for instance

waveshare, or buy it from amazon.com, there are several boards available there. The Sifive company sells riscv boards also, but they are not interested in retail sales. Demands for price and availability go into the bit bucket unless you are a huge company with orders of several hundred boards probably. But you can always try at

#### https://www.sifive.com.

Recently, the Chinese company sipeed has released a clone of the raspberry pi using riscy. It is called Lichee pi 4A and comes in two versions: 8GB and 16GB RAM. It sells for around 150 euros at the "Ali-baba" Chinese retailer: ali-express. This machine is much faster than many other boards and features a processor with the vector instructions extension.

## 1.2 Building tiny-asm

The build process runs as follows:

- 1. Download the software from github
- 2. Build it:

```
$ gcc -o asm -g asm.c -lz
```

That is it. There is no Makefile but you can write one. I wrote this one:

```
star64:~/tiny-asm$ cat Makefile
asm: asm.o
gcc -o asm asm.o -g -lz
asm.o: asm.c asm.h
gcc -W -Wall -Wstrict-prototypes -Wmissing-prototypes\
-Wshadow -Wwrite-strings -g -c asm.c
clean:
rm -f asm.o asm
```

The Makefile for gas is 2268 lines... an impressing piece of software. However I think that 9 lines is much easier to understand. The user wants to use an assembler, maybe modify it, so there is no point in making him/her try to modify a 2 thousand line Makefile.

## 1.3 What is an assembler?

In these times of hype and exaggeration we speak about "artificial intelligence", of "programming without coding", and many other things that are designed to make us believe that we must buy the next gizmo.

Let me tell you one thing: All that a machine can do will be described here, in the assembler. And you will see no intelligence, no high level concepts, just extremely simple operations like adding a number to another, or reading and storing data in memory in small sized chunks.

Machines can't do anything else, and the smallest living being is many orders of magnitude more complex than these machines.

Why assembly language?

Because it is the only way to understand what is *really* happening inside a computer, the only way to pierce the hype from the abstractions of software and get into the last interface: the interface between the hardware interpreter and the software.

All computers are that: an interpreter. They receive encoded instructions and executes those, producing results that are written to memory.

This interface is called the "ISA" or "Instruction Set Architecture". It is the whole set of *instructions* i.e. small pieces of operations that are encoded in the wires of the machine and give software the set of operations it can perform.

The machine that tiny-asm encodes instructions for is called a RISC-V (risc five) ISA. It is a Reduced Instruction Set Computer, i.e it is a descendant of an idea of an IBM engineer. The first prototype computer to use reduced instruction set computer (RISC) architecture was designed by IBM researcher John Cocke and his team in the late 1970s. This idea is not new, but it has come to fruition lately. Almost all the smart phones in the world use an "Acorn Risc Machine" (ARM) architecture.

Assembly is not really a complicated language. Your learn it in a few words. It consists of instructions for the machine, and instructions for the assembler, also called directives. They are written one per line, in a simple format:

<instruction name> <arguments...>

where <arguments> is just a comma separated list.

All instructions in the riscv ISA have the first argument as the *destination* where the result of the operation is written to. The rest of the arguments are the inputs that the machine uses to perform the operation. For instance an addition needs two numbers as input and is written like this:

add t1,t2,t3

where those  $t_n$  are special memory locations called  $registers^2$ . The machine will take t2 and add to it t3 and put the result into the t1 register.

Now, a machine can't read. It can't read the "add" above. So, here comes "tiny-asm", the super-hero of our story. It will take the text above "add t1,t2,t3" and figure out that "add" is a legal operation that is present in its opcode table. It will take the *code* of the operation (a series of small numbers), it will put the codes of the register t1, t2, and t3 in the places the instruction format has setup for them, and will ship that to the growing program.

The integrated circuit will decode the instruction encoded by the assembler, separate the different parts, the destination register, the data sources, etc, and dispatch the whole to the specialized part within the CPU that handles the instruction.

Low level programming is difficult and sometimes tedious. You are just telling the machine to move this piece of data here, do that micro-operation and put the result somewhere. The *reason* why you are doing these operations is written nowhere. There are no classes nor structures, just nothing but micro-instructions and data movements. You get numbed down by the sheer size of things you have to write just to do something that you write in a single instruction in a high level language.

To avoid this, it is better to let the machine work for you. Just use a compiler that can output assembly language, and work using the program that the compiler gives you.

Of course do not use a high level of optimizations when you do that, because the code that the compiler will hand you will be completely unreadable.

And remember: The compiler has to output a program that is correct for all types of programs. You, when programming using tiny-asm, you have just to better *this* program, so you can do things that the compiler will never do in the general case.

Since you have the source code of the assembler, after reading this documentation you will be able to understand how it works and modify it as you like.

Since the assembler is at the base of everything, many subjects are discussed here: cryptographic operations, bit operations, instruction formats, object file formats, debug information format, etc. Each of those subjects would need a book of its own. Do not expect an in-depth treatment of any of those subjects. They will be discussed with the point of view of the assembler, nothing more.

#### 14 Overview

Like all assemblers, this assembler has a **parser**, where the text of the input file is converted into logical units that represent either instructions for the machine, or for the assembler

<sup>&</sup>lt;sup>2</sup>See table §1.1 page 28

1.4. Overview

itself, called *pseudo instructions*, and an **encoder**, where the instruction and its arguments are encoded into a 32 or 16 bit instruction and added to the current fragment. And then there is the object file generation, where the instructions and associated information are packed into the ELF format.



Figure 1.1: Overview of the assembler control flow

In figure 1.1 (page 13) we have these three main parts. Please keep in mind that this is a high level abstraction of the control flow. Obviously, if we would put each statement in the diagram we would have to cram 40 000 lines into a diagram... too much.

We start with main that organizes all three parts <sup>3</sup>. It calls the initialization, gas\_init, that initializes the symbols (symbol\_begin,),the fragments initialization, the sub-segments, etc

"Fragments" are understood in the assembler as pieces of code already assembled but that can grow, getting new instructions or other data. They are of variable length, and they will be strung together in a process called "relaxation" at the end of the assembly.

The initialization of the "sub-segments" means the text, data, and bss sections are created. Are "sub-segments" just plain object file sections? Not quite. There are "sections" like the "ABS" (absolute) section or the "UND" (undefined) sections that will never be written out in the object file.

There are other initializations that give us the opportunity of explaining some concepts that will be important later on. The  $eh\_begin$  function, for instance, initializes the "exception handling" stuff. This is a complicated system that allows languages like C++ to walk the stack at run time, searching for a handler that will accept handling the exception that has just occurred.

This process involves an impressive machinery that contains a set of tables that associate addresses in the code to descriptions of the stack contents that allow a debugger or a runtime interpreter to see what functions have in terms of local variables and the space that each stack frame uses in the stack. And even if you are programming in C and you do not have any need for exceptions you will get them anyway since your C code could be called from a C++ program.

Other initializations concerns the start of the dwarf2 debug information generation. Yes, the assembler can emit debug information for the program it is assembling. This way, the assembly programmer can follow the program line by line. tiny-asm has kept this even if it is highly unlikely that the compiler, that emits its own and much richer debug information, will need this.

The initialization of the "dot symbol" needs also some explaining. The current location when assembling a program is called "dot", i.e. a point. This symbol is always associated with the current address following a long assembler tradition that goes back to the start of the micro-computer age.

Eventually we come to the perform\_an\_assembly\_pass function. This one continues the initialization process by creating the standard sections of the object file:

- The text section. This is a misnomer since there isn't anything textual inside. It contains the binary codes that will be interpreted by the integrated circuit. This is the most important output of the whole assembly process.
- The data section. This contains the tables, constants, structures and everything that the programmer has defined as static data that will be loaded at the start of the program by the program loader.
- The BSS section that contains nothing. It is just a reserved memory space that will be allocated by the program loader when it loads the program and contains always zeroes at the start.
- There are many other sections in an ELF format file. Let's stop here.

Then, we finish the setup process by calling md\_begin and elf\_begin functions.

<sup>&</sup>lt;sup>3</sup>Please be aware that in the diagram there is a direct link between, for instance, the function dot\_symbol\_init and perform\_an\_assembly\_pass. This does NOT mean that the first calls the second directly. It means that the flow of the program returns to gas\_init and then returns to the main function, and it is main function that calls perform\_an\_assembly\_pass.

That would be quite complicated to draw, however. So, the diagram simplifies this.

1.4. Overview 15

The md\_begin function reads all the static tables and builds hash tables from the for fast access. The opcodes are stored in hash tables, together with other data like the register names, the Control and Status Registers (CSRs) and what have you.

The elf\_begin function builds symbols for each section in the object file. This allows to emit relocations or symbol addresses as an offset from the start of the section.

The setup phase behind us, we start the real work of the assembler: the well named read\_a\_source\_file. This function does the parsing and the encoding of the instructions and directives.

In the diagram below, the functions aren't shown with their actual names but with their functional description. The GAS developers took (as you can see) a lot of effort to choose clear names that describe quite well what each function is doing. Still, I thought that here we will use functional boxes instead of function names, since some of the functions described here do not exist as a separated subroutine but they are just pieces of read\_a\_source\_file.



Figure 1.2: A more detailed view of the parser

We assume that the assembler input is a single file containing instructions, data, and



Figure 1.3: read\_a\_source\_file function

assembler directives. In this version of the assembler, parsing is reduced to a bare minimum since we assume that we are assembling compiler output, and all the sophistication that is needed for an assembler adapted to human use is not needed for an assembler that is used to parse machine output.

We start with the function read\_a\_source\_file that organizes the parsing and the instruction generation<sup>4</sup>.

- 1. Setup. Here, we setup the input file name, in variable physical\_input\_file and we care about writing a file name record if we are emitting debug information.
- 2. We read a chunk of the input file. Currently, BUFFER\_SIZE is set to 256 \* 1024, and can be changed just by editing the corresponding line in asm.h
- 3. We start parsing lines. The first thing we read should be a symbol. If it ends with a colon, it is a label definition. We call the corresponding function colon() and continue parsing. If it is not finished by a colon, we see if the first letter is a point. If it is, it is an assembler directive. We call the corresponding function stored in the pseudoops structure (called pseudo\_typeS) and we go fishing for the next line. If it is not a pseudo-operation, it must be a machine instruction. We call the md\_assemble function.

The md\_assemble function does basically following things:

1. Test if the instruction is valid using the current set of RISCV specifications. There are instructions that can be issued only with 64 or even 128 bits, or floating point instructions that depend on floating point being implemented in hardware, etc. RISCV

<sup>&</sup>lt;sup>4</sup>Actually, the initialization phase is executed before, but we will abstract that away for the time being

machines can have a number of extensions implemented, since the basic ISA (Instruction Set Architecture) doesn't even have multiplication or division!

Each "extension" has a letter that characterizes it. For instance, in the machine I am using we have in /proc/cpuinfo a line with:

#### isa : rv64imafdc

This means that the machine is a risc V 64 bits machine (rv64), with the integer (i), multiplication (m), a (Atomic), f (single precision floating point), d (double precision floating point) and c (Compressed instructions in 16 bits) extensions. The assembler should test if any instruction is legal in the current subset, and reject those that do not comply.

Since we are an assembler for reading compiler output, we just assume the compiler doesn't emit wrong instructions and skip this test.

- 2. We call the riscv\_ip function to encode the instruction. Basically it uses the args character string to know what arguments are expected. It verifies that those are correct, and inserts all necessary bits at the required positions. We will see later how these formats are defined.
- 3. If assembly succeeded the new instruction is added to the current fragment.

The riscv\_ip function is basically a huge switch statement. The function will go through each one of the characters present in the args string of the opcode and add the necessary bits to the instruction.

In the tables below you will find the description of the different formats defined for each of the instructions in a riscv machine. This tables will help you understand how riscv\_ip works.

## 1.5 General concepts and data structures

## 1.5.1 Binary File Descriptor (bfd)

This structure is at the heart of the BFD library. In the context of an assembler, there is only one of these beasts around, called stdoutput.

The main things stored here are:

- The file name.
- A table of function pointers that dispatches to the back end for doing most of the work.
- The input output stream
- A pointer to the back-end private data.

Of course there are a lot of other fields that you can study by reading the definition of this structure in asm.h.

It is important to underscore here that the table of function pointers has been completely eliminated in tiny-asm. There are no more indirection through the xvec field, since tiny-asm will only assemble riscv instructions. The front end and the back end have been merged into a monolithic whole. Still, this design is essential for understanding gas.

## 1.5.2 Symbols

There are several types of different structures that together represent a symbol. They will be described below, but in general they reflect the need by the bfd library to make a back-end independent structure that holds some general information, a high level abstraction of a symbol. Back-ends can differ in the object format, and in the cpu used, so the information that is common to all those very different contexts is rather minimal.



## asymbol

This is the bfd-internal format, holding the following things:

- the\_bfd. This points to the bfd that this symbol refers to. Since under tiny-asm there is only one bfd, called stdoutput, this is redundant. In other contexts, for instance in the linker where there are a lot of binary files opened for reading and one for writing, this makes much more sense.
- Name The name of the symbol.
- Value. Here there is either a pointer to some other symbol, or a numeric value.
- Flags. A long list of different flags. Some of them aren't used in tiny-asm, but their definition is still there since they are used in the linker.
- A pointer to the section.
- A pointer to special data used by the back end. It is a union of a generic pointer and an address.

To access the fields of an <code>asymbol</code> inline functions with rather lengthy names are provided. These functions look like this:

```
1 static inline asection * bfd_asymbol_section (const asymbol *sy)
2 {
3     return sy \rightarrow section;
4 }
```

This function accesses the  $section field^5$ 

<sup>&</sup>lt;sup>5</sup>Is this really necessary? What are the real advantages of using bfd\_asymbol\_section(foo) instead of foo section?

Yes, I know. It is called "information hiding". But the problem is that information hiding **hides** information, precisely, and if you are trying to understand what the code is doing, you do not know beforehand if that is a call to a lengthy function or just a field access.

asymbols are global, and used for data that concerns a whole section. The constructor for these objects is elf\_make\_empty\_symbol

## symbolS

This structure is used for all kinds of symbols (labels, functions) that the assembler extracts from the source code. In the symbol table, we find pointers to this one and to the local\_symbol structures. The size of this should be the same or less than struct local\_symbol, and fields that do not fit in that size go into an overflow structure called xsymbol for exension of symbol.

```
Listing 1.1: symbolS, xsymbol, elf obj sy
1 typedef struct symbol {
      struct symbol_flags flags; /* Symbol flags. */
2
                                 /* Hash value calculated from name. */
      hashval_t hash;
3
                                 /* The symbol name. */
      const char *name:
4
      fragS *frag;
                    /* Pointer to the frag of this symbol, if any. Otherwise NULL.*/
      asymbol
                    *bsym;
                                 /* BFD symbol */
      struct xsymbol *x;
                                 /* Extra symbol fields that won't fit. */
8 } symbolS;
10 /* Extra fields to make up a full symbol. */
11 struct xsymbol {
      expressionS value; /* Symbol value. Note that this is NOT a pointer */
12
      /* Forwards and backwards chain pointers. */
13
      struct symbol *next;
14
      struct symbol *previous;
15
      struct elf_obj_sy obj; /* Yet another symbol structure (YASS!) */
16
17 };
19 /* Additional information we keep for each symbol. */
20 struct elf_obj_sy {
      unsigned int local: 1; /* Whether the symbol has been marked as local. */
^{21}
      unsigned int rename : 1; /* Whether the symbol has been marked for rename with
22
          000. */
      unsigned int bad_version:1; /* Whether the symbol has a bad version name. */
23
      /* Whether visibility of the symbol should be changed. */
24
      ENUM_BITFIELD (elf_visibility) visibility : 2;
25
      /* Keep track of .size expressions that involve yet unresolved differences */
26
27
      expressionS *size;
      /* The list of names specified by the .symver directive. */
      struct elf_versioned_name_list *versioned_name;
29
30 };
      The constructors for symbols are:
      • symbol_make. This constructor is simple, code below:
            static symbolS *symbol_make(const char *name)
               /* Let the machine description default it, e.g. for register names. */
               symbolS *symbolP = md_undefined_symbol((char *)name);
      5
               if (!symbolP) symbolP = symbol_new(name,undefined_section,&
                   zero_address_frag,0);
               return (symbolP);
      6
            }
```

So, if you read this you would think that first, as the commentary says, is calling to some function... Actually, for the riscv backend, we have a #define in asm.h:

```
#define_md_undefined_symbol(name)_(0) symbol_make is just an alias for symbol_new.
```

- symbol\_create. This function allocates space for the new symbol, sets some default fields, and then calls symbol\\_init that will finish the construction of the new symbol.
- symbol\_new. This is a small function that calls symbol\_create and then links the new symbol into the global list of symbols using the function symbol\_append.
- symbol\_find\_or\_make(const char \*name). This function searches for a symbol and if not found creates an undefined symbol, returning a pointer to it. When creating a symbol, it checks if it is a local symbol. Then either calls the constructor for a local or a true symbol.

In the symbol table, full fledged symbols or local symbols appear. The distinction between them is that for many symbols like labels, or similar, all the huge amount of information described above make no sense. A shorter and smaller structure is used, what makes considerable gains in memory space.

#### The symbol\_flags structure

Flags that annotate an object are normally defined as #defines for instance

```
1 #define FLAG_F00 1
2 #define FLAGG _BAR 2
```

But there are better ways, for instance by defining a structure that can be accessed in a much clearer way.

## Listing 1.2: The symbol flags structure

```
struct symbol_flags {
   unsigned int local_symbol:1; /* Whether the symbol is a local_symbol or not. */
   unsigned int written:1;
                             /* Whether symbol has been written. */
   /* Whether symbol value has been completely resolved (used during final pass
    * over symbol table). */
   unsigned int resolved:1;
   /* Whether the symbol value is currently being resolved (used to detect loops
    * in symbol dependencies). */
   unsigned int resolving:1;
   /* Whether the symbol value is used in a reloc. This is used to ensure that
    * symbols used in relocs are written out, even if they are local and would
    * otherwise not be. */
   unsigned int used_in_reloc:1;
   /* Whether the symbol is used as an operand or in an expression. */
   unsigned int used:1;
   unsigned int volatil:1; /* Whether the symbol can be re-defined. */
   /* * Whether the symbol is a forward reference, and whether such has been
    * determined. */
   unsigned int forward_ref:1;
   unsigned int forward_resolved:1;
   /* This is set if the symbol is set with a .weakref directive. */
   unsigned int weakrefr:1;
   /* This is set when the symbol is referenced as part of a .weakref
    * directive, but only if the symbol was not in the symbol table
```

```
* before. It is cleared as soon as any direct reference to the symbol
       * is present. */
      unsigned int weakrefd:1;
      /* Whether the symbol has been marked to be removed by a .symver
       * directive. */
      unsigned int removed:1;
      /* Set when a warning about the symbol containing multibyte characters
       * is generated. */
      unsigned int multibyte_warned:1;
  };
  Many of these fields are never used directly. The field used_in_reloc, for instance, is never
  accessed directly. The reason is clear when you look at the code for its access functions:
1 /* Mark a symbol as having been used in a reloc. */
2 static void symbol_mark_used_in_reloc(symbolS * s)
3 {
      if (s \rightarrow flags.local\_symbol)
          s = local_symbol_convert(s);
6
      s \rightarrow flags.used_in_reloc = 1;
7 }
8 /* Return if a symbol has been used in a relocation */
9 static int symbol_used_in_reloc_p(symbolS * s)/* Return whether a symbol has been
       used in a reloc. */
10 €
      if (s \rightarrow flags.local\_symbol)
          return 0; /* Local symbols can't be used in relocations */
      return s→flags.used_in_reloc;
14 }
```

#### local symbol

11

12

13

This structure can represent only symbols with offsets within the same frag.

## Listing 1.3: local symbol

```
1 struct local_symbol {
     struct symbol_flags flags; /* Flags: Only local_symbol and resolved relevant.*/
     hashval_t hash;
                            /* Hash value calculated from name. */
3
                             /* The symbol name. */
     const char *name;
                 *frag;
                            /* The symbol frag. */
     fragS
    asection *section; /* The symbol section. */
                            /* The value of the symbol. */
     valueT
              value;
8 };
```

Constructor for the local\_symbol structure is the function local\_symbol\_make.

## Symbol table

```
Listing 1.4: union symbol entry t
1 /* This structure makes up the entries of the symbol table */
2 typedef union symbol_entry {
     struct local_symbol lsy;
     struct symbol sy;
5 } symbol_entry_t;
```

The symbol table is a hash table called sy\_hash, created at initialization in the function symbol\_begin called from gas\_init.

Adding symbols into the symbol table is done with symbol\_table\_insert, the function symbol\_find searches for a given symbol.

## 1.5.3 Fragments

As the assembler reads the source file, it stores the information it parses into frags. These structures contain two parts: a fixed part, i.e. the bytes that are known to be emitted, and a variable part, whose length can change. Memory is allocated with a worst case philosophy: the maximum size of the variable part is known, and will be used when reserving memory for each fragment.

When the source file has been fully parsed the relaxation process will go through all these fragments and build the linear structures needed in the object file.

Listing 1.5: struct frag

```
1 struct frag {
      /* Object file address. This is zero until relaxation writes it. */
2
      addressT fr_address;
3
      /* When relaxing multiple times, remember the address the frag had in the last
         relax pass. */
      addressT last_fr_address;
      /* (Fixed) number of octets we know we have. May be 0. */
      valueT fr_fix;
      /* May be used for (Variable) number of octets after above.
10
      The generic frag handling code no longer makes any use of fr_var. */
1.1
      offsetT fr_var;
12
      /* For variable-length tail. */
13
      offsetT fr_offset;
14
      /* For variable-length tail. Points to the symbol to use according to fr_type */
15
      symbolS *fr_symbol;
      /* Points to opcode low addr byte, for relaxation. */
17
18
      char *fr_opcode;
19
      /* Chain forward; ascending address order. Rooted in frch_root. */
20
      struct frag *fr_next;
21
22
      /* Where the frag was created, or where it became a variant frag. */
23
24
      const char *fr_file;
      unsigned int fr_line;
25
26
      /* A serial number for a sequence of frags having at most one alignment
27
         or org frag, and that at the tail of the sequence. */
28
      unsigned int region:16;
29
30
      /* Flipped each relax pass so we can easily determine whether fr_address
31
         has been adjusted. */
32
      unsigned int relax_marker:1;
33
34
      /* Used to ensure that all insns are emitted on proper address boundaries. */
35
      unsigned int has_code:1;
36
      unsigned int insn_addr:6;
37
38
      /* This field indicates the interpretation of fr_offset, fr_symbol and the
39
       * variable-length tail of the frag, as well as the treatment it gets in various
40
```

```
* phases of processing. It does not affect the initial fr_fix characters;
41
       * they are always supposed to be output verbatim (fixups aside). */
42
      relax_stateT fr_type;
43
      /* This is used as a size for this frag and is updated during relaxation */
44
45
      relax_substateT fr_subtype;
46
      /* A simple structure (see below) that is used when creating a mapping symbol and
47
            when.
         checking if the mapping symbol is still useful. Mapiing symbols are create
48
             when
         transitioning from a given mapping state to another */
49
      struct riscv_frag_type tc_frag_data;
50
      /* Data begins here. */
52
53
      char fr_literal[1];
54 };
55
56 struct riscv_frag_type { symbolS *first_map_symbol, *last_map_symbol; };
```

## 1.5.4 Fixups

In many situations, the assembler can't finish a calculation because all data needed for it isn't available. For instance a symbol is yet unresolved, or the exact location for some instruction component is absent.

In those situations the assembler emits a fixup. This is nothing else than an instruction on how to patch the output later, when all the data is known.

Fixups are described in a structure called fixS that holds mainly following kinds of information:

- next The fixS structures are linked in a list.
- fx\_frag The fragment where the fix should be applied.
- fx\_where The position within that fragment where the fix should be applied.
- The quantity to be added or subtracted. If it is a symbol, a pointer to that symbol will be stored in the fields fx\_addsy or fx\_subsy. Otherwise, if it is just a number it will stored in the field fx\_offset.
- fx\_size. The size (in bytes) of the fixup, i.e. how many bytes should be written at the given location.
- There are many other fields that you can look up in the definition in asm.h. They are described in the comments surrounding their definition.

#### Constructors

Two functions build a fixup: fix\_new and fix\_new\_exp. The second one is for a fixup referring to an expression, the first is for a symbol with an optional offset. They differ only in that fix\_new\_exp determines the symbol to add or subtract from the given expression. Both call fix\_new\_internal to do the actual fix.

## Applying a fixup

A fixup is resolved by the function md\_apply\_fix. It uses the type of fixup to determine the sequence of actions to be performed: to fix the high 20 bits of a 32 bit address, or the lower 12, or add to a 64 bit address an addend, etc. The code consists (yes, you guessed it!) of a big switch statement with all the handled types of relocation existing for riscv machines, and it is not very difficult to follow.

#### 1.5.5 Relocations

Sometimes a fixup can't be resolved. For instance this C code:

```
1 #include <stdio.h>
2 int main(void) {
     printf("hello\n");
4 }
     gcc translates this to:
      .section
                .rodata
1
2 .LCO:
     .string "hello"
3
4
     .text
5 main:
      /* irrelevant stuff ellided */
     lla a0,.LCO
     call
            puts@plt
      /* further stuff ellided */
```

The address of the puts procedure can't be established by the assembler, nor the linker, only by the program loader that will know at load time the address of the shared library libc6.so. The assembler makes the same thing as when establishing a fixup. It makes a new fixup, this time for the linker, that will tell it where the address of the puts function needs to be stored.

This kind of fixup is called a relocation.

The linker can't resolve the address either, so it will make a relocation for the program loader, that will patch the code accordingly when the program starts<sup>6</sup>.

Relocations, contrary to simple fixups have a standard format prescribed in the object file format, in our case ELF.

Listing 1.6: Elf relocation structure

```
1 typedef struct {
2    unsigned char r_offset[8]; /* Location at which to apply the action */
3    unsigned char r_info[8]; /* index and type of relocation */
4    unsigned char r_addend[8]; /* Constant addend used to compute value */
5 } Elf64_External_Rela;
```

This format doesn't exactly correspond to the internal one used by the assembler. The function bfd\_elf64\_swap\_reloca\_out converts from the bfd format to the ELF one.

#### Data structures for relocations

The central structure for relocations is the howto\_table, that describes in detail the types of relocation and how they are handled. It is a table of structures defined by the following type:

```
1 struct reloc_howto_struct {
2     /* Contains the relocation type according to the riscv standard */
3     unsigned int type;
4
5     /* The size of the item to be relocated in bytes. */
6     unsigned int size:4;
7
8     /* The number of bits in the field to be relocated. This is used
```

 $<sup>^6</sup>$ The process is obviously much more complicated. Here we leave all the details out, to take a high level view.

```
when doing overflow checking. */
      unsigned int bitsize:7;
10
11
      /* The value the final relocation is shifted right by. This drops
12
      unwanted data from the relocation. */
13
      unsigned int rightshift:6;
14
1.5
      /* The bit position of the reloc value in the destination.
16
      The relocated value is left shifted by this amount. */
17
      unsigned int bitpos:6;
18
19
      /* What type of overflow error should be checked for when
20
21
      ENUM_BITFIELD (complain_overflow) complain_on_overflow:2;
22
23
      /* The relocation value should be negated before applying. */
24
      unsigned int negate:1;
25
26
      /* The relocation is relative to the item being relocated. */
27
      unsigned int pc_relative:1;
28
29
      /* This field is true only in 3 relocation types that refer to thread storage.
30
         They are: R_RISCV_TLS_DTPREL32, R_RISCV_TLS_DTPREL64, and
         R_RISCV_TPREL_HI20 */
      unsigned int partial_inplace:1;
```

The original comment above this field read like this

Some formats record a relocation addend in the section contents rather than with the relocation.

This means that the place for the relocation can be filled either with zero bits or with a number to which the relocation is added.

For ELF formats this is the distinction between USE\_REL and USE\_RELA (though the code checks for USE\_REL == 1/0). The value of this field is TRUE if the addend is recorded with the section contents; when performing a partial link (ld -r) the section contents (the data) will be modified. The value of this field is FALSE if addends are recorded with the relocation (in arelent addend); when performing a partial link the relocation will be modified. All relocations for all ELF USE\_RELA targets should set this field to FALSE (values of TRUE should be looked on with suspicion). However, the converse is not true: not all relocations of all ELF USE\_REL targets set this field to TRUE. Why this is so is peculiar to each particular target. For relocs that aren't used in partial links (e.g. GOT stuff) it doesn't matter what this is set to.

I have read many times that, trying to make sense of it. In tiny-asm, all relocations are reloca. Still, in the code this field is used, and some relocations, specifically the thread local storage relocations, do use this field.

```
/* When some formats create PC relative instructions, they leave
the value of the pc of the place being relocated in the offset
slot of the instruction, so that a PC relative relocation can
be made just by adding in an ordinary offset (e.g., sun3 a.out).
Some formats leave the displacement part of an instruction
empty (e.g., ELF); this flag signals the fact. */
unsigned int pcrel_offset:1;
```

```
42
      /* This field is not used in tiny-asm */
      unsigned int install_addend:1;
44
45
      /* src_mask selects the part of the instruction (or data) to be used
46
      in the relocation sum. If the target relocations don't have an
47
      addend in the reloc, eg. ELF \it USE\_REL, \it src\_mask will normally equal
48
      dst_mask to extract the addend from the section contents. If
49
      relocations do have an addend in the reloc, eg. ELF USE_RELA, this
50
      field should normally be zero. Non-zero values for ELF USE_RELA
51
      targets should be viewed with suspicion as normally the value in
52
      the dst_mask part of the section contents should be ignored. */
53
      bfd_vma src_mask;
      /* dst_mask selects which parts of the instruction (or data) are
56
      replaced with a relocated value. */
57
      bfd_vma dst_mask;
58
59
      /* If this is non null, then, the supplied function is called rather than the
60
      normal one. Under tiny-asm, this field is never NULL. Three functions are used.
61
      See below for further expplanations. */
62
      bfd_reloc_status_type (*special_function)
63
      (bfd *, arelent *, struct bfd_symbol *, void *, asection *,
      bfd *, char **);
      /* The textual name of the relocation type. */
67
      const char *name;
68
69 };
```

The special\_function field has 3 possible values:

1. bfd\_elf\_generic\_reloc. This function adds the input section position if the symbol meets certain conditions:

The output\_bfd variable is never NULL since it is always stdoutput.

- 2. riscv\_elf\_add\_sub\_reloc. This function does exactly the same tests that the preceding one, and returns exactly the same values. They differ in that this one has a lot of code for the case when the output\_bfd is NULL, what could be the case in the linker, but not in the assembler.
- 3. riscv\_elf\_ignore\_reloc. Beware of these explicit names! This function doesn't ignore the relocation at all, but does a

reloc\_entry->address += input\_section->output\_offset; without any tests at all.  $^7$ 

<sup>&</sup>lt;sup>7</sup> After an enquiry in the group binutils, Alain Modra answered:

#### 1.5.6 Sections and subsections

Assembled data falls into four sections: opcodes, initialized data, uninitialized data and debug information. You may have separate groups of data in those sections that you want to end up near to each other in the object file, even though they are not contiguous in the assembler source.

Tiny-asm allows you to use subsections for this purpose. Within each section, there can be numbered subsections with values from 0 to 8191  $^8$ .

Objects assembled into the same subsection go into the object file together with other objects in the same subsection. For example, a compiler might want to store constants in the text section, but might not want to have them interspersed with the program being assembled. In this case, the compiler could issue a .text 0 before each section of code being output, and a .text 1 before each group of constants being output.

Subsections are optional. If you do not use subsections, everything goes in subsection number zero.

Each subsection is zero-padded up to a multiple of four bytes.

Subsections appear in your object file in numeric order, lowest numbered to highest. The object file contains no representation of subsections; ld, objdump and other programs that manipulate object files see no trace of them. They just see all your text subsections as a single text section, and all your data subsections as a data section.

To specify which subsection you want subsequent statements assembled into, use a numeric argument to specify it, in a .text <number> or a .data <number> statement. If you just say .text then an implicit zero is assumed. Likewise .data means .data 0.

In the source code, sometimes subsections are called "subsegments".

## 1.6 Instruction formats and encoding

Yes, there are several parts in an assembler, but there is a fundamental part that makes the purpose of the whole program: **encoding instructions**. The essential part is here: transforming ASCII text representing instructions into a series of 16 or 32 bit sequences that encode each operation that the machine can do, including operations that are seldom, if ever, used.

To understand how the assembler works, it is important to keep in mind how the machine works, the names of its parts, and the intricacies of instruction encoding. Yes, yes, that looks awfully dry and uninteresting. But (to me) it is interesting, and if you do not like to understand how things work, please go to tik-tok and play some games...

There are several types of instruction encoding, named R, I, S, B, U, J.

- All are 32 bits, like the ARM.
- The first 7 bits are reserved for the opcode (bits 0 to 6).
- The same operand, for instance the source register 1 (sr1) is at the same position, bits 15 to 19.
- All instructions have at least one register operand.
- Since we have 32 registers, all register encoding take 5 bits.



There is nothing wrong with the name. No relocation of section contents is done. The only change made here is for ld -r to keep the relocation associated with the same location in the output as it was in the input.

That said, I think it would be better for the R\_RISCV\_SET\_ULEB128, and R\_RISCV\_SUB\_ULEB128, howto to use a special function something like ppc64\_elf\_unhandled\_reloc.

<sup>&</sup>lt;sup>8</sup>This limit is mentioned in the GAS documentation. In the software, actually, there isn't a single test to enforce this limit, so you can write any number between 1 and MAX INT.

The risc v introduces a more functional naming schema, where registers are assigned usage names, instead of the register numbers. Here is a correspondence table between them:

Table 1.1: RISCV symbolic register names

|         |       | integer .            | Cerptors |       |                     |
|---------|-------|----------------------|----------|-------|---------------------|
| x0      | zero  | Hard-wired zero      | x16      | a6    | Seventh argument    |
| x1      | ra    | Return Address       | x17      | a7    | Eighth argument     |
| x2      | sp    | Stack pointer        | x18      | s2    | Saved 2             |
| x3      | gp    | Global pointer       | x19      | s3    | Saved 3             |
| x4      | tp    | Thread Pointer       | x20      | s4    | Saved 4             |
| x5      | t0    | Temporary/Alternate  | x21      | s5    | Saved 5             |
|         |       | link register        |          |       |                     |
| x6      | t1    | Temporary            | x22      | s6    | Saved 6             |
| x7      | t2    | Temporary            | x23      | s7    | Saved 7             |
| x8      | fp/s0 | Frame pointer        | x24      | s8    | Saved 8             |
| x9      | s1    | Saved 1              | x25      | s9    | Saved 9             |
| x10     | a0    | First argument / Re- | x26      | s10   | Saved 10            |
|         |       | turn value           |          |       |                     |
| x11     | a1    | Second Argument /    | x27      | s11   | Saved 11            |
|         |       | Return value         |          |       |                     |
| x12-x15 | a2-a5 | Argument 3-5         | x28-x31  | t3-t6 | Temporary registers |

Floating point registers

| f0 | 0-f7   | ft0-ft7 | Fp temps                  | f2-f7   | fa2-fa7  | function arguments  |
|----|--------|---------|---------------------------|---------|----------|---------------------|
| f8 | 8-f9   | fs0-fs1 | Fp saved registers        | f18-f27 | fs2-fs11 | saved registers     |
| f  | 10-f11 | fa0-fa1 | Fp arguments/return value | f28-f31 | ft8-ft11 | Temporary registers |

The difference between the ABI names and the actual register numbers is due to the fact that the ranges of registers are not contiguous. For instance the range of saved registers has two of them as x8 and x9, then the rest is x18 to x27.

## 1.7 The instruction formats

Each format is designed to be used by similar type of instructions.

- R Register to register ALU instructions.
- I Immediate and load.
- **S** Store and comparisons.
- B Branch.
- U J Jump and jump with link (call) instructions.

The RISC-V manual comments these formats like this

The RISC-V ISA keeps the source (rs1 and rs2) and destination (rd) registers at the same position in all formats to simplify decoding. Except for the 5-bit immediates used in CSR instructions, immediates are always sign-extended, and are generally packed towards the leftmost available bits in the instruction and have been allocated to reduce hardware complexity. In particular, the sign bit

for all immediates is always in bit 31 of the instruction to speed sign-extension circuitry.  $^9$ 

All instructions are 32 bits. This requirement, that riscv shares with ARM and other machines, is necessary to make possible parallel decoding of instructions. In the x86, for instance, each instruction is of variable length, what makes parallel decoding of instructions an incredibly difficult undertaking.

The C extension compresses 32 bit instructions into 16 bits, what makes for more compact code. They expand into 32 bits instructions.

Table 1.2: The different instruction formats

```
"R" format:
                           "I" format
                                                          "U" format
                                                          struct uFormat {
struct rFormat {
                           struct iFormat {
                               unsigned imm12:12;
                                                              unsigned imm20:20;
   unsigned extra7:7;
                               unsigned src1:5;
   unsigned src2:5;
                                                              unsigned dst:5;
   unsigned src1:5;
                               unsigned extra3:3;
                                                              unsigned opcode:7;
   unsigned extra3:3;
                               unsigned dst:3;
                                                          };
   unsigned dst:5;
                               unsigned opcode:7;
   unsigned opcode:7;
};
"S" format
                                                          "J" format
                           "B" format
                                                          struct jFormat {
struct sFormat {
                           struct bFormat {
   unsigned imm12_2:7;
                               unsigned imm12_sign:1;
                                                              unsigned imm12_sign:1;
   unsigned src2:5;
                               unsigned imm12_10_5:6;
                                                              unsigned imm12_1_10:10;
   unsigned src1:5,
                               unsigned src2:5;
                                                              unsigned imm12_11:1;
   unsigned extra3:3;
                               unsigned src1:5;
                                                              unsigned imm12_12_19:7;
   unsigned imm12_1:5;
                               unsigned extra3:3;
                                                              unsigned dst:5;
   unsigned opcode:7;
                               unsigned imm12_1_4:4;
                                                              unsigned opcode:7;
};
                               unsigned imm12_11:1;
                                                          };
                               unsigned opcode:7;
                           };
```

## 1.7.1 The "R" format

This format features 3 registers (destination, source 1 and source 2) and has two fields of 3 and seven bits available for use to customize the opcodes. We use a 32 bit addition as an example of this format: addw a0,a0,a1. The addition using ABI names is addw a0,a0,a1 but using actual register numbers we have addw x10,x10,x11. For this instruction the 10 bits of extra-3 and extra-7 are empty.

We have then:

- Opcode:  $0\ 1\ 1\ 1\ 0\ 1\ 1 \to 0x3b$  (59 decimal).
- Destination register:  $0\ 1\ 0\ 1\ 0 \to 0xA$  (10 decimal). Register 10 is a0, that contains the first argument and is loaded with the result.

<sup>&</sup>lt;sup>9</sup>RISC-V User level ISA V 2.2 §2.2. They add further down: Decoding register specifiers is usually on the critical paths in implementations, and so the instruction format was chosen to keep all register specifiers at the same position in all formats at the expense of having to move immediate bits across formats



Figure 1.4: **R** Instruction layout

- Source 1: 0 1 0 1 0  $\rightarrow$  0xA (10 decimal). Register 10 (a0) is the first source.
- Source 2: 0 1 0 1 1  $\rightarrow$  0xB (11 decimal). Register 11 (a1) is the second source.

#### Software handling

We have an instruction with the args format of "Cs,Cw,Ct" that expects source and destination to be identical (s and w) followed by a target register in the expected range for compressed registers. All of that is true, and we succeed with a compressed 16 bit instruction.

Obviously this is not what we wanted. We wanted a 32 bit 'R' instruction. To be able to do that, we add the following instruction at the top of our assembler file

.option arch -c

I.e. we disable all compressed instructions.

We see here that the *order* in the layout of the opcode table is very important. The instructions that are **more** constrained should come first, and the general formats should come last. For instance the compressed instruction should come first, and non-compressed last, since the software stops at the first match.

## 1.7.2 The "I" format

This format changes the "R" format by merging src2 with extra-7 to give a 12 bit field where an immediate integer value caan be stored (up to  $2^{12}-1 \rightarrow 4095$  values can be stored).



Figure 1.5: I Instruction layout

#### Software handling

The first instruction that the software tries has its args string: "Cs,Cw,Ct", we expect a source register in compressed format, i.e. register 8-15, followed by the *same* register. The second condition succeeds, and the software passes to the third argument: we expect a register, and we find the constant 1025. Nope, this instruction is not the one.

The next addw instruction to be tested has the string "Cs,Ct,Cw", a permutation of the above that fails also, for the same reasons.

More instructions are tried, with strings d,Cu,Co that fails, "d,s,t" that fails also since we have an immediate constant and not a register in the third position ('t' field). At last we arrive at an instruction with args field of d,s,j", i.e. a sign extended immediate ('j') in the third position. This time the software succeeds and we are done. Accessing the different fields is done with macros. Here is one example of a series of macros that extracts the immediate field of the immediate value in the instruction above

```
#define RV_X(x, s, n) (((x) >> (s)) & ((1 << (n)) - 1))
```

This macro extracts <n> bits from <x>, beginning in bit position <s>. It has two parts:

- 1. The left side of the "and" operation that shifts the given number <s> bits to the right to bring it to position zero, and
- 2. An expression that builds a mask of  $\langle n \rangle$  1 bits by shifting a 1  $\langle n \rangle$  positions to the right and subtracting one, what gives a power of two minus 1. A power of two minus 1 is a field full of ON bits in two's complement notation. For instance 1  $\langle \langle 3 \rangle \rangle$  8 (1000). You subtract 1 from that and you obtain 0111 (7), i.e. 3 bits "on", a mask to extract the lower 3 bits from a number.

```
#define RV_IMM_SIGN(x) (-(((x) >> 31) & 1))
```

This macro returns either -1 or 0, depending if the sign of the 32 bit number is negative or positive. Since -1 is 32 bits of "1" bits, it can be used to sign extend a number.

The two macros above are used in these new ones:

```
#define EXTRACT_ITYPE_IMM(x) (RV_X(x,20,12)|(RV_IMM_SIGN(x) << 12))
#define ENCODE_ITYPE_IMM(x) (RV_X(x, 0, 12) << 20)</pre>
```

The first macro extracts 12 bits from the given number ( $\langle x \rangle$ ) and sign-extends its sign. The second extracts the lower 12 bits of the value, and puts them at position 20-31  $^{10}$ 

## 1.7.3 The "U" format

A variant of the I format featuring more space for immediate constants is the U format, that can hold immediate constants with 20 bits.

The lui<sup>11</sup>. instruction loads an unsigned 20 bits immediate stored in the bits 12 to 31 of the instruction into the upper 20 bits of the destination and sets the lower 12 bits to zero. In C language notation we have: dst = (imm20 << 12); The authors justify these choices with:

<sup>&</sup>lt;sup>10</sup>It is a pity that machines implementing the boolean extension aren't widely available yet. I miss the ARM boolean instructions that will reduce many of those macros to a couple of instructions.

 $<sup>^{11}</sup>$ lui stands for load upper immediate

Figure 1.6: U Instruction layout

In practice, most immediates are either small or require all XLEN bits. We chose an asymmetric immediate split (12 bits in regular instructions plus a special load upper immediate instruction with 20 bits) to increase the opcode space available for regular instructions. Immediates are sign-extended because we did not observe a benefit to using zero-extension for some immediates as in the MIPS ISA and wanted to keep the ISA as simple as possible. 12

#### Software handling

Looking up the args description for this instruction, we find the character string "d,Cu". This means we should expect a register name, followed by a comma, and an immediate value to be able to use a C (compressed) instruction. But that doesn't work, our constant is beyond bounds of the compressed immediate.

The software continues its search for the correct instruction and we come to the next instruction in the list that has the args string "d,u", without any compression requirements. This time a match is found, and necessary bits are inserted as shown in figure 1.6 page 32.

Obviously, loading an immediate constant that will be shifted by 12 bits is seldom used. This is thought for loading the upper 20 bits of an *address*, then adding the lower 12 bits with another instruction. This constant was choosen in this example so that it has a 1 bit at the end of 10 bits, and 1 at the start to be visible in the drawing.

To extract the J type immediate we use the following macro:

```
#define EXTRACT_UTYPE_IMM(x) ((RV_X(x, 12, 20) << 12) | (RV_IMM_SIGN(x) << 32))
```

## 1.7.4 The "S" format

In this format, the dst field disappears and its bits are used to hold the lower 4 bits of an immediate value. An instruction that uses this format is the sd (store double word) instruction.

We use the instruction sd ra,64(sp) as example. This instruction means: Store the contents of the return address register (ra) at the memory address obtained by adding 64 to the contents of the sp register. We have here an address that is obtained by adding the contents of a register and a displacement that must fit into 12 bit. As you can see here, this is a much easier format than the ARM jungle of different types of offsets where you never really know which one to use. The Risc-V manual specifies that all offsets are signed. 13

Except for the 5-bit immediates used in CSR instructions, immediates are always sign-extended, and are generally packed towards the leftmost available bits in the instruction and have been allocated to reduce hardware complexity. In particular, the sign bit for all immediates is always in bit 31 of the instruction to speed sign-extension circuitry.

<sup>&</sup>lt;sup>12</sup>Riscy ISA Architecture §2.2

 $<sup>^{13}\</sup>mathrm{They}$  say:

Figure 1.7: S Instruction layout



We have then for this instruction:

- src1 is 0 0 0 1 0, or register 2.
- src2 is 0 0 0 0 1, or register 1.
- The immediate is the concatenation of imm[4:0] and imm[11:5] i.e; 0 0 0 0 0 1 0 0 0 0 0 or 64.

For extracting the immediate from the instruction we use the macro

```
#define EXTRACT_STYPE_IMM(x) \
(RV_X(x, 7, 5) | (RV_X(x, 25, 7) << 5) | (RV_IMM_SIGN(x) << 12))</pre>
```

This macro extracts five bits beginning at position seven, then 7 bits from position 25 upwards, shifted by 5 left, so that they come right after the first five. The whole is sign extended in the same way as explained in section 1.7.2 page 31.

## 1.7.5 The "B" format

Figure 1.8: B Instruction layout



In this format, we have a 13 bit immediate for branches. The immediate represents the amount that will be added to the program counter to reach the specified location, in multiples of 2. Since the lowest bit of the immediate will be always zero, it has been replaced by bit 11 (the twelfth bit) adding one bit to the quantity being written. The range of the branch is  $\pm$  4K.

The different conditional branches are specified in the extra-3 group, with

Table 1.3: Encoding of conditional branches

| extra-3 | Instruction | Description     |
|---------|-------------|-----------------|
| 0 0 0   | beq         | branch if equal |

| 0 0 1 | bne  | branch if different              |
|-------|------|----------------------------------|
| 1 0 0 | blt  | branch if less than              |
| 1 0 1 | bge  | branch if greater/equal          |
| 1 1 0 | bltu | branch if less than unsigned     |
| 1 1 1 | bgeu | branch if greater equal unsigned |

Table 1.3: Encoding of conditional branches

All these instructions share the same opcode: 99. The extra-3 field is used to extend the opcode for different instructions.

The macro to access the immediate value is way more complicated due to the bit scrambling...

```
#define EXTRACT_BTYPE_IMM(x) ((RV_X(x, 8, 4) << 1) | \
(RV_X(x, 25, 6) << 5) | (RV_X(x, 7, 1) << 11) | (RV_IMM_SIGN(x) << 12))</pre>
```

## 1.7.6 The "J" format

The only difference between the U and J formats is that the 20-bit immediate is shifted left by 12 bits to form U immediates and by 1 bit to form J immediates. In the "J" format, the immediate represents an offset in pairs of 16 bit instructions from the current PC.

Figure 1.9: J Instruction layout



Why this scrambled layout? Citing the Risc-v manual:

Although more complex implementations might have separate adders for branch and jump calculations and so would not benefit from keeping the location of immediate bits constant across types of instruction, we wanted to reduce the hardware cost of the simplest implementations. By rotating bits in the instruction encoding of B and J immediates instead of using dynamic hardware muxes to multiply the immediate by 2, we reduce instruction signal fanout and immediate mux costs by around a factor of 2.

The scrambled immediate encoding will add negligible time to static or ahead-of-time compilation. For dynamic generation of instructions, there is some small additional overhead, but the most common short forward branches have straightforward immediate encodings.

The macro to extract this monster from its hiding place looks like this

```
#define EXTRACT_JTYPE_IMM(x) ((RV_X(x, 21, 10) << 1)|(RV_X(x, 20, 1) << 11) | \
(RV_X(x, 12, 8) << 12) | (RV_IMM_SIGN(x) << 20))
#define ENCODE_JTYPE_IMM(x) ((RV_X(x, 1, 10) << 21)|(RV_X(x, 11, 1) << 20) | \
(RV_X(x, 12, 8) << 12) | (RV_X(x, 20, 1) << 31))
```

## 1.8 The compressed instructions

The Risc-v instructions are normally 32 bits in length. The "C" extension (C for Compressed) encodes certain instructions in 16 bits, what leads to big savings in code size. These instructions aren't enabled by default in the assembler. You can enable them (if your machine actually supports them) with the instruction: .option arch, +c. Enabling them or not is not that important, since the linker will replace longer with shorter instruction whenever possible. For instance the jumps can't be really calculated until all the instructions are compressed, what only the linker can know.

The compressed instructions are enabled when one of these conditions is true:

- The compressed 16 bit instructions have the lowest 2 bits of the opcode set to either 00, 01, or 10.
- 32 bits instructions have their lowest two bits set to 11. The following 3 bits should have any value different from 111.
- The 48 bit instructions have their lowest 6 bits set to 011111. (5 bits set)
- 64 bit instructions have the 7 lower bits set to 0111111. (6 bits set)

The criteria for making a compressed instruction are as follows:

- The immediate or the address offset is small.
- One of the registers used is the zero register (x0), the return address register or link register ra (x1), or the stack pointer sp (x2).
- The destination and first source register are the same.
- The registers used belong to the 8 most popular ones, described with 3 bits in the table below 14.

000 010 number 001 011 100 101 110 111 int reg. number x8 x9x10x11 x12x13x14x15ABI name s0a0a2a3s1a1a4 $a_5$ FP reg number f8f9f10 f11 f12f13 f15 f14 FP ABI name fs0fs1fa0fa1 fa2 fa3 fa4 fa5

Table 1.4: Compressed register numbers

There are nine different compressed instruction layouts.

In the table below the registers that use the 3 bit number are marked with a '.

Table 1.5: Compressed formats

| Meaning     | Code | 15 14 13 | 12          | 11 1 | 10 9 | 8          | 7    | 6   | 5         | 4  | 3    | 2 | 1  | 0 |
|-------------|------|----------|-------------|------|------|------------|------|-----|-----------|----|------|---|----|---|
| Register    | CR   | Extra-4  | m dst/src1  |      |      |            | src2 |     |           |    | op   |   |    |   |
| Immediate   | CI   | Extra-3  | S rd/rs1    |      |      | immediat e |      |     |           | op |      |   |    |   |
| Store local | CSS  | Extra-3  | imm         |      |      |            | rs2  |     |           |    | op   |   |    |   |
| Wide imm    | CIW  | Extra-3  | $_{ m imm}$ |      |      |            | rd'  |     |           |    | op   |   |    |   |
| Load        | CL   | Extra-3  | imm         |      |      | rs1'       |      | imm |           |    | rd'  |   | op |   |
| Store       | CS   | Extra-3  | imm         |      |      | rs1'       |      | imm |           |    | rs2' |   | О  | p |
| Arithmetic  | CA   | Extra-6  |             |      | re   | rd'/rs1'   |      |     | Extra-2 r |    |      |   | О  | р |

<sup>&</sup>lt;sup>14</sup>Actually those numbers are just the normal register number modulo 8.

Table 1.5: Compressed formats

| Branch | CB | Extra-3 | offset      | rs1' | offset | op |  |  |
|--------|----|---------|-------------|------|--------|----|--|--|
| Jump   | CJ | Extra-3 | jump target |      |        |    |  |  |

## 1.8.1 The compressed register (CR) format

Figure 1.10: Compressed CR Instruction layout



This format accepts instructions where the destination and the first source register are the same. It has four fields, here from right to left, i.e. from bit 0 to 15:

- 1. OP: Bits 0-1. Value: 2.
- 2. Src2: Bits 2-6. The second source register. Note that it is specified in 5 bits, like dst/src1, so any register of the set of 32 is possible, except the zero register. In this case it is 13, i.e. register a3 (x13).
- 3. dst / src1: Bits 7-11. The source 1 and the destination register are the same. Also specified in 5 bits, in this case it is 12: the a2 (x12) register.
- 4. Extra-4: Bits 12-15. Value: 9. Complements the opcode. This field can have two values that correspond to mv (move) or, in the example, add.

#### The software side

The argument description for addw,a2,a2,a3 is the character string Cs,Cw,Ct. The first argument is a compressed format source register (Cs), followed by a compressed format register that should be equal to the preceding one (Cw), followed by a compressed format second source register, (Ct).

The code for the 's' case in riscv\_ip is as follows:

It is a typical sample of the code in the encoder (riscv\_ip). We search for a register name with reg\_lookup and we ensure that is between 8 and 15. If that is not the case, the matching process for this instruction candidate fails, and we look for the next one (break).

If it is, we insert the operand in the right position and continue with this candidate.

Note that the identifier CRS1S doesn't appear in ANY macro, variable or enumeration in the whole program.

It is a literal name argument! When we look at the definition of  ${\tt INSERT\_OPERAND}$  we find:

```
1 #define INSERT_OPERAND(FIELD,INSN,VALUE) \
    INSERT_BITS ((INSN).insn_opcode,VALUE,OP_MASK_##FIELD,OP_SH_##FIELD)
```

The ## operand before the FIELD macro argument makes the preprocessor convert it to OP\_MASK\_CRS1S what is defined with #define OP\_MASK\_CRS1S 0x7 in asm.h.

The first level expansion converts this to:

```
1 #define INSERT_OPERAND(FIELD,INSN,VALUE) \
     INSERT_BITS ((INSN).insn_opcode, VALUE, OP_MASK_CRS1S, OP_SH_CRS1S)
     The INSERT_BITS macro is defined as follows:
1 #define INSERT_BITS(STRUCT, VALUE, MASK, SHIFT) \
        (STRUCT) = (((STRUCT) & ~((insn_t)(MASK) << (SHIFT))) \
2
           | ((insn_t)((VALUE) & (MASK)) << (SHIFT)))
3
     This macro has two parts, separated by an | (or) sign:
1 ((STRUCT) & ~((insn_t)(MASK) << (SHIFT))) and
```

2 ((insn\_t)((VALUE) & (MASK)) << (SHIFT)</pre>

In the first one we set to zero all bits in the field that will be written. The second one introduces the bits into the right position. The or operation joins those parts into a single value.

The encoder works like an interpreter for a "language" of single letters that represent pieces of instruction fields. They indicate what to expect at the given position. Its actions can be only be "break" (discard the current candidate) or insert the correct bits and "continue" with it.

# 1.8.2 The compressed immediate (CI) format

These instructions perform operations between a register and a small immediate encoded in only 6 bits. The register can't be the zero register, and the immediate can't be zero. There



Figure 1.11: Compressed immediate CI Instruction layout

are four instructions that use the compressed immediate format. They differ in the extra-3 field. From least significant bit to the most significant one we have:

- 1. OP: Bits 0-1, always with value 1 for the CI format.
- 2. The immediate field, in bits 2 to 6 that encodes immediate bits 0 to 4. In the example above this is 27, 1 1 0 1 1 in binary.
- 3. The destination and the source register number over 5 bits. In the example we have 14 since the register a4 has the number 14.
- 4. The sign of the immediate value in a single bit (index 12th).

5. The Extra-3 field, that allows for 3 instructions to be distinguished: addi, addiw, and addi16sp. The last one adds a number of 16 bits quantities to the stack and is used to adjust the stack at the prologue or at the epilogue of a function. Since the stack must be aligned to a multiple of 16, there is no need to keep the lower 4 bits. This makes for adjustments of -512 to 496 bytes.

To access the immediate value we use

```
#define EXTRACT_CITYPE_IMM(x) (RV_X(x, 2, 5) | (-RV_X(x, 12, 1) << 5))
#define ENCODE_CITYPE_IMM(x) ((RV_X(x, 0, 5) << 2) | (RV_X(x, 5, 1) << 12))
```

The first macro uses the same technique for sign extending that our RV\_IMM\_SIGN uses (see 1.7.2 page 31). We just need another expression since the other was fixed for 32 bits.

# 1.8.3 The stack relative store (CSS) format

| Syntax                  | Operation                                     |
|-------------------------|-----------------------------------------------|
| c.swsp rs2 (uimm6)(sp)  | Store word to an offset from sp.              |
| Crowop 102 (dimme) (op) | $mem[sp + (uimm6 << 2)] \leftarrow rs2[031]$  |
| c.sdsp rs2 (uimm6)(sp)  | Store double word to an offset from sp.       |
|                         | $mem[sp + (uimm6 << 3)] \leftarrow rs2[063]$  |
| c.fswsp fs2 (uimm6)(sp) | Store single precision to an offset from sp.  |
|                         | $mem[sp + ()uimm6 << 2)] \leftarrow fs2[031]$ |
| c.fsdsp fs2 (uimm6)(sp) | Store double precision to an offset from sp.  |
|                         | $mem[sp + (uimm6 << 3)] \leftarrow fs2[063]$  |

Table 1.6: Compressed store instructions with the CSS format

Figure 1.12: Store to stack offset (CSS) instructions layout



In our example instruction we have an op field of 2, an  $\mathtt{src}$  field of 16 (10000) and the cryptic "011000" sequence that is translated into 00110 (6 decimal) since the bits are scrambled: they are stored as bits 5 4 3 2 7 6 The macros to access the immediate displacement here are:

```
#define EXTRACT_CSSTYPE_IMM(x) (RV_X(x, 7, 6) << 0)
#define ENCODE_CSSTYPE_IMM(x) (RV_X(x, 0, 6) << 7)</pre>
```

The encoding of instruction c.swsp needs only one source register: the source of the 32 bit data to store in memory. Any register will do since we have a register number in 5 bits. The value of the immediate displacement will be added to the stack pointer scaled by 4 to form the effective address. In the example above the 6 binary is scaled to 24. <sup>15</sup>

<sup>&</sup>lt;sup>15</sup>By an unfortunate coincidence the scrambled bits of the constant are 011000, what is 24 in binary. Beware, nothing in this business is simple, and a 24 can be scrambled to 6, then scaled to 24 back again.

The argument description string is "CV,CM(Cc)": We need a register name (CV), followed by a small constant (CM) that is a displacement (the parentheses) of the stack pointer (Cc). The constant value will be zero extended, since obviously negative offsets for the stack aren't very useful!

The reach of this instruction is  $2^7 - 1$  values since we have 7 bits. Scaled by 4, i.e. 127 \*  $4 \rightarrow 508$ .

And... "one more thing" as Steve Jobs liked to say, there is a problem with zero off-sets from the stack pointer. Normally a zero offset is omitted, i.e. you do NOT write sw a6,0(sp), you just write sw a6,(sp). The handling of the CM directive tests for this with the function riscv\_handle\_implicit\_zero\_offset.

## 1.8.4 The wide immediate (CIW) format

This format is used to encode a constant in bits 5 to 12. It is used in the addi4spn instruction. The constant encoded in those 8 bits is scaled by 4, i.e. the two lower bits are implicit zeroes. The scaled value will be added to the stack pointer and written to the register whose index is stored in the 3 bits rd'. This instruction builds then pointers to values stored in the local stack frame.

Figure 1.13: Store to stack offset (CIW) instructions layout



- 1. The OP field is zero.
- 2. The destination (rd') is 7, the register number in 3 bits of the a5 register
- 3. Now, this is more complicated to explain. The poor immediate bits are *scrambled*, i.e. they are **not** in the natural order but in the order: 5, 4, 9, 8, 7, 6, 2, 3. The bits 1 and 0 are implicitly zero. The quantity (128) has a single bit on at the position 7, what in our scrambled layout corresponds to bit 8. <sup>16</sup>. The Risc-V ISA manual justifies this saying:

The immediate fields are scrambled in the instruction formats instead of in sequential order so that as many bits as possible are in the same position in every instruction, thereby simplifying implementations.<sup>17</sup>

The "simpliying" above refers to hardware simplification.

4. The Extra-3 field is zero.

The macros used to access the immediate are:

<sup>&</sup>lt;sup>16</sup>The number 128 is 1000 0000 in binary. Bit 7 is one. In the scrambled order we have bit 7 in the fourth position of the immediate field, counting from left to right, as shown in the figure 1.13

<sup>&</sup>lt;sup>17</sup>Risc-V Unprivileged ISA V20191213 §16.2

```
#define EXTRACT_CIWTYPE_ADDI4SPN_IMM(x) ((RV_X(x, 6, 1) << 2) |\
(RV_X(x, 5, 1) << 3) | (RV_X(x, 11, 2) << 4) | (RV_X(x, 7, 4) << 6))
#define ENCODE_CIWTYPE_ADDI4SPN_IMM(x) ((RV_X(x, 2, 1) << 6) |\
(RV_X(x, 3, 1) << 5) | (RV_X(x, 4, 2) << 11) | (RV_X(x, 6, 4) << 7))
```

The argument description string for this instruction is "Ct,Cc,CK"

# 1.8.5 The compressed load (CL) format

Figure 1.14: Compressed load CL Instruction layout



- 1. The OP field is zero.
- 2. The destination register is 5 (a3). 18
- 3. This field corresponds to an offset from a register. The constant should be aligned by a multiple of 4, since we are loading 4 bytes. The two lower bits then should be zero and they are implicit, i.e. they are absent from the encoding. The value is split between two bits at positions 5 and 6, and the rest in positions 10, 11, and 12. The two bits in positions 5 and 6 are scrambled, and bit 6 corresponds to bit 2 of the immediate and bit 5 is bit 6 of the immediate value, they are not consecutive.
- 4. The rs1' field contains 1 1 1, what corresponds to x15 (a5).
- 5. We have in bits 10, 11, and 12 the bits 3, 4, and 5 of the immediate value.
- 6. The extra-3 field contains constant 2.

# 1.9 The opcode table

The full table of opcodes (called riscv\_opcodes) consists of entries with the following structure:

```
struct riscv_opcode {
const char *name;
```

The name of the instruction in lower case. This is also the used as the key to the hash table. Several instructions can share the same name, and they are recognized by their different arguments.

## unsigned xlen\_requirement;

The word bit length (32, 64, or 128) that is required to use this instruction. A zero here means no requirement.

 $<sup>^{18}</sup>$ These values are in table 1.4

```
enum riscv_insn_class insn_class;
```

The instruction class to which it belongs. For instance the instructions belonging to the basic integer operations are INSN\_CLASS\_I one of the member of the enum riscv\_insn\_class. This was used to decide whether or not this instruction is legal in the current machine architecture context, but this test has been dropped since we assume that the compiler will not generate instructions that are illegal for the target machine.

```
const char *args;
```

A string describing the arguments for this instruction. This string will be interpreted by the riscv\_ip function in a rather big set of nested switch statements.

```
insn_t match;
insn_t mask;
```

The basic opcode for the instruction. When assembling, this opcode is modified by the arguments to produce the actual instruction that is used. If pinfo is INSN\_MACRO, then this is 0. Otherwise the mask field is a bit mask used to isolate the relevant portions of the opcode when disassembling. If pinfo is INSN\_MACRO then this field contains the macro identifier, encoded as a member of an anonymous enumeration and casted to an integer.

```
int (*match_func) (const struct riscv_opcode *op, insn_t word);
```

A function to determine if a word corresponds to this instruction. Usually, this computes ((word & mask) == match).

unsigned long pinfo;

Additional information about the instruction. They are:

| Symbol          | Description                                                      |  |  |
|-----------------|------------------------------------------------------------------|--|--|
| INSN_ALIAS      | Just an alias, for example "mv" for "addi dest,src,zero          |  |  |
| INSN_BRANCH     | Unconditional branch                                             |  |  |
| INSN_CONDBRANCH | Conditional branch                                               |  |  |
| INSN_JSR        | Jump to a subroutine                                             |  |  |
| INSN_DREF       | Data reference                                                   |  |  |
| INSN_V_EEW64    | Instruction allowed only when the machine is a 64 bit ma-        |  |  |
|                 | chine or more                                                    |  |  |
| INSN_XX_BYTE    | 5 different data size specifiers, for XX=1, 2, 4, 8, or 16 bytes |  |  |

Table 1.7: Opcode flags

};

The field args above needs more explanation. It is a one (or more) letters that represent the type of argument that can be expected in an instruction. This can be a register, a constant within a certain range, or other things. During assembly, the assembler reads and interprets this character string to weed out wrong choices or emit warnings, and to verify that all constrains are met.

The table below should document all the letters used by the riscv\_ip function. They are listed in the order they appear there; only for the first level. If a letter has a continuation (for instance for the compressed instructions), the secondary switch statement is explained in another table <sup>19</sup>.

<sup>&</sup>lt;sup>19</sup>Nested tables are as difficult to read as nested switch statements.

Table 1.8: Opcode arguments letters

| Character             | Argument expected                                                                        |  |  |  |
|-----------------------|------------------------------------------------------------------------------------------|--|--|--|
| \0                    | End of the argument string. Here are done the final checks, for instance                 |  |  |  |
|                       | that this instruction corresponds to the bit length of the machine (64 bit               |  |  |  |
|                       | instructions can't be done in a 32 bit machine). It checks also if the end               |  |  |  |
|                       | of the argument string coincides with the end of the actual arguments                    |  |  |  |
|                       | present. If everything goes well it sets the errors to zero and branchs to               |  |  |  |
|                       | the end of the riscv_ip function.                                                        |  |  |  |
| ,                     | Synchronization. Arguments are separated by commas. The software                         |  |  |  |
|                       | tests this and ignores the separators.                                                   |  |  |  |
| ()[]                  | Displacement or index. Same behavior as for commas.                                      |  |  |  |
| 0                     | Expects a zero displacement. For instance: lr.w a5,0(sp).                                |  |  |  |
| 1                     | Used for thread local storage.                                                           |  |  |  |
| <                     | Shift amount for shifts less than 32.                                                    |  |  |  |
| >                     | Shift amount for 0 to word length - 1. Normally 63.                                      |  |  |  |
| 0<br>1<br><<br>><br>A | Requests a symbol                                                                        |  |  |  |
| a                     | 20 bit relative offset.                                                                  |  |  |  |
| В                     | Requests a symbol or a constant.                                                         |  |  |  |
| С                     | Compressed format instructions. This leads to a nested switch state-                     |  |  |  |
|                       | ment, since all the compressed argument descriptions begin with a C                      |  |  |  |
|                       | letter. This switch is described in table 1.10 page 44.                                  |  |  |  |
| c                     | Call using the global object table                                                       |  |  |  |
| D                     | Floating point destination register                                                      |  |  |  |
| d                     | Destination register.                                                                    |  |  |  |
| E                     | Control register number. This is used only in privileged instructions.                   |  |  |  |
| F                     | Expects a bit field, that is defined by the following character. Used in                 |  |  |  |
| -                     | the .inst directive only.                                                                |  |  |  |
| I                     | M_LI macro. Immediate value.                                                             |  |  |  |
| j                     | Sign extended immediate.                                                                 |  |  |  |
| m                     | Rounding mode. This argument expects a character string represent-                       |  |  |  |
| 111                   | ing the rounding mode. It can be one of "rne", "rtz", "rdn", "rup",                      |  |  |  |
|                       | "rmm", "dyn". See table 1.9 page 43.                                                     |  |  |  |
| 0                     | Opcode field                                                                             |  |  |  |
| 0                     | Expects a load/store displacement.                                                       |  |  |  |
| P                     | Fence predecessor                                                                        |  |  |  |
| p                     | PC relative offset                                                                       |  |  |  |
| Q                     | Fence successor                                                                          |  |  |  |
| q                     | Expects a register store displacement.                                                   |  |  |  |
| R                     | Floating point RS3 for .insn directive.                                                  |  |  |  |
| r                     | RS3. Integer register for the .insn directive.                                           |  |  |  |
| S                     | Floating point source 1.                                                                 |  |  |  |
| s                     | First source register. Also called src1 in the documentation.                            |  |  |  |
| T                     | Floating point source 2                                                                  |  |  |  |
| t                     | Second source register. The 't' is for target. It is also called src2 in                 |  |  |  |
| U                     | the documentation.                                                                       |  |  |  |
| U                     | Floating point source.                                                                   |  |  |  |
|                       | _                                                                                        |  |  |  |
| V                     | Expects a 20 bit immediate  Vector instructions. This leads to a posted switch statement |  |  |  |
| V                     | Vector instructions. This leads to a nested switch statement.                            |  |  |  |

Table 1.8: Opcode arguments letters

| Character               | Argument expected                                                          |  |  |  |
|-------------------------|----------------------------------------------------------------------------|--|--|--|
| У                       | Expects a bs immediate used in the cryptography sm4 instructions sm4es     |  |  |  |
|                         | and sm4ks. It is a 2 bit constant that tells the sm4 algoritrhm which byte |  |  |  |
|                         | to choose: it represents the number of bytes to shift right rs2, selecting |  |  |  |
|                         | thus a single byte. See §1.36 page 106.                                    |  |  |  |
| Y                       | Expects an rnum immediate. This is a constant that appears in the          |  |  |  |
|                         | instruction aes64ks1i. It is used only in that instruction, and should     |  |  |  |
|                         | be $0 \le rnum \le 10$ .                                                   |  |  |  |
| W                       | Expects an offset for the prefetch instruction. The offset should have     |  |  |  |
|                         | the 5 lower bits at zero. Followed by letters "if".                        |  |  |  |
| X                       | Integer immediate                                                          |  |  |  |
| Xu                      | eXtract unsigned $n$ bits starting at position $m$ . These arguments look  |  |  |  |
|                         | like this: Xu2@25, meaning eXtract 2 bits starting at bit 25.              |  |  |  |
| $\overline{\mathbf{z}}$ | Expects a CSR number, a CSRRxI Immediate. Control and Status               |  |  |  |
|                         | Registers are specified in a different instruction format. For this to     |  |  |  |
|                         | work, you have to have access to a CPU with the 'z' extension.             |  |  |  |
| Z                       | Expects a zero                                                             |  |  |  |

Below is the set of rounding modes for the m parameter. It has been taken from the Sifive site<sup>20</sup>. Edited in May 27th 2020.

Table 1.9: Accepted rounding modes for the 'm' parameter

| Binary | Mnemonic | Meaning                                |  |  |
|--------|----------|----------------------------------------|--|--|
| Value  |          |                                        |  |  |
| 000    | rne      | Round to Nearest, ties to Even         |  |  |
| 001    | rtz      | Round towards Zero                     |  |  |
| 010    | rdn      | Round Down (towards $-\infty$ )        |  |  |
| 011    | rup      | Round Up (towards $+\infty$ )          |  |  |
| 100    | rmm      | Round to Nearest, ties to Max Magni-   |  |  |
|        |          | tude                                   |  |  |
| 101    |          | Invalid. Reserved for future use.      |  |  |
| 110    |          | Invalid. Reserved for future use.      |  |  |
| 111    | dyn      | In instruction's rm field, selects dy- |  |  |
|        |          | namic rounding mode; In Rounding       |  |  |
|        |          | Mode register, Invalid.                |  |  |

These rounding modes are recognized in the assembler using the  ${\tt riscv\_rm}$  table, a simple table of 8 character strings.

The C (compressed) instructions are differentiated by the following letters:

Table 1.10: Compressed instruction types

| Char | Description                             |
|------|-----------------------------------------|
| 5    | Five bit field                          |
| 6    | Six bit numeric field                   |
| 8    | Eight bit field                         |
| a    | Jump. Expects 20 bit PC relative offset |
| С    | Source 1 constrained to be sp           |

<sup>&</sup>lt;sup>20</sup>sifive-blog

The URL seems truncated but it is not...

| D      | Floating point source 2                               |
|--------|-------------------------------------------------------|
| F      | Field of 6, 4, 3, or 2 bits                           |
| U      | Source 1 and destination the same.                    |
| j<br>k | Non-zero immediate of 6 bits                          |
| k      | Immediate (possibly zero)                             |
| K      | scaled by 4 stack addend                              |
| 1      | Load immediate (64 bits)                              |
| L      | Stack offset scaled by 16                             |
| m      | Load immediate                                        |
| n      | Immediate offset from SP                              |
| M      | Scaled by 4 stack displacement (32 bits store)        |
| N      | Data reference with offset from stack scaled by 4(64  |
|        | bits store)                                           |
| 0      | C.addiw, c.li, and c.andi allow zero immediate.       |
|        | C. addi allows zero immediate as hint. Otherwise this |
|        | is same as 'j'.                                       |
| s      | Source register 1                                     |
| S      | Floating point source 1                               |
| t      | Integer register source 2                             |
| Т      | Floating point source 2                               |
| u      | Immediate for jumps                                   |
| V      | Second source integer register rs2                    |
| v      | Compressed I type immediate                           |
| X      | Source 2 and destination are the same.                |
| w      | Source 1 constrained to be equal to the destination.  |
|        | Source 2 should be the zero register                  |
| >      | Shift amount between 0 and word length - 1            |

Table 1.10: Compressed instruction types

This is an example for an instruction entry in the opcodes table:

{"addi",0,INSN\_CLASS\_C,"Ct,Cc,CK",MATCH\_C\_ADDI4SPN,MASK\_C\_ADDI4SPN,\match\_c\_addi4spn,INSN\_ALIAS},

After parsing the name of the instruction, the riscv\_ip function examines entries in the opcode table starting with the first one that has this name. It copies this entry into temporary storage because it will modify it later (using the create\_insn function).

Then, it uses the letter in the args character string to check if there is a match. If there is, it stores immediately the bits into the instruction copy. But, as mentioned above, if there isn't any match, all the work is discarded and riscv\_ip starts over using a saved pointer to the start of the arguments.

This way it ensures that eventually, the good instruction will be discovered, if at all. It is a slow process, since in many cases 4 other 5 instructions will be parsed and discarded until the correct one is found. Since the order of the opcodes is crucial the most used instructions can be the last ones to be found, what compounds the problem.

Several solutions can be imagined to speed up things, but the question arises if the speed of the assembler encoding is really the limiting factor for the compilation process. In a very cheap riscv machine assembling a 3.6Mb file takes 1.7 seconds, including the time for i/o from disk.

# 1.10 A more detailed view of instruction encoding

If the first symbol in a line of assembler text is not a label or a directive, the parser calls md\_assemble. 21



Figure 1.15: md\_assemble control flow

```
1 static void md_assemble(char *str)
2 {
      struct riscv_cl_insn insn;
3
      expressionS imm_expr;
4
      bfd_reloc_code_real_type imm_reloc = BFD_RELOC_UNUSED;
5
6
      /* The architecture and privileged elf attributes should be set
       * before assembling. */
      if (!start_assemble) {
          start_assemble = true;
10
          riscv_set_abi_by_arch();
                                                                         (1)
1\,1
12
          if (!riscv_set_default_priv_spec(NULL)) return;
13
      riscv_mapping_state(MAP_INSN,0,false /* fr_align_code */);
                                                                         (2)
14
      const struct riscv_ip_error error = riscv_ip(str,&insn,
15
                                        &imm_expr,&imm_reloc,op_hash); (3)
16
      if (error.msg) {
                                                                         (4)
17
          if (error.missing_ext)
18
              as_bad("%s '%s',extension '%s' required",error.msg,
19
                      error.statement,error.missing_ext);
20
          else as_bad("%s '%s'",error.msg,error.statement);
21
          return;
22
^{23}
      if (insn.insn_mo\rightarrowpinfo == INSN_MACRO)
24
```

<sup>&</sup>lt;sup>21</sup>The md prefix is probably a short for machine dependent.

```
25 macro(&insn,&imm_expr,&imm_reloc);
26 else append_insn(&insn,&imm_expr,imm_reloc);
27 }
(5)
```

- 1. If it is the first time that we emit an instruction, initialize things.<sup>22</sup> The functions called are:
  - riscv\_set\_abi\_by\_arch. This function sets the ABI, and makes a lot of consistency checks.
  - riscv\_set\_default\_priv\_spec sets the privileged instruction set that will be used

This settings allow the assembler to check if an instruction is valid within the subset that is established. The problem with this approach is that if the assembler makes a bad guess about the machine, it will not assemble perfectly legal instructions, as is the case with the U74 CPU that I am using. The GAS assembler doesn't want to assemble the bit manipulation instructions and several other extensions that the U74 supports. Since there wasn't any hope of convincing the "binutils" group that it would be nice if the assembler supported all instructions available in the machine it is running on, I decided to drop those tests<sup>23</sup>

2. The riscv\_mapping\_state function adds "mapping symbols" if there is a transition from another section to the .text section. The signification of those is not completely clear, to me at least. In any case they are produced in important quantities, and the function riscv\_check\_mapping\_symbols is tasked with removing them if there are too many of them.

The mapping state can be either MAP\_INSN or MAP\_DATA. It is stored in a field of the riscv\_segment\_info\_type structure that is a member of the segment\_info block associated with each section. Within tiny-asm, this structure is set but never really used outside the mapping state function, for coding a state transition from another section to the .text section. Only those transitions are monitored.

3. We come now to the central task of an assembler: encoding instructions. This is done in the riscv\_ip function.

As you have seen in the encoding of the instructions, each instruction has several pieces of information written in pieces of its 32 bits: the source register number, the opcode, etc. These pieces are the parameters that fill the instruction, besides the fixed bits of the opcode.

In the opcode table those parameters are grouped into a sequence of letters that represent each one of the different pieces that an instruction can receive. riscv\_ip interprets those letters and acts accordingly, putting into the specified parts of the instruction the register numbers, the immediate constants or all other parameters that build an instruction. Each letter can have other modifiers, for instance the C letter (Compressed) has several modifiers that specify the parts of the 16 bit compressed instruction. For a full description of each letter see the table in §1.9 page 41.

4. riscvv\_ip returns either without an error, or with an error description stored in a structure that receives as a parameter.

<sup>&</sup>lt;sup>22</sup>The assembler uses a global variable (start\_assemble) that is used only here or in riscv\_write\_out\_attrs to test if we have an empty file with no instructions. In the later case we do not write the elf attributes to the executable.

<sup>&</sup>lt;sup>23</sup>Actually, the situation is more complicated. The GNU people argue that it is a security that the assemble checks if the instructions being assembled are valid. They say that if you invoke gcc with the **undocumented** option march=rv64gc\_zbb those instructions are assembled. Well, you can judge by yourself.

5. If the instruction found by riscv\_ip is a macro, the macro-expansion procedure is called, otherwise we append the new instruction to the growing fragment using append\_insn. append\_insn's task is to decide how the new instruction will be added to the current fragment, and if it is necessary to create a new fragment. If the new instruction has a relocation, either it is stored with add\_relaxed\_insn when possible, or a new fixup is created, and the instruction stored with append\_fixed\_insn. If there are no relocations, the instruction is immediately stored with append\_fixed\_insn.

After storing the instruction, the assembler creates a new fragment for all relocations that could be changed by the linker, to avoid calculating wrong offset between symbols, since those could change by the actions of the linker.

# 1.11 Writing the object file

After we have encoded all instructions and setup all the static data, processed all the assembler directives, we arrive at the end of the file, and we start preparing for writing the result of our efforts: the object file.

This file is written according to the ELF (Executable and Link Format.)<sup>24</sup> standard. This file format is extensively described in a lot of documentation floating in the internet, so it is not necessary to repeat all that here.

Before we start writing out things we must finish the assembling process.

- We have a long list of "fragments", each holding a piece of the final section... we have to stitch all that together.
- We have some symbols that still haven't got a specific location. We should resolve them.
- We have to prepare to write the file header and the section headers.
- We have symbols in an internal format. We have to prepare to write them out in the ELF symbol format.
- References to symbols (fixups) must be resolved as far as it is possible. Of course some symbols are just externals, and can't be resolved anyway.

## 1.11.1 Write the object file

The write\_object\_file function is a very long one (more than 250 lines). Here is a detailed account of it:

- subsegs\_finish This function does mainly two things:
  - 1. Correctly align the section.
  - 2. Finish the last fragment, so that there isn't any half done fragment.
- riscv\_pre\_output\_hook This function finishes optimizations of the eh\_frame output. Basically, if a subtraction from two symbols is performed, it is feasible to substitute the subtraction by a constant when the two symbols are in the same fragment. Sometimes, however, it is impossible to know if that is the case. In that case the optimization is postponed to the end of the assembly. This is done here.
- The assembler creates some sections to store its own data. They need to be discarded now, since they aren't needed any more. Once we do that, the sections need to be renumbered since we have thrown away some.

<sup>&</sup>lt;sup>24</sup>Unix is fond of mythological names: We have magic numbers, Elfs, dwarfs, daemons...

• chain\_frchains\_together This function manipulates the next and previous pointer of the fragment chains to make a single list. Now, since we have chained everything in a single list, any new relocations must be done not relative to a fragment, but relative to the start of the big list. We record that we have done the fragment reorganization in the variable frags\_chained. This global variable is used in the function fix\_new\_internal when making a new fixup:

- merge\_data\_into\_text. If the user specified (with the -R flag) that data sections should go into the text segment to make the data read-only, we should merge the data and the text sections. This is done now.
- We keep calling relax\_segment until we record that there isn't any more changes.

rsi is a variable of type struct relax\_seg\_info<sup>25</sup>. The function map\_over\_sections just calls the function given in argument for each section in the output file.

- size\_seg. Now that the address and size of all fragments is known, we can calculate the total size of each segment. This is done in the following stages:
  - 1. Set the current segment to the one we are measuring.
  - 2. For each fragment in this section convert them to fragments without any variable part, to be able to size them. This is done in the function cvt\_frag\_to\_fill.
  - 3. Go through the list to the last element. Then:

```
size = fragp->fr_address + fragp->fr_fix;
```

- 4. Then, the section is padded to alignment if necessary.
- dwarf2dbg\_final\_check. This is interesting stuff. There is a proposal from Alexandre Oliva<sup>26</sup> that introduces the concept of "view numbers" where the same program counter can belong to several views. The underlying need for this are inlined functions, where the inlined code can belong to the current function, or it can be understood as part of the inlined function, allowing the debugger to trace through the inlined function as if it were a normal function call.<sup>27</sup>

 $<sup>^{25}\</sup>mathrm{A}$  very simple structure:

struct relax\_seg\_info {int pass; int changed;}

The pass member is incremented but never used. It is there to allow debugging infinite loops that could arise.

<sup>26</sup> https://www.fsfla.org/~lxoliva/

<sup>&</sup>lt;sup>27</sup>The whole proposal text is here:

This proposal introduces a new implicit column to the line number table, namely "view numbers", so that multiple program states can be identified at the same program counter, and extends loclists with means to add view numbers to address ranges, enabling locations to start or end at specific views.

This may improve debug information, enabling generators to indicate inlined entry points and preferred breakpoints for statements even if instructions associated with the corresponding source locations were not emitted at the given PC, and to emit variable locations that indicate the initial values of inlined arguments,

- create\_obj\_attrs\_section creates a section to hold all program attributes. The attributes should refer to the CPU type where the program can run.
- All relocations refer to symbols. So we have to resolve symbols before doing the relocations. this is done

```
if (symbol_rootP) {
    symbolS *symp;

for (symp = symbol_rootP; symp; symp = symbol_next(symp))
    resolve_symbol_value(symp);

}

resolve_local_symbol_values();

resolve_reloc_expr_symbols();
```

The resolve\_symbol\_value function tries to determine the value of a possibly very complex expression and assigning it to the symbol.

The resolve\_local\_symbol\_value organizes a traversal of the hash symbol table to resolve all local symbols.

- elf\_frob\_file\_before\_adjust will go through all symbols and will eliminate unneeded versions of versioned symbols.
- adjust\_reloc\_syms will go through all symbols and try to replace the references to symbols by references to the section symbol + offset.
- fix\_segment. This function will go through all fixups of a segment and resolve those that can be resolved at this stage. For instance if a fragment's address has been resolved any fixup mentioning this address can be resolved too. Or when a symbol has been resolved, the fixup can be eliminated.
- Now it's time to write the symbol table. The code goes through all symbols checking that:
  - 1. Local labels are defined.
  - 2. Splice out symbols that should be ignored, like symbols that were equated to bss or to undefined symbols.
  - 3. elf\_frob\_symbol Will take care of symbol versioning and associated complexities...
  - 4. Take care of "warning" symbols, i.e. symbols that are there just to generate a warning. They are just skipped.
  - 5. Take care of the infinite possibilities of bugs... For instance there could be symbols that were emitted before an alignment that ended as a zero byte alignment. They are unnecessary. Get rid of them.
- set\_symtab. This function counts the symbols, and allocates a table that will be used to store the symbols to be written out.
- elf\_frob\_file. This function does two things:
  - 1. In the case we are emitting stabs debug information, fill the header with the number of stabs, and other information.

and side effects of operations as they would be expected to take effect from the source code, even when multiple statements have their side effects all encoded at the same PC: with view numbers, debug information consumers may be able to logically advance the perceived program state, so as to reflect user-expected changes specified in the source code, even if the operations were reordered or optimized out in the executable code.

- 2. Do the checks necessary for putting in the elf file flags, the necessary description of the target machine.
- write\_relocs. Write out all relocations.
- elf\_frob\_file\_after\_relocs. If we have a group of sections, and we have established the number of relocations, it could be that a section has no longer any relocations or that the number of relocations has changed. In that case the size of the group must be adjusted.
- Once the relocations have been prepared for writing, we can compress the debug section, if necessary. This must be done before anything is written out since it makes the size of the file change.
- write\_contents. this function organizes the actual writing out of the data. It writes the fixups, the section contents and the fill data to align sections. This is done using the set\_section\_contents function. This function makes some checks and then calls elf\_set\_section\_contents.

This one makes some further checks, copies the contents into the image of the section in RAM and calls generic\_set\_section\_contents that makes some checks and positions the file pointer at the correct position, then finally calls bfd\_bwrite that will send the data to the disk with fwrite.

Described like that, this whole bunch of stacked procedures seems bloated but it is not. Each one takes a piece of the work. The GAS code is written by defensive programmers and defensive programming is not a bad idea. It pays when you have clear error messages and not bad results. Bugs provoked by missing sanity tests are very difficult to find, bugs with clear error messages spare you the time consuming search for "where is the bug?". They pop up with an error message and you instantly know where the problem is.

## 1.12 Assembler directives

Directives are defined in a table of structures of type pseudo\_typeS:

Listing 1.7: struct pseudo typeS

```
typedef struct _pseudo_type {
    /* Assembler mnemonic in lower case, without the implicit dot '.' */
const char *poc_name;
/* Function that will be called to handle this directive */
void (*poc_handler) (int);
/* Value to pass to handler. */
int poc_val;
} pseudo_typeS;
```

The assembler defines several tables of this structures. We have the main one, potable and several others: cfi\_pseudo\_table for the debug information, elf\_pseudo\_table for the directives concerning the object code format, and a riscv\_pseudo\_table for several riscv specific directives.

All of them will be called from read\_a\_source\_file function. Here is the relevant code snippet:

```
1 if (*s == '.') {
2    /* PSEUDO - OP. WARNING: Next_char may be end-of-line. We lookup the pseudo-op
3    * table with s+1 because we already know that the pseudo-op begins with a '.' */
4    pop = str_hash_find(po_hash,s + 1);
```

```
if (pop && !pop→poc_handler)
pop = NULL;
// ... code elided
/* Input_line is restored. Input_line_pointer→1st non-blank char after
/* pseudo-operation. */
(*pop→poc_handler) (pop→poc_val);
}
```

The po\_hash table is built when the assembler starts, containing the different tables mentioned above. The function that does this is very simple:

```
Listing 1.8: pop insert
```

```
1 static void pop_insert(const pseudo_typeS * table)
2 {
3
       const pseudo_typeS *pop;
      for (pop = table; pop->poc_name; pop++) {
4
           if (str_hash_insert(po_hash,pop\rightarrow poc_name,pop,0) \neq NULL) {
5
6
               if (!pop_override_ok)
                   as_fatal("error constructing %s pseudo-op table",
7
8
                       pop_table_name);
9
           //else printf("%s \n", pop \rightarrow poc_name);
10
      }
11
12 }
```

Just a loop inserting each member of the given table. The variable pop\_override\_ok is a global that will be zero if we don't accept any insertions with the same name.

That function will be called from pobegin, that looks like this:

## Listing 1.9: pobegin

```
1 static void pobegin(void)
2 {
      po_hash = str_htab_create();
      pop_table_name = "md"; /* Do the target-specific pseudo ops. */
      pop_override_ok = 0; /* Do not accept any shadowing */
      pop_insert(riscv_pseudo_table);
      pop_table_name = "obj"; /* Object specific. Skip any already present */
      pop_override_ok = 1;
      pop_insert(elf_pseudo_table);
9
      pop_table_name = "standard"; /* Now portable ones. Skip any already present */
10
      pop_insert(potable);
11
12
      pop_table_name = "cfi"; /* Now CFI ones. */
13
      pop_insert(cfi_pseudo_table);
14 }
```

This code ensures that machine specific directives shadow any object or standard directives since they are inserted first. The global variable pop\_table\_name is used for error messages only, as we have seen in the code of pop\_insert<sup>28</sup>.

# 1.12.1 .align, .p2align, p2alignw, p2alignl

Entries in the table:

```
{"align",s_align_ptwo,0},
{"p2align",s_align_ptwo,0},
```

<sup>&</sup>lt;sup>28</sup>Looking at this code I do not quite understand why there isn't an additional parameter to pop\_insert instead of a global variable. Probably it is difficult to modify the syntax for all back-ends of GAS.

```
3 {"p2alignw",s_align_ptwo,-2},
4 {"p2alignl",s_align_ptwo,-4},
```

These four entries lead to calls to the same function, albeit with different arguments.

```
void s_align_ptwo(int arg) { s_align(arg,0); }
```

s\_align receives two arguments. The first one, if positive, defines a default alignment. If negative, it defines a length of a fill pattern. The second argument, if positive, should be interpreted as a byte boundary, not as a power of two. Now, if the first argument was negative, the second argument should contain the fill pattern.

All arguments are optional. If none is given, the alignment defaults to the argument that will be given to  $s_{align_ptwo}$ .

The s\_align function calls eventually do\_align. The comment at the start of this function says it all:

```
/* Guts of .align directive: N is the power of two to which to align. A value
1
     * of zero is accepted but ignored: the default alignment of the section will
2
     * be at least this. FILL may be NULL, or it may point to the bytes of the fill
3
     * pattern. LEN is the length of whatever FILL points to, if anything. If LEN
     * is zero but FILL is not NULL then LEN is treated as if it were one. MAX is
     * the maximum number of characters to skip when doing the alignment, or 0 if
     * there is no maximum. */
     But we aren't done yet. do_align calls md_do_align that is actually a macro:
     #define md_do_align(N, FILL, LEN, MAX, LABEL)
1
     if ((N) \neq 0 && !(FILL) && subseg_text_p (now_seg)) \
2
3
         if (riscv_frag_align_code (N))
4
         goto LABEL;
5
```

The actual call sequence looks like this:

```
md_do_align(n,fill,len,max,just_record_alignment);
```

Yes, there is *still* another level. And in this level we discover that we just can't align anything. The riscv linker changes the size of some instructions, allowing compressed instructions where possible, what will change the adresses of all subsequent instructions. So, the only thing that riscv\_frag\_align\_code can do is just emit an alignment relocation that will tell the linker that this fragment needs to be aligned.

Obviously, all this lengthy process could be simplified a lot, but I have tried to keep the original structure, it may be useful to understand GAS in the context of other machines.

## 1.12.2 .ascii, .asciiz, .string, .string8, .string16, .string32, .string64

All these directives lead to the stringer function. The entries are as follows:

The stringer receives an odd argument when it should append a zero to its output. The numbers represent how many bytes should it use for each character. The input is done by following input\_line\_pointer that is a global pointer to the assembler text. stringer's code is easy to follow, so it is not further described here.

## 1.12.3 bss

Changes (if necessary) the current section to he bss. This section contains uninitialized data and will set to zero at the program's start by the loader. This directive will calls obj\_elf\_bss, a small function that realizes this change.

```
1 /* Change to the .bss section. */
2 static void obj_elf_bss(int i ATTRIBUTE_UNUSED)
3 {
4    int    temp;
5    obj_elf_section_change_hook();
6    temp = get_absolute_expression(); // Optional subsection. Normally blank
7    subseg_set(bss_section,(subsegT) temp);
8    demand_empty_rest_of_line();
9 }
```

Function obj\_elf\_section\_change\_hook remembers the section before the change so that a .section previous directive can find it. See §1.12.22, page 66 for subseg\_set.

# 1.12.4 .byte, .dc, .dc.a, .dc.b, .dc.d, .dc.l, .dc.s, .dc.w, etc

```
{"byte", cons,1},
       {"dc", cons,2},
2
       {"dc.a", cons,0},
3
       {"dc.b", cons,1},
       {"dc.d",float_cons,'d'},
5
       {"dc.1", cons,4},
       {"dc.s",float_cons,'f'},
       {"dc.w",cons,2},
       {"hword", cons, 2},
9
       {"int",cons,4},
10
       {"octa", cons, 16},
1\,1
       {"quad", cons,8},
12
       {"short",cons,2},
13
       {"long", cons, 4},
14
       {"quad", cons,8},
15
       {"word", cons, 2},
16
       {"2byte", cons, 2},
17
       {"4byte",cons,4},
18
       {"8byte",cons,8},
19
       {"half", cons,2},
20
```

GAS likes to be compatible. The consequence of that is the above list. All those directives lead to the same function. You can write a two byte constant with .short, .dc, .dc.w, .hword, .2byte and .half.  $^{29}$ 

So, what does this cons function do?

It is a fairly simple function, consisting in a loop reading expressions separated by commas. In the original code, the crucial lines look like this:

```
do {
    TC_PARSE_CONS_RETURN_TYPE ret = TC_PARSE_CONS_RETURN_NONE;
    ret = TC_PARSE_CONS_EXPRESSION(&exp,(unsigned int)nbytes);

if (rva) {
    if (exp.X_op == 0_symbol)
    exp.X_op = 0_symbol_rva;
```

<sup>&</sup>lt;sup>29</sup>The directives .2byte, .4byte, etc are used by gcc mainly within the debug information.

The problem with macros such as those here (lines 2 and 3), is that they make impossible to know what is going on actually in the program. Translated into C, these two lines expand into:

```
do {
    bfd_reloc_code_real_type ret = BFD_RELOC_NONE;
    ret = (expr(0,&exp,expr_normal),BFD_RELOC_NONE);
    ... // The rest is the same
}
```

Line 2 shows that ret is a member of the enumeration bfd\_reloc\_code\_real\_type that is assigned zero.

Line 3 is a comma expression, that in its first statement evaluates a call to expr, that reads an expression from input\_line\_pointer and in the second (and last) one evaluates to a constant that is assigned to the ret variable.

Besides this small problem, cons doesn't present any big difficulties.

#### 1.12.5 data

Tells the assembler to change (if necessary) to the data section. This directive is handled by the s\_data function:

```
Listing 1.10: s data
```

```
1 static void s_data(int ignore ATTRIBUTE_UNUSED)
2
3
      segT
                 section;
4
      int
              temp;
5
      temp = get_absolute_expression();
6
      if (flag_readonly_data_in_text) {
          section = text_section;
          temp += 1000;
      } else section = data_section;
10
      subseg_set(section,(subsegT) temp);
11
      demand_empty_rest_of_line();
12
13 }
```

If the data section is readonly, a special subsegment in the text section is used.<sup>30</sup> See §1.12.22, page 66 for subseg\_set.

## 1.12.6 Other data directives

These directives allow you to control the size of integer data being emitted at the current position.

| Directive | Description                |
|-----------|----------------------------|
| .half     | Emit an integer of 16 bits |

<sup>&</sup>lt;sup>30</sup>It could be possible to set the flags of the data section to read-only, but GAS prefers this methods for portability reasons... Not all systems probably support that.

| .word                                     | Emit an integer of 32 bits                               |
|-------------------------------------------|----------------------------------------------------------|
| .dword                                    | Emit integer of 64 bits                                  |
| .dtprelword                               | Emit a word or double word thread relative symbol for    |
| $.\mathrm{dt}\mathrm{preld}\mathrm{word}$ | DWARF debug information in thread local variables.       |
| .uleb128                                  | Emit an unsigned or signed leb128 integer at the current |
| $.{ m sleb}128$                           | position. This must be a number. No symbols allowed.     |
|                                           | These two will be fully explained in §1.12.23 page 68.   |

## 1.12.7 debug, extern, format, Iflags, name, noformat, spc, xref

All those directives have only *one* thing in common: they are completely **ignored** by the GNU assembler. It just advances the line pointer to the end of the line.

Why this?

As you guessed, it is just a compatibility feature.

```
{ "debug",s_ignore,0},
{ "extern",s_ignore,0},/* We treat all undef as ext. */
{ "format",s_ignore,0},
{ "lflags",s_ignore,0},/* Listing flags. */
{ "name",s_ignore,0},
{ "noformat",s_ignore,0},
{ "spc",s_ignore,0},
{ "xref",s_ignore,0},
```

As the comment shows, declaring a symbol *extern* doesn't do anything. The assembler declares all undefined symbols **extern**. This implies that if a misspelled name appears in your assembler program you will see it at link time, not at assembly time. No big deal anyway.

More problematic is ignoring directives like **xref** or **debug**. These directives are expected to *do* something, and silently accepting and ignoring them will provoke in people that expect some result from their directives to search in vain **why** the assembler is not doing what they have written.

This is worst than a clear error message: "unknown directive". Much worst. That is why those directives aren't accepted any more in tiny-asm, except the extern one, because that one *does* what the user is expecting.

## 1.12.8 equ, equiv, eqv, set

```
.equ symbol, expression
```

This directive sets the value of symbol to expression. It is synonymous with '.set';

This is something similar to

# $\texttt{\#define} \llcorner \texttt{name} \llcorner \texttt{another\_name}$

in C. There are some subtleties though. The equiv directive will complain if the first symbol is already defined. The eqv directive announces to the assembler that the right hand side is a forward reference.

```
1 {"equ",s_set,0},
2 {"equiv",s_set,1},
3 {"eqv",s_set,-1},
4 {"set",s_set,0},
```

The s\_set function is simple to follow.

## 1.12.9 globl

```
.global symbol[, symbol, symbol, ...]
```

.global makes the symbol visible to ld. If you define symbol in your partial program, its value is made available to other partial programs that are linked with it. Otherwise, symbol takes its attributes from a symbol of the same name from another file linked into the same program.

In the potable we have:

```
Table: (potable)
{"global",s_glob1,0},
{"glob1",s_glob1,0},
```

Unix has a big problem with vowels. They are shunned everywhere. Why write glob1? Is the absence of a poor vowel really that shorter? Or is the necessary effort of remembering its absence when writing the program (taking precious memory space in the brain) even costlier?

Well, at least the assembler lets you decide, you can use both.

Coming back to our source code, the s\_glob1 function is a very simple and short one. It just scans names and adds the EXTERNAL bit to each of the symbols scanned in a loop (not shown).

```
if ((name = read_symbol_name()) == NULL)
return;
symbolP = symbol_find_or_make(name);
S_SET_EXTERNAL(symbolP);
```

## 1.12.10 attach to group

## Syntax:

```
.attach_to_group <name>
Table: (elf_pseudo_table)
{"attach_to_group",obj_elf_attach_to_group,0},
```

This will attach the current section to the named group. If the group doesn't exist it will be created. The obj\_attach\_to\_group function just changes a pointer and the flags of the current section. The relevant lines (without error checking etc) of this function are:

```
elf_group_name(now_seg) = gname;
elf_section_flags(now_seg) |= SHF_GROUP;
```

## 1.12.11 .comm, .common, .lcomm

Only the directive .comm and .lcomm are documented in the official documentation.

## Syntax:

```
.comm symbol , length
Table: (elf_pseudo_table)
{"comm",obj_elf_common,0},
{"common",obj_elf_common,1},
{"lcomm",obj_elf_lcomm,0},
```

.comm declares a common symbol named symbol. When linking, a common symbol in one object file may be merged with a defined or common symbol of the same name in another object file. If ld does not see a definition for the symbol—just one or more

common symbols—then it will allocate length bytes of uninitialized memory. length must be an absolute expression. If ld sees multiple common symbols with the same name, and they do not all have the same size, it will allocate space using the largest size.

.lcomm (local common) has the same syntax as comm but the symbol is just declared in the bss section and not make visible.

.common is a synonym for comm even if it receives a different argument because actually... the argument is ignored!

The function **s\_comm\_internal** is mostly parsing and error checking. The essential lines are at the end:

```
S_SET_VALUE(symbolP,(valueT) size);
S_SET_EXTERNAL(symbolP); // This is absent in lcomm
S_SET_SEGMENT(symbolP,bfd_com_section_ptr);
```

## 1.12.12 hidden

#### Syntax:

```
.hidden symbol-name [, symbol-name, ...]
```

Sets the visibility of a symbol, i.e. if it is visible for modules outside the one being assembled. This directive implies *protected* as well.

It is handled by the obj\_elf\_visibility function.

## 1.12.13 ident

```
Syntax:
    .ident "A string"
Table: elf_pseudo_table
{"ident",obj_elf_ident,0},
```

This directive writes any string into the comments section of the file. For instance:

```
.ident "I love you Barbie"
```

Assembling your file, you can display it to your girlfriend with:

```
star64: "/tiny-asm$ asm sample.s
star64: "/tiny-asm$ objdump -s -j .comment a.out
a.out: file format elf64-littleriscv

Contents of section .comment:
0000 0049206c 6f766520 796f7520 42617262 .I love you Barb
0010 696500 ie.
```

She will be surely greatly impressed... The obj\_elf\_ident function creates the .comments section if it is not already present. Then, it calls the stringer for parsing. You can write any number of these comments.

## 1.12.14 insn

```
Syntax:
.insn type, operand [,...,operand_n]
.insn insn_length, value
.insn value
Table: riscv_pseudo_table
{"insn",s_riscv_insn,0},
```

This directive assembles an unknown instruction into the instruction stream. For instance, using the first type of syntax, let's say you want to want to issue the instruction add a0,a1,a2. First, you have to look up what type of instruction it is. It is an "R" type of instruction. You write as first argument "r".

After the type, you should give the fields of the R format that are fixed: the opcode, the extra-3 and the extra-7 fields. In this case both are zero. And then, you should give the arguments of the instruction, i.e. the register names.

You should write then:

```
insn r 0x33, 0, 0, a0,a1,a2
```

Note that there isn't any comma between the "r" and the 0x33! The "r" is understood as a part of the opcode.

Now where does this 0x33 come from?

If you go to the opcode table, and search for the "add" entries, you will see several of them. You should choose this one:

```
{"add",0,INSN_CLASS_I,"d,s,t",MATCH_ADD,MASK_ADD,match_opcode,0},
```

since the other ones further up are compressed (INSN\_CLASS\_C) and we do not want compression. The opcode is in the MATCH\_ADD field, that is defined in asm.h to be... 0x33. After the two zeroes of the bit fields associated with class "R" we write the 3 required register names.

How can we know that this is OK?

Easy: just write following assembler program:

```
add a0,a1,a2
2 .insn r 0x33, 0, 0, a0,a1,a2
```

Then assemble it, and then display the contents with

```
star64:~/tiny-asm$ objdump -d sample.o

sample.o: file format elf64-littleriscv

Disassembly of section .text:

000000000000000004 <main>:
4: 00c58533 add a0,a1,a2
8: 00c58533 add a0,a1,a2
```

We find the 0x33 in the lower 7 bits of the opcode field.

The other syntax variants of the directive are trivial.

Another example: the instruction addw a0,a1,a2. The entry in the opcode table is: \[ \[ \"addw",64,INSN\_CLASS\_I,"d,s,t",MATCH\_ADDW,MASK\_ADDW,match\_opcode,0 \],

We look the constant MATCH\_ADDW in asm.h, what gives 0x3b. So, as shown in 1.4 page 30, the two fields "extra-3" and "extra-7" are zero. We write then:

```
addw a0,a1,a2
insn r 0x3b, 0, 0, a0,a1,a2
and when disassembling we get:

4: 00c58533 add a0,a1,a2
8: 00c58533 add a0,a1,a2
c: 00c5853b addw a0,a1,a2
4 10: 00c5853b addw a0,a1,a2
```

The s\_riscv\_insn function essentially just calls riscv\_ip. The lookup of the "r" letter yields an entry into the riscv\_insn\_types table, that looks like this:

```
["r",0,INSN_CLASS_I,"04,F3,F7,d,s,t",0,0,match_opcode,0],
```

where we see the length of the instruction (4 bytes) and the names of the 3 and 7 bits extra fields. Then, we find the usual denominations ("d,s,t") that we discussed when analyzing the string arguments to each opcode, see table 1.8 page 43.

**Conclusion** This is quite difficult stuff, because precisely the point of an assembler is to avoid you to encode manually the instructions. It is a *very* error prone process. And in the end if you write:

```
.word 0xc58533
```

it will work in the same way. The justification advanced by the GNU folks is that in future versions of the assembler you will *not* see this as just data, but as a real instruction.

Maybe. But I think a more real justification is that the riscv architecture itself allows for instruction extensions, and has a whole part of the instruction space available for standard or non-standard extensions to the accepted opcodes. The existence of an **insn** extension here, would allow the assembler to assemble code that uses those extensions.

## 1.12.15 internal

```
Syntax:
```

```
.internal symbol-name [, symbol-name, ...]
```

Sets the visibility of a symbol, i.e. if it is visible for modules outside the one being assembled. This directive implies *protected* as well.

It is handled by the obj\_elf\_visibility function. function.

#### 1.12.16 loc

```
Syntax:
```

Ahhh the old days, when everything was simple and clear! Remember when the debug information for the line number was just a triplet of address, file, line?

Say goodbye to that now, and welcome to DWARF<sup>31</sup>. The line number is a series of instructions to an interpreted language executed by a state machine.

Yes, you read correctly.

Conceptually we have a table of addresses, each one with as many properties as desired:

<sup>&</sup>lt;sup>31</sup>Critiques to DWARF abound. See for instance: https://tobast.fr/doc/publications/oopsla19-dwarf.pdf

| address | source | source | source                  | state- | basic | other           |
|---------|--------|--------|-------------------------|--------|-------|-----------------|
|         | file   | line   | $\operatorname{column}$ | ment?  | block | $_{ m columns}$ |
| 0x40260 | 1      | 23     | 12                      | 0      | 0     |                 |
| 0x40264 | 1      | 23     | 12                      | 1      | 1     |                 |

.

"... we design a byte-coded language for a state machine and store a stream of bytes in the object file instead of the matrix. This language can be much more compact than the matrix. When a consumer of the line number information executes, it must "run" the state machine to generate the matrix for each compilation unit it is interested in." <sup>32</sup>

The arguments for the .loc directive then, are as follows:

- fileno. The file index in the assembler's file table.
- lineno. Line number.
- column. This field is optional.
- options. They are the following:
  - basic\_block This instruction represents the start of a basic block.<sup>33</sup>
  - prologue\_end. End of the setup of the stack frame. This changes the state of the interpreter. In C it corresponds to the opening brace of a function.
  - is\_stmt⊔value Start of a statement sequence.
  - isa⊔value Sets the instruction set architecture register to value
  - An unsigned integer identifying the block to which the current instruction belongs. Discriminator values are assigned arbitrarily by the DWARF producer and serve to distinguish among multiple blocks that may all be associated with the same source file, line, and column. Where only one block exists for a given source position, the discriminator value should be zero. This is necessary because the compiler can move instructions around to keep the pipeline busy. Then, instructions belonging a one or several blocks could be mixed.
  - view. This is not in the 4th edition of the DWARF standard nor in the 5th. It has been added later probably. The documentation says:

This option causes a row to be added to .debug\_line in reference to the current address (which might not be the same as that of the following assembly instruction), and to associate value with the view register in the .debug\_line state machine. If value is a label, both the view register and the label are set to the number of prior .loc directives at the same program location. If value is the literal 0, the view register is set to zero, and the assembler asserts that there aren't any prior .loc directives at the same program location. If value is the literal -0, the assembler arrange for the view register to be reset in this row, even if there are prior .loc directives at the same program location.

Crystal clear isn't? 34

<sup>&</sup>lt;sup>32</sup>DWARF Debugging Information Format Version 4, page 108

<sup>&</sup>lt;sup>33</sup>A basic block is a sequence of instructions where only the first instruction may be a branch target and only the last instruction may transfer control. A procedure invocation is defined to be an exit from a basic block.

<sup>&</sup>lt;sup>34</sup>There is no other documentation anywhere that would state what this thing does in a more understandable way... Sorry.

The function dwarf2\_directive\_loc is interesting as an example of the functions used to parse data within the assembler. To make things a bit clearer I have added comments to everything.

Listing 1.11: Parsing .loc directive

```
1 static void
                dwarf2_directive_loc(int dummy ATTRIBUTE_UNUSED)
2 {
3
       /* If we see two .loc directives in a row, force the first one to be output now.*/
 4
      if (dwarf2_loc_directive_seen) dwarf2_emit_insn(0);
      offsetT filenum = get_absolute_expression();
5
                                                                        get absolute expression
      SKIP_WHITESPACE();
6
      offsetT line = get_absolute_expression();
      /* error checking: */
      if (filenum < 1) {
10
          /* DWARF5 specifies that a file number of zero indicates that
11
             the file is unknown */
12
          if (filenum == 0 && dwarf_level < 5) dwarf_level = 5;</pre>
          /* All other values are just nonsense */
13
          if (filenum < 0 || DWARF2_LINE_VERSION < 5) {
14
              as_bad("file number less than one");
15
              return;
16
          }
17
      }
18
      if ((valueT) filenum > files_in_use || files[filenum].filename == NULL) {
19
          as_bad("unassigned file number %ld",(long)filenum);
20
21
22
      }
                                                                        debug_type
      gas_assert(debug_type == DEBUG_NONE);
23
      current.filenum = filenum:
24
      current.line = line;
25
                                                                        current
      current.discriminator = 0;
26
      SKIP_WHITESPACE();
27
       /* test for an optional column number */
      if (ISDIGIT(*input_line_pointer)) {
           /* We have the optional column number */
30
          current.column = get_absolute_expression(); SKIP_WHITESPACE();
31
      }
32
      /* Now we start parsing the "options" field */
33
      while (ISALPHA(*input_line_pointer)) {
34
                                                                        get symbol name
          char
                         *p,c = get_symbol_name(&p);
                                                             4
35
          offsetT
                         value;
36
          if (strcmp(p, "basic_block") == 0) {
37
              current.flags |= DWARF2_FLAG_BASIC_BLOCK;
38
              *input_line_pointer = c; // Restore character
39
          } else if (strcmp(p, "prologue_end") == 0) {
40
              if (dwarf_level < 3) dwarf_level = 3;</pre>
41
              current.flags |= DWARF2_FLAG_PROLOGUE_END;
42
              *input_line_pointer = c;
43
          } else if (strcmp(p, "epilogue_begin") == 0) {
44
              if (dwarf_level < 3) dwarf_level = 3;</pre>
              current.flags |= DWARF2_FLAG_EPILOGUE_BEGIN;
46
47
              *input_line_pointer = c;
          } else if (strcmp(p,"is\_stmt") == 0) { // is\_stmt < boolean value >}
48
```

```
(void)restore_line_pointer(c);
49
              value = get_absolute_expression();
               if (value == 0) current.flags &= ~DWARF2_FLAG_IS_STMT;
               else if (value == 1) current.flags |= DWARF2_FLAG_IS_STMT;
               else { as_bad("is_stmt value not 0 or 1"); return; }
53
           } else if (strcmp(p,"isa") == 0)  { // "isa" numbers are defined by the ABI
54
               if (dwarf_level < 3) dwarf_level = 3;</pre>
55
               (void)restore_line_pointer(c);
56
                                                                        restore line pointer
               value = get_absolute_expression();
57
58
               if (value > 0) current.isa = value;
59
60
                  as_bad("isa number less than zero");
61
                  return:
              }
62
           } else if (strcmp(p, "discriminator") == 0) {
63
64
               (void)restore_line_pointer(c);
               value = get_absolute_expression();
65
              if (value \geq 0) current.discriminator = value;
66
               else {
67
                  as_bad(("discriminator less than zero"));
68
                  return;
69
70
              }
           } else if (strcmp(p,"view") == 0) {
           /* Now we parse the mysterious "view" statement. */
               symbolS
                             *sym;
74
               (void)restore_line_pointer(c);
              SKIP_WHITESPACE();
75
               if (ISDIGIT(*input_line_pointer) || *input_line_pointer == '-') {
76
77
                   * Now, we expect either "0" or "-0"
78
                   */
79
                  bool
                              force_reset = *input_line_pointer == '-';
80
                  value = get_absolute_expression();
81
                  if (value \neq 0) {
                      as_bad("numeric view can only be asserted to zero"); return;
83
                  }
84
85
                  if (force_reset && force_reset_view) sym = force_reset_view;
86
                  else {
87
                      sym = symbol_temp_new(absolute_section,&zero_address_frag,value);
                      if (force_reset) force_reset_view = sym;
88
89
              } else { // We have a symbol that will be put into the "view" register.
90
91
                                 *name = read_symbol_name();
                  // We silently accept .loc view followed by nothing, without
                   // any warning or error.
93
                  if (!name) return;
94
95
                  sym = symbol_find_or_make(name);
                  free(name); // read_symbol_name allocates memory for its result
96
                  if (S_IS_DEFINED(sym) || symbol_equated_p(sym)) {
97
                      if (S_IS_VOLATILE(sym)) sym = symbol_clone(sym,1);
98
                      else if (!S_CAN_BE_REDEFINED(sym)) {
99
                          as_bad("symbol '%s' is already defined",S_GET_NAME(sym));
100
                              return; }
101
                  S_SET_SEGMENT(sym,undefined_section); S_SET_VALUE(sym,0);
102
                  symbol_set_frag(sym,&zero_address_frag);
103
104
105
              current.u.view = sym;
```

```
} else {
106
               as_bad("unknown .loc sub-directive '%s'",p);
107
               (void)restore_line_pointer(c); return;
108
109
           /* This macro differs from SKIP_WHITESPACE in that it ignores a double quotes
110
            * after the name */
111
           SKIP_WHITESPACE_AFTER_NAME();
112
113
                                                                  demand empty rest of line
       demand_empty_rest_of_line();
       dwarf2_any_loc_directive_seen = dwarf2_loc_directive_seen = true;
115
       /* If we were given a view id, emit row now */
116
       if (current.u.view) dwarf2_emit_insn(0);
117
118 }
```

- 1. The function get\_absolute\_expression reads a constant from the global line pointer and sets it to just after the last character of the constant. If any error occurs, it emits an error message and returns zero. If the expression is absent, it returns zero without any error message. This makes many things default to a convenient zero.
- 2. The value in the global variable debug\_type will be turned off by the function dwarf2\_directive\_filename, and if we don't have a dwarf style .file directive in between, then files\_in\_use will be zero and the error in line 15 will trigger.

  Note: The global debug\_type will be left to zero, effectively disabling the emission of any debug information by the assembler.
- 3. current is a structure of type dwarf2\_line\_info that holds the current context. We update it AFTER all error checking is done, to preserve a correct context in case of an error
- 4. The get\_symbol\_name function parses a symbol using input\_line\_pointer. It writes a zero immediately after the expected symbol and returns the value of the character at the position where zero was written. Its result is left in its pointer argument, that will point to the start of the symbol.
- 5. restore\_line\_pointer writes the previous character into the line pointer, advances to the next character and if it is a double quote, it ignores it by advancing again.
- 6. demand\_empty\_rest\_of\_line advances the line pointer to the next newline character. If there is anything in that part of the line it will complain with an error.

## 1.12.17 local

#### Syntax:

```
.local symbol,symbol,...
Table: elf_pseudo_table
{"local",obj_elf_local,0},
```

This directive makes the given symbol a local symbol, not visible to other modules. Since all symbols are local unless declared extern or undefined, the utility of this is not clear.

The important lines of obj\_elf\_local are:

```
symbolP = get_sym_from_input_line_and_check();
S_CLEAR_EXTERNAL(symbolP);
symbol_get_obj(symbolP) \rightarrow local = 1; // See below
```

The function symbol\_get\_obj returns a pointer to a small structure that keeps several disjoint pieces of information about a symbol, among them, whether it is a local symbol. We can't access directly the field because of local symbols precisely.

```
1 /* Get a pointer to the object format information for a symbol. */
2 static struct elf_obj_sy *symbol_get_obj(symbolS * s)
3 {
4     if (s \rightarrow flags.local_symbol)
5         s = local_symbol_convert(s);
6     return &s \rightarrow x \rightarrow obj;
7 }
```

If the symbol is local, we have to convert it first. See §1.5.2, page 18 to see where this piece fits in the general schema of things.

# 1.12.18 option

# Syntax: .option <option-name> Table: riscv\_pseudo\_table {"option",s\_riscv\_option,0},

This handles the update of several riscv related options. The example given in the GAS documentation runs as follows:

```
.option push
.option norelax
la gp, __global_pointer$
.option pop
```

In the "relaxation" process, the assembler tries to find shorter, compressed, sequences for instructions. It tries to substitute loading a global directly, for a shorter sequence that loads the address from an offset from the <code>\_\_global\_pointers</code> table. The problem arises when you want to load the address of the <code>\_\_global\_pointers</code> table itself. In that case you do NOT want the assembler to pick an offset since the <code>\_\_global\_pointers</code> table is not loaded. Then, you disable for a single instruction, this feature and all goes well.

Of course this happens only to people that are writing the startup code, or other assembler wizards. This kind of fiddling is *for them only*. Please do not mess around with any of this things yourself.

The code for s\_riscv\_options is trivial: a long series of:

```
if (strcmp(name,"push") == 0) { /* code for push option */}
else if (strcmp(name,"pop") == 0 {/* code for pop option */})
etc...
```

Other interesting values for .option are:

- pic or nopic. Enable or disable the position independent code generation. This corresponds to the -fPic flag in gcc.
- rvs or norvc. Enable or disable the compressed instructions generation.
- relax or norelax. Enable or disable relaxation.
- csr-check or nocsr-check. Enables or disables checking when using the CSR registers.
- Etc. There are many other obscure things to peruse here: binutils-docs

# 1.12.19 org

```
Syntax:
.org new-location-counter , fill byte
```

Advance the location counter of the current section to *new-location-counter*. It should be either an absolute expression or an expression with the same section as the current subsection. That is, you can't use .org to cross sections: if it has the wrong section, the .org directive is ignored. To be compatible with former assemblers, if the section of new-lc is absolute, as issues a warning, then pretends the section of new-lc is the same as the current subsection.

# 1.12.20 protected

# Syntax:

```
.protected symbol-name [, symbol-name, ...]
```

Sets the visibility of a symbol, i.e. if it is defined for modules outside the one being assembled, the definition in this module will be used.

It is handled by the obj\_elf\_visibility function. function.

#### 1.12.21 reloc

The documentation of GAS says about this directive:

#### Syntax:

```
.reloc offset, reloc_name[, expression]
```

Generate a relocation at offset of type reloc\_name with value expression. If offset is a number, the relocation is generated in the current section. If offset is an expression that resolves to a symbol plus offset, the relocation is generated in the given symbol's section. expression, if present, must resolve to a symbol plus addend or to an absolute value, but note that not all targets support an addend. e.g. ELF REL targets such as i386 store an addend in the section contents rather than in the relocation. This low level interface does not support addends stored in the section.

The last part of the description needs maybe a clarification. In the x86 systems, the addend to the relocation is stored in the data itself, so the program loader should only add the load address. This makes constructing relocations with an addend impossible.

Why is this directive necessary? Mystery, the official documentation gives no examples, and (with my limited imagination) I just can't figure out its use.<sup>35</sup>

Well, the only way of figuring out this, is to use it and see what it does. I write this in C:

```
long double mm = 3.1415926534564321;
1
     int main(void) {}
     I compile it with: gcc -c -S tld.c and obtain a tld.s assembler file:
      .file
             "tld.c"
1
2
      .size
             mm, 16
3
     mm:
            0
4
      .word
             -1610612736
      .word
5
             -1253836416
      .word
6
             1073779231
      .word
      .text
      .globl main
```

 $<sup>^{35}</sup>$ In the documentation of the ARM assembler I found a similar RELOC directive that (seems) to force the assembler to put either a symbol or the preceding instruction at a specific address, like the .org directive, but I am not sure

```
.type main, @function
main:
jr ra
```

We have then, a long double in the data section. I start gdb:

```
(gdb) print &mm
$1 = (<data variable, no debug info> *) 0x2aaaaac010 <mm>
```

OK, now I add the line: .reloc 8,BFD\_RELOC\_32,mm after the last .word in the definition of mm. I start gdb with the new program and...

```
(gdb) print &mm
$1 = (<data variable, no debug info> *) 0x2aaaaac010 <mm>
```

The address is the same, the contents of the long double constant are the same, nothing changed. Weird.

Next thing: Change the text segment? I add the same reloc directive just before the jr ra at the end of main. Now I obtain:

```
(gdb) b main
Breakpoint 1 at 0x66c
(gdb) run
Starting program: /home/jacob/tiny-asm/tld-reloc
/home/jacob/tiny-asm/tld-reloc: error while loading shared libraries:\
unexpected reloc type 0x01
[Inferior 1 (process 1474) exited with code 0177]
```

Great! Now something seems to have changed. I can't run the program. The relocation is probably disturbing something in the program loader.

## Conclusion

- 1) Do not mess around with this unless you know exactly what you are doing...
- 2) If you know what you are doing... please let me know.



#### 1.12.22 text

Tells the assembler to change (if necessary) to the text section. Data and instructions will go at the end of that section.<sup>36</sup>

The change is handled by the s\_text function:

```
Listing 1.12: s_text function
```

```
1 static void s_text(int ignore ATTRIBUTE_UNUSED)
2 {
3    int temp = get_absolute_expression();
4    subseg_set(text_section,(subsegT) temp);
5    demand_empty_rest_of_line();
6 }
```

The function subseg\_set is used in several other functions to change the current section/segment.

<sup>&</sup>lt;sup>36</sup>Remember that "text" in this context has *nothing* to do with a text format, in the usual sense of the word. There is no text sequences here, unless you put a text sequence yourself.

Listing 1.13: Code of subseg set

```
1 static void subseg_set(segT secptr,subsegT subseg)
2 {
3
       if (!(secptr == now_seg && subseg == now_subseg))
            subseg_set_rest(secptr,subseg);
4
5 }
6 static void subseg_set_rest(segT seg, subsegT subseg)
7 {
                        *frcP; /* crawl frchain chain */
       frchainS
8
                       **lastPP; /* address of last pointer */
       frchainS
                        *newP; /* address of new frchain */
10
       frchainS
11
       segment_info_type *seginfo;
12
13
       if (frag_now && frchain_now)
            frchain_now -> frch_frag_now = frag_now;
                                                                                               (1)
14
15
       subseg_change(seg,(int)subseg);
                                                                                               (2)
       seginfo = seg_info(seg);
                                                                                               (3)
16
       /* Should the section symbol be kept? Yes. */
17
       seg >> symbol >> flags |= BSF_SECTION_SYM_USED;
                                                                                               (4)
18
19
       /* Attempt to find or make a frchain for that subsection. We keep the
        * list sorted by subsection number. */
20
21
       for (frcP = *(lastPP = &seginfo→frchainP); frcP ≠ NULL;
22
                frcP = *(lastPP = &frcP-frch_next))
                if (frcP \rightarrow frch\_subseg \ge subseg)
23
                    break;
24
       if (frcP == NULL \mid \mid frcP\rightarrowfrch_subseg \neq subseg) {
25
       /* Not found. Make a new. This should be the only code that creates a frchainS.*/
26
           newP = (frchainS *) obstack_alloc(&frchains, sizeof(frchainS));
27
28
           newP -> frch_subseg = subseg;
29
           newP -> fix_root = NULL;
           newP \rightarrow fix_tail = NULL;
30
31
            obstack_begin(&newP \rightarrow frch_obstack, CHUNKSIZE);
            obstack_alignment_mask(&newP->frch_obstack) = __alignof__(fragS) - 1; (5)
           \texttt{newP} {\rightarrow} \texttt{frch\_frag\_now} = \texttt{frag\_alloc(\&newP} {\rightarrow} \texttt{frch\_obstack)};
33
           newP \rightarrow frch_frag_now \rightarrow fr_type = rs_fill;
34
           newP \rightarrow frch_cfi_data = NULL;
35
           \texttt{newP} {\rightarrow} \texttt{frch\_root} = \texttt{newP} {\rightarrow} \texttt{frch\_last} = \texttt{newP} {\rightarrow} \texttt{frch\_frag\_now};
36
            *lastPP = newP; // Insert in chain
37
           newP -> frch_next = frcP;
38
            frcP = newP;
39
       }
40
41
       frchain_now = frcP;
       frag_now = frcP \rightarrow frch_frag_now;
42
43 }
```

- 1. Make sure that frchain\_now has a correct pointer in frch\_frag\_now.
- 2. subseg\_change is a small function that sets the global variables now\_seg and now\_subseg to the values given, and, if necessary, allocates the seg\_info structure.
- 3. seg\_info is just a macro that accesses the structure in the userdata of the bfd.
- 4. The original code used a function call and was just too complicated for setting a flag. It was: if (bfd\_keep\_unused\_section\_symbols(stdoutput)) that returned always true...
- 5. The gnu C construct \_\_alignof\_\_ has an equivalent in the C standard of 2011: \_Alignof. In the code above it should be \_Alignof(fragS).

## 1.12.23 uleb128, sleb128

```
Syntax:
.uleb128 value
.sleb128 value
Table: riscv_pseudo_table
{"uleb128",s_riscv_leb128,0},
{"sleb128",s_riscv_leb128,1},
```

These instructions encode a number using a special format. There is also a general directive for all machines that has the same syntax.

To encode an unsigned number:

- 1. Split the number in 7 bit chunks
- 2. Read the 7 bits of the lowest significant bits into a byte.
- 3. Set the most significant bit of the byte to 1 if more bytes follow, to zero otherwise.
- 4. Output 1 byte and shift the value right by 7 bits.

```
Listing 1.14: output uleb128
```

```
1 static unsigned int output_uleb128(char *p,valueT value)
2 {
      char
                     *orig = p;
3
      unsigned byte;
4
          byte = (value & 0x7f);
          value \geq 7;
          if (value \neq 0)
9
          /* More bytes to follow. */
10
             byte |= 0x80;
11
          *p++ = byte; // If value was zero, byte is zero
12
      } while (value \neq 0);
13
      return p - orig;
14
```

A signed number has a different encoding. Example: Encode -98765432

- 1. Ignore the minus sign. Binary representation is 0101 1110 0011 0000 1010 0111 1000, a 27 bit number padded to 28 with zero.
- 2. Negate all bits, what gives: 1010 0001 1100 1111 0101 1000 0111
- 3. Add 1, what gives: 1010 0001 1100 1111 0101 1000 1000
- 4. Split into 7 bit groups: 1010000 1110011 1101011 0001000
- 5. Add high 1 bit in all but the most significant one 01010000 11110011 11101011 10001000  $\rightarrow$  0x50F3EB88

The code for this is written in a quite complicated way, maybe because the code doesn't do step 1 above or because some machine under some OS is behaving badly...

Listing 1.15: output sleb128

```
static inline unsigned int output_sleb128(char *p,offsetT value)
2 {
                    *orig = p;
3
      char
4
      int
              more;
5
6
      do { unsigned byte = (value & 0x7f);
      /* Sadly, we cannot rely on typical arithmetic right shift behaviour. Fortunately,
       * we can structure things so that the extra work reduces to a noop on systems
8
       * that do things "properly". */
9
          value = (value >> 7)|~(-(offsetT) 1 >> 7);
10
          more = !((((value == 0) && ((byte & 0x40) == 0))
11
          || ((value == -1) && ((byte & 0x40) \neq 0))));
12
          if (more) byte |= 0x80;
13
          *p++ = byte;
14
      } while (more);
15
16
      return p - orig;
17 }
```

# 1.12.24 Other directives

In general, the code for handling directives is simple and easy to follow. There is no need to detail all of that here.

## 1.13 The cfi directives

CFI stands for Call Frame Information. The objective of these directives is to furnish to a debugger enough information so that at any address within the program, the layout of the stack is clear.

The C++ language uses also this kind of information for another purpose: to rewind the stack, looking for a procedure that will *catch* an exception that has been thrown somewhere in the program. To be able to reconstruct the stack at any moment, big tables are generated, that give the stack unwinding machinery all the information needed to rewind the stack.

Before we get into the details, we need to explain some concepts. We begin with the concept of the *stack frame*, i.e. the portion of the stack used by the currently running function. When a function call is executed, both the riscv CPU and the ARM cpu copy the address of the next instruction into a special register. At the end of the called function, the last instruction that is executed is a jump to the address stored into that register.

Other machines like the x86 family, do not have a link register and the machine pushes the return address into the stack, decreasing the stack by the address size and writing into the new space the return address. Under the riscv/ARM RISC machines we have a link register that allows to avoid (sometimes) to store the return address in memory.

The stack address at the moment of the call is called Canonical Frame Address or CFA.

The first thing the called procedure does is to save the permanent registers that it will use. All machines have in their ABI a list of registers that are preserved across calls (the permanent registers) and other scratch registers that are used freely, without any obligation to preserve their contents. A procedure then, needs to store the current values of those registers in the stack to be able to restore them at the end to their previous values.

To be able to reconstruct the data that is active at procedures higher in the stack, the debugger or the stack unwinding machinery must restore the values of the saved registers, so the addresses and register numbers must be stored in the tables for each procedure. Starting with the current instruction pointer, the debugger restores the values of the previous CFA,

virtually returning from a procedure, what allows it to show the values of all the variables of that procedure, and so on.

The debug information is independent of the type of machine being used, what complicates further things.

- Compilers can duplicate the epilogue to avoid executing a jump instruction to a common one.
- Sometimes a procedure uses a frame pointer register, sometimes they use directly the stack pointer.
- Within the prologue or epilogue, the stack can change. Some compilers will use a push instruction for each register saved, some others will subtract from the stack a fixed amount, and save the registers at fixed offsets from the stack or frame pointer.
- Sometimes a preserved register will be saved in a scratch register, and restored later without using a stack frame...
- Some machines use a bit-mask for saving the registers in a single instruction.
- Etc. There are many other special conditions, weird designs that needed not to be mentioned here.

## 1.13.1 Concepts

The .eh\_frame section contains two things: a CIE (Common Information Entry) and several FDEs or Frame Description Entry) records.



The CIE

Table 1.13: Common Information Entry fields

| Field                   | Description                                                              |  |  |
|-------------------------|--------------------------------------------------------------------------|--|--|
| Length                  | A 4 byte unsigned value indicating the length in bytes of the CIE        |  |  |
|                         | structure, not including the Length field itself. If Length contains the |  |  |
|                         | value 0xffffffff, then the length is contained in the Extended Length    |  |  |
|                         | field. If Length contains the value 0, then this CIE shall be considered |  |  |
|                         | a terminator and processing shall end.                                   |  |  |
| Extended                | Optional, see above. In practice, this is never used.                    |  |  |
| $\operatorname{Length}$ |                                                                          |  |  |
| CIE-ID                  | A 4 byte value that is used to distinguish between CIEs and FDEs.        |  |  |
|                         | In CIEs it will be always zero.                                          |  |  |
| Version                 | This is a single byte and should be 1.                                   |  |  |
| Augmentation            | This is a series of byte codes that are interpreted (sounds familiar?)   |  |  |
|                         | See below.                                                               |  |  |
| Code alignment          | An unsigned leb128 encoded value that represents the units used in       |  |  |
|                         | the "advance location" instructions in this CIE and its associated       |  |  |
|                         | FDEs.                                                                    |  |  |

This field is only mentioned in the MaskRay blog. All other official address return documents do not mention it<sup>37</sup> register Similar to the code alignment factor above Data alignment factor Unsigned leb128 encoded value. This field is only present if the aug-Augmentation mentation string contains the 'z' character. length Augmentation A block of data, that is interpreted according to the augmentation data The augmentation string characters Indicates there is some data there. Must be the first character. The FDEs contain pointers to language specific data. This is a single byte that indicates how those pointers are encoded. 'nP; This indicates the presence of two items: 1) A single byte that specifies how the second item, a pointer, is encoded. 2) The second item is encoded according to the type of encoding described by the first, and it represents a pointer to a personality routine, i.e. some routine that will be used to unwind the stack according to the language preferences. An associated FDE describes a signal frame, i.e. an interrupt procedure  $^{38}$ .

Table 1.13: Common Information Entry fields

## The FDE

FDE stands for Frame Description Entry.

Table 1.14: FDE fields

| Field                                                                 | Description                                                          |
|-----------------------------------------------------------------------|----------------------------------------------------------------------|
| Length                                                                | In 4 bytes                                                           |
| Extended                                                              | Same specs as in the CIEs above                                      |
| $\operatorname{Length}$                                               |                                                                      |
| CIE pointer                                                           | A 4 byte unsigned value that when subtracted from the offset of the  |
|                                                                       | the CIE Pointer in the current FDE yields the offset of the start of |
|                                                                       | the associated CIE.                                                  |
| Program This is a pointer encoded according to the method specified b |                                                                      |
| Counter be- 'R' character in the CIE <sup>39</sup>                    |                                                                      |
| gin                                                                   |                                                                      |
| PC range                                                              | An absolute value that tells how long the code section is.           |
| Augmentation Unsigned leb128 encoded value that contains the length   |                                                                      |
| $\operatorname{length}$                                               | lowing data                                                          |

<sup>&</sup>lt;sup>37</sup>The riscv specification mentions explicitly that other registers could contain the return address.

There is no dedicated stack pointer or subroutine return address link register in the Base Inte-

There is no dedicated stack pointer or subroutine return address link register in the Base IntegerISA; the instruction encoding allows any x register to be used for these purposes. However, the standard software calling convention uses register x1 to hold the return address for a call, with register x5 available as an alternate link register. The standard calling convention uses register x2 as the stack pointer.

RISC-V Unprivileged ISA V20191214-draft, page 14 For RISC-V machines then, this field *could* be useful. In any case, the software representation has a field "return column".

<sup>&</sup>lt;sup>38</sup>This letter is not mentioned in the Linux Standard Base specifications release 5, but it is mentioned in the MaskRay blog.

<sup>39...</sup> as far as I have understood this mess.

Table 1.14: FDE fields

| Augmentation                | Contains pointers encoded according to the prescriptions of the CIE |
|-----------------------------|---------------------------------------------------------------------|
| data                        |                                                                     |
| Call frame in-              | A set of call frame instructions.                                   |
| $\operatorname{structions}$ |                                                                     |

## Software representation

A CIE will be described by the following structure in asm.h:

```
1 struct cie_entry {
2    struct cie_entry *next;
3    symbolS    *start_address;
4    unsigned return_column;
5    unsigned signal_frame;
6    unsigned char fde_encoding;
7    unsigned char per_encoding;
8    unsigned char lsda_encoding;
9    expressionS personality;
10    struct cfi_insn_data *first,*last;
11 };
```

The cie\_entry structure will be built in the function cfi\_finish. An FDE is described by the following structure:

```
1 struct fde_entry {
                                     Linked list
2
      struct fde_entry *next;
      symbolS
3
                  *start_address;
                                     start
      symbolS
                   *end_address;
                                      end
      struct cfi_insn_data *data;
5
      struct cfi_insn_data **last;
                                   Always DW_EH_PE_omit
      unsigned char per_encoding;
      unsigned char lsda_encoding; Always DW_EH_PE_omit
                                     Not supported in riscv
             personality_id;
      expressionS personality;
10
      expressionS lsda;
11
12
      unsigned return_column;
13
      unsigned signal_frame;
14
      int
             eh_header_type;
      /* Compact unwinding opcodes, not including the PR byte or LSDA. */
15
      int eh_data_size;
16
      uint8_t
                   *eh_data;
17
      symbolS
                   *eh_loc;
                                      Not used in riscv
18
19
      int
            sections;
20 };
```

The constructor is cfi\_new\_fde. It receives a label symbol as argument, and the fde will start at that label. Calls alloc\_fde\_entry to allocate and fill the new structure with default values. The default "return column" is 1, as the ABI specifies<sup>40</sup>.

# 1.13.2 An example

Let's see how the debug information is organized with a simple example. Given the following C program:

 $<sup>^{40}</sup>$ This is a misnomer. It is not a "column" but a register number actually. Columns in the virtual table correspond to register numbers.

```
1 #include <stdio.h>
2 int main(void)
      printf("hello\n");
5 }
  ... sorry for this lack of any imagination. Now, if we compile this with:
1 star64: ~/tiny-asm $ gcc -c -S -g hello.c
   We obtain then:
                                     Listing 1.16: hello.s
1 star64: ~/tiny-asm cat hello.s
      .file "hello.c"
                             set the file name
                             see §1.12.18 page 64
3
      .option pic
                             assemble in the text section
4
      .text
5 .Ltext0:
      .cfi_sections .debug_frame See §1.13.3 page 74.
      .file 0 "/home/jacob/tiny-asm" "hello.c"
                .rodata
                             assemble in the read only section
      .section
      .align 3
                             align to multiple of 8 (2^3)
10 .LCO:
11
      .string "hello"
                             see §1.12.2 page 52
12
      .text
13
      .align 1
      .globl main
14
      .type main, @function
15
16 main:
17 .LFBO:
                              "main" will be known as LFBO in some debug statements
      .file 1 "hello.c"
18
      .loc 1 3 1
                             See 1.12.16, page 59.
19
                             first executable instruction of "main"
20
      .cfi_startproc
             sp,sp,-16
                             reserve space for stack frame
21
      addi
      .cfi_def_cfa_offset 16 record that with CFI
22
      sd ra,8(sp)
                             store return address at sp+8
23
24
      sd s0,0(sp)
                             store previous frame pointer at (sp).
25
      .cfi_offset 1, -8
                             return address is at s0-8. See §1.13.6 page 77
      .cfi_offset 8, -16
26
                             previous frame pointer is at s0-16
            s0,sp,16
                             set s0 (frame pointer)
      addi
27
      .cfi_def_cfa 8, 0
                             See §1.13.8 page 77
28
      .loc 1 4 2
                             Start line 4 col 2 in the C text above
29
                             load the address of .LCO into a0
      lla a0,.LC0
30
                             call puts (and not printf)
31
      call puts@plt
      li a5,0
                             put zero into scratch register a5
32
      .loc 1 5 1
                             we start line 5 col 1 of the program text
33
      mv a0,a5
                             put the zero into the result register
34
      ld ra,8(sp)
                             restore the return address
35
      .cfi_restore 1
                             tell that to CFI
36
                             restore the frame pointer
      ld s0,0(sp)
37
                             tell that to CFI
      .cfi_restore 8
38
      .cfi_def_cfa 2, 16
                             See §1.13.8 page 77
39
      addi
            sp,sp,16
                             restore the stack
40
      .cfi_def_cfa_offset 0 tell that to CFI
41
                             jump to the return address
42
      jr ra
      .cfi_endproc
                             tell CFI that we returned
44 .LFEO:
                             alias for the end of "main"
                             subtract from the current position the address of "main"
45
      .size main, .-main
                             label. That will be the size of this procedure.
46
```

#### 47 # Further lines snipped

We see here that there are only 7 .cfi\_\* directives used. In bigger files, for instance in asm.c we find that the only directives used are exactly the same ones. And that file makes around 35 000 lines. We will document here those ones that are used by gcc. The other are documented in the GAS documentation.

Let's go to each of those cfi directives in detail.

#### 1.13.3 cfi\_sections

#### Syntax:

```
.cfi_sections <section_list>
Table: cfi_pseudo_table
   {"cfi_sections",dot_cfi_sections,0},
```

The directive .cfi\_sections is used to specify the type of format that should be used: whether CFI directives should emit .eh\_frame section, .debug\_frame section and/or .sframe section. To emit multiple sections, specify them together in a list. For example, to emit both .eh\_frame and .debug\_frame, use .eh\_frame, .debug\_frame. The default if this directive is not used is .cfi\_sections .eh\_frame.

The .eh\_frame is required for exceptions to work. It must contain sufficient info to unwind from all the places where exception may be raised, but doesn't have to include anything beyond that. For example, it does not need to contain info needed to unwind through function prologue or epilogue, since no exception can be raised there.

The .debug\_frame (and other .debug\_\* sections) is only needed for debugging (and also for "self-aware" programs which unwind their own stack on e.g. crashes). It should contain sufficient info for debugger to unwind the stack from arbitrary place in the program, though in practice it may not.

The differences between the two formats are:<sup>41</sup>

- .eh\_frame is based on .debug\_frame introduced in DWARF v2.
- .eh\_frame has the flag of SHF\_ALLOC (indicating that a section should be part of the process image) but .debug\_frame does not, so the latter has very few usage scenarios.
- .debug\_frame supports DWARF64 format (supports 64-bit offsets but the volume will be slightly larger) but .eh\_frame does not support (in fact, it can be expanded, but lacks demand)
- In the CIE (Common Information Entry) of .debug\_frame, augmentation instead of augmentation\_data\_length and augmentation\_data is used.
- The version field in CIEs is different.
- The meaning of CIE\_pointer in FDEs is different. .debug\_frame indicates a section offset (absolute) and .eh\_frame indicates a relative offset. This change made by .eh\_frame is great. If the length of .eh\_frame exceeds 32-bit, .debug\_frame has to be converted to DWARF64 to represent CIE\_pointer. Relative offsets do not need to worry about this issue (if the distance between FDE and CIE exceeds 32-bit, add a CIE OK)
- In .eh\_frame, augmentation typically includes R and the FDE encoding is DW\_EH\_PE\_pcrel | DW\_EH\_PE\_sdata4 for small code models of AArch64, PowerPC64, x86-64.
- initial\_location has 4 bytes in GCC (even if -mcmodel=large). In .debug\_frame, 64-bit architectures need 8-byte initial\_location. Therefore, .eh\_frame is usually smaller than an equivalent .debug\_frame

<sup>&</sup>lt;sup>41</sup>see maskray-blog

1.13. The cfi directives 75

## 1.13.4 cfi\_startproc

.cfi\_startproc is used at the beginning of each function that should have an entry in .eh\_frame.

#### Syntax:

```
.cfi_startproc [simple]
Table: cfi_pseudo_table
{"cfi_startproc",dot_cfi_startproc,0}
```

The .cfi\_startproc directive is handled by dot\_cfi\_startproc, that performs following actions:

- Verifies that an cfi\_endproc has been issued or that we are at the start of the program.
- Allocates and initializes a new FDE.
- If present parses the simple argument, and sets an internal flag accordingly.
- If simple wasn't present, it generates the initial instructions for the virtual machine, in this case it sets the stack pointer to  $x^{42}$ .

#### 1.13.5 cfi\_def\_cfa\_offset

```
Syntax:
```

```
.cfi_def_cfa_offset offset
Table: cfi_pseudo_table
{"cfi_def_cfa_offset",dot_cfi,DW_CFA_def_cfa_offset}
```

.cfi\_def\_cfa\_offset modifies a rule for computing CFA. Register remains the same, but offset is new. Note that it is the absolute offset that will be added to a defined register to compute CFA address. In the example of hello.s line 22 we see that the new offset is emitted right after we subtract 16 from the stack. Right after that instruction, the CFA is 16 bytes from the value of sp, obviously.

This instruction (and several others) are handled by the dot\_cfi function that receives as its argument the instruction for the virtual machine.

This function does the following:

- Check that a previous cfi\_startproc has been issued.
- If the last address wasn't the current address, emit an instruction to advance to the current address.
- And now... a big switch statement that will perform the actions needed for each instruction.

In this case (DW\_CFA\_def\_cfa\_offset) the code is:

```
case DW_CFA_def_cfa_offset:
    offset = cfi_parse_const();
    cfi_add_CFA_def_cfa_offset(offset);
    break;

The function cfi_add_CFA_def_cfa_offset is as follows:

/* Add a DW_CFA_def_cfa_offset record to the CFI data. */
static void cfi_add_CFA_def_cfa_offset(offsetT offset);

{
    cfi_add_CFA_insn_offset(DW_CFA_def_cfa_offset,offset);
}
```

<sup>&</sup>lt;sup>42</sup>This instruction is repeated for each procedure in the program. It would be much easier to set this information in the CIE, since there isn't any program that will switch the stack register on a procedure basis...

```
frchain_now -> frch_cfi_data -> cur_cfa_offset = offset;

frchain_now -> frch_cfi_data -> cur_cfa_offset = offset;

fraction of struct cfi_add_CFA_insn_offset(int insn,offsetT offset)

fraction of struct cfi_insn_data *insn_ptr = alloc_cfi_insn_data();

insn_ptr -> insn = insn;

insn_ptr -> u.i = offset;

fraction of struct cfi_add_CFA_insn_offset(int insn,offsetT offset)

fraction of struct cfi_insn_data *insn_ptr = alloc_cfi_insn_data();

fraction of struct cfi_add_CFA_insn_offset(int insn,offsetT offset)

fraction of struct cfi_add_CFA_insn_offset(int insn,offsetT offset)

fraction of struct cfi_insn_data *insn_ptr = alloc_cfi_insn_data();

fraction of struct cfi_insn_data();

fraction of struct cfi_
```

Each action is split in several functions, a side-effect of object oriented design. The function alloc\_cfi\_insn\_data allocates space for a new data packet.

These data packets are defined like this:

Listing 1.17: cfi insn data

```
1 struct cfi_insn_data {
       struct cfi_insn_data *next;
                                              Linked list
       int
               insn;
                                              The instruction in question
 3
       union {
                                              Depending on the instruction, only one
                                              of these fields is active.
 5
           struct {
 6
               unsigned
                           reg;
                           offset;
               offsetT
           }
                   ri:
           struct {
9
               unsigned
10
                           reg1;
               unsigned
                           reg2;
11
12
                   rr;
13
           unsigned r;
14
           offsetT
           struct {
15
                              *lab1;
16
               symbolS
               {\tt symbolS}
                              *lab2;
17
           }
                   11:
18
           struct cfi_escape_data *esc;
19
           struct {
20
21
               unsigned
                          reg ,encoding;
               expressionS exp;
22
                   ea;
24
           const char
                          *sym_name;
25
       } u;
26 };
      The function alloc_cfi_insn_data let us see immediately how everything is organized:
 1 static struct cfi_insn_data *alloc_cfi_insn_data(void)
2 {
       struct cfi_insn_data *insn = XCNEW(struct cfi_insn_data);
 3
       \verb|struct| fde_entry *| cur_fde_data = frchain_now \rightarrow frch_cfi_data \rightarrow cur_fde_data; \\
 4
 5
       *cur_fde_data -- last = insn;
                                            Link the new item in the linked list
 6
       cur_fde_data \rightarrow last = &insn \rightarrow next;
       SET_CUR_SEG(insn,is_now_linkonce_segment());
       return insn;
10 }
```

The macro XCNEW is just a call to xcalloc with a corresponding sizeof its argument, that should be a type. It is saved as the current FDE data pointer, added to the linked list. And that is all.

1.13. The cfi directives 77

```
No? You want me to explain to you the impressing code SET_CUR_SEG(insn,is_now_linkonce_segment()); Well, I don't know what it should do, since in asm.h we have the definition: #define_SET_CUR_SEG(structp,seg)_u(void)_u(0_u&&_useg) 43
```

So, all that complex statement is actually nothing!

## 1.13.6 cfi\_offset

#### Syntax:

```
.cfi_offset register, offset
Table: cfi_pseudo_table
{"cfi_offset",dot_cfi,DW_CFA_offset}
```

The previous value of register is saved at offset offset from the CFA. Processing goes to dot\_cfi (see above in cfi\_def\_cfa\_offset). The relevant lines in dot\_cfi are:

```
case DW_CFA_offset:
creg1 = cfi_parse_reg();
cfi_parse_separator();
formula for the separator const();
cfi_add_CFA_offset(reg1,offset);
break;
```

# 1.13.7 cfi\_restore

#### Syntax:

```
.cfi_restore register [, register]
Table: cfi_pseudo_table
{"cfi_restore",dot_cfi,DW_CFA_restore}
```

The argument is a list of one or more registers. Again, we use the workhorse dot\_cfi. The relevant lines are below:

```
case DW_CFA_restore:
      for (;;) {
2
         reg1 = cfi_parse_reg();
3
          cfi_add_CFA_restore(reg1);
4
5
          SKIP_WHITESPACE();
6
          if (*input_line_pointer ≠ ',')
             break;
          ++input_line_pointer;
      }
9
10
      break;
```

## 1.13.8 cfi\_def\_cfa

## Syntax:

```
.cfi_def_cfa register, offset
Table: cfi_pseudo_table
{"cfi_def_cfa",dot_cfi,DW_CFA_def_cfa},
```

.cfi\_def\_cfa defines a rule for computing CFA as: take address from register and add offset to it. The relevant lines in dot\_cfi are:

<sup>&</sup>lt;sup>43</sup>Yes, I should eliminate all those fake statements from the code of tiny-asm... but I haven't since it is quite a lot of work, to find them, and to get rid of them. In other CPUs that statement does something, surely.

```
case DW_CFA_def_cfa:
    reg1 = cfi_parse_reg();
    cfi_parse_separator();
    offset = cfi_parse_const();
    cfi_add_CFA_def_cfa(reg1,offset);
    break;

1.13.9 .cfi_endproc
    Syntax:
```

.cfi\_endproc
Table: cfi\_pseudo\_table
{"cfi\_endproc",dot\_cfi\_endproc,0},

.cfi\_endproc is used at the end of a function where it closes its unwind entry previously opened by .cfi\_startproc and emits it to .eh\_frame.

The dot\_cfi\_endproc procedure is as follows:

```
1 static void dot_cfi_endproc(int ignored ATTRIBUTE_UNUSED)
2 {
3     if (!cfi_test_startproc()) return;
4     last_fde = frchain_now \rightarrow frch_cfi_data \rightarrow cur_fde_data;
5     cfi_end_fde(symbol_temp_new_now());
7     demand_empty_rest_of_line();
8     cfi_sections_set = true;
10     if ((cfi_sections & CFI_EMIT_target) \neq 0)
11         tc_cfi_endproc(last_fde);
12 }
```

- Requires a previous open startproc
- sets globals like last\_fde, a variable that is set, kept current, but never used. It is there just for fun.

Or not?

Actually, it is used when SUPPORT\_COMPACT\_EH is defined. Since this is not supported under the riscv version of GAS, what you see are just leftovers of its former self...  $^{44}$ 

- cfi\_end\_fde sets several globals to mark the end of a function.
- tc\_cfi\_endproc is #defined as nothing, so the last two lines are empty.

# 1.13.10 .cfi\_remember\_state and .cfi\_restore\_state

This complementary directives save the current state of the virtual table of register values and restore it later. The usage facilitates cases where a lot of .cfi\* directives are issued that need to be ignored for the rest of the code. One example is when we have repeated exit procedures instead of a single one. At each repeated function epilogue there are a lot of .cfi instructions issued:

<sup>&</sup>lt;sup>44</sup>It can be asked why these variables are still there if they do not fill any purpose. There are several reasons why. The first is that they do not cost a lot of space or execution time. And the second is that, of course, maybe tiny-asm will one day support compact eh\_frames, and if the skeleton of places where the variable is set and updated is erased, that would be impossible. And the third one is that I haven't found the time to enclose all usages of that variable in a conditional compilation like SUPPORT\_COMPACT\_EH what would be actually the correct solution.

```
beq label
1
      cfi_remember_state
                                We save the state here
      ld ra,24(sp)
      .cfi_restore 1
                                Several cfi directives
      ld s0,16(sp)
5
                                that describe the function epilogue
      .cfi_restore 8
6
      .cfi_def_cfa 2, 32
      addi sp,sp,32
      .cfi_def_cfa_offset 0
9
      jr ra
10
11 label:
                                We restore it here, voiding all previous cfi directives
12
      .cfi_restore_state
13
```

Within the assembler those instructions are added using the general utilities for adding DW\_CFA instructions. Here is cfi\_add\_CFA\_remember\_state

```
1 static void cfi_add_CFA_remember_state(void)
2 {
3
        struct cfa_save_data *p;
4
        cfi_add_CFA_insn(DW_CFA_remember_state);
        p = XCNEW(struct cfa_save_data); // Allocate space
        // remember the offset
        p \!\!\to\! \texttt{cfa\_offset} \; = \; \texttt{frchain\_now} \!\!\to\! \texttt{frch\_cfi\_data} \!\!\to\! \texttt{cur\_cfa\_offset};
9
        // Push it into the top of the list
10
        p\rightarrow next = frchain_now \rightarrow frch_cfi_data \rightarrow cfa_save_stack;
11
        frchain_now \rightarrow frch_cfi_data \rightarrow cfa_save_stack = p;
12
13 }
```

#### 1.14 Risc-v instructions

OK, we know now how to build tiny-asm, how to write directives, how the operations are encoded, let's start now to do something with that knowledge. Let's see how the common operations are done.

We will start by showing programs generated by the C compiler. It is the best way to get a feeling for this machine, its instructions and its possibilities.

#### 1.14.1 Loads, stores and addition

Here we cover the basics: loading data from memory, performing an operation, and storing the result in memory again. The riscv is a RISC machine, i.e. like the ARM, it can't work directly on data in memory like the x86 family. Data must be first loaded into memory, before it can be used for calculations.

Consider the following C program:

```
1 int main(void)
2 {
3     short sa=5,sb=6,sc=sa+sb; // 16 bit addition
4     int ia=5,ib=6,ic=ia+ib; // 32 bit
5     long long lla=5,llb=6,llc=lla+llb; // 64 bit
6     float fa=5,fb=6,fc=fa+fb; // single precision
7     double da=5,db=6,dc=da+db;// double precision
8     return sc+ic+llc+fc+dc; // Should be 55 isn't it?
9 }
```

We translate this with:

gcc -c -S add.c obtaining the following assembler file:

## Listing 1.18: add.s, no optimizations

```
// Standard instructions at the beginning of any file.
      .file "add.c"
1
                            // PC relative code
2
      .option pic
      .text
                           // Ensure code section
3
                           // Align to multiple of 16 bits (2^4) bytes)
      .align 4
4
                           // Visible outside this module
      .globl main
5
      .type main, @function // Debug statement
7 main:
      addi
             sp,sp,-112
                           // add immediate -112 to the value in the stack
                           // Store doubleword: old frame pointer (s0)
9
      sd s0,104(sp)
             s0, sp, 112
                           // Setup the new frame pointer
10
      addi
```

At this point the prologue of this function is finished. The old value of the frame pointer has been saved and a new one established. We start compiling the first C statement.

```
// short sa=5,sb=6,sc=sa+sb; // 16 bit addition
11
      li a5,5
                             // Put constant 5 in a5
12
      sh = a5, -18(s0)
                            // Store halfword (16 bits)
13
      li a5,6
                            // Put 6 into a5
14
      sh = a5, -20(s0)
                            // Store it
15
                            // Load half word unsigned
16
      lhu a4,-18(s0)
```

Note that the compiler uses the "lhu" instruction for loading an *unsigned* instead of the correct one lh that does a sign extension and is used for loading signed data, as it should be since we have declared the data as short and not unsigned short!

```
1hu \ a5,-20(s0)
                            // Same as above
17
              a5,a4,a5
                            // At last! 32 bit addition
      addw
18
      slli
              a5,a5,48
                            // shift left a5 48 bits
19
              a5,a5,48
                            // shift right a5 48 bits
20
      srli
      sh a5,-22(s0)
                            // Store 16 bits: store halfword, sh
```

The compiler emits code to load the data as unsigned, do the addition, and select the lower 16 bits. We can see better what is going on if we follow this sequence in the debugger but using -5 instead of a positive constant.

We see now that the addition was done in an unsigned form, producing 0x10001, that after the shifts was converted to 1. So,  $-5 + 6 \rightarrow 1$ . We are saved for this time... <sup>45</sup>

```
22
                            // int ia=5, ib=6, ic=ia+ib; // 32 bit
23
      li a5,5
                            // Same as before: 5 into a5
      sw a5,-28(s0)
                            // Initialize "ia" to 5
24
      li a5,6
                           // Put 6 into a5
25
      sw a5,-32(s0)
                           // Store it into "ib"
26
      lw a5,-28(s0)
                           // load ia
27
                            // copy it to a4
      mv a4,a5
28
      lw a5,-32(s0)
                            // load "ib"
29
      addw a5,a4,a5
                            // Do the addition
30
      sw a5,-36(s0)
                            // store the result
31
                            // long long lla=5,llb=6,llc=lla+llb; // 64 bit
32
      li a5,5
                            // load 5
33
      sd a5,-48(s0)
                            // Store doubleword this time
34
      li a5,6
                           //
35
                            // Same
36
      sd a5,-56(s0)
                            // Load "lla" into a4 (directly this time)
37
      1d a4,-64(s0)
                            // Load "llb" into a5
38
      1d a5,-48(s0)
                            // 64 bit addition
39
      add a5,a4,a5
      sd a5,-64(s0)
                            // Store 64 bits
40
                            // float fa=5,fb=6,fc=fa+fb; // single precision
41
      lla a5..LC0
                            // Load the address of LCO into a5
42
                           // Load single precision from the address in a5 \,
      flw fa5.0(a5)
43
      fsw fa5,-68(s0)
                           // Store it at "fa"
44
      lla a5,.LC1
                            // Same process for "fb".
45
      flw fa5,0(a5)
46
      fsw fa5,-72(s0)
                            // "fb" at -72
47
      flw fa4,-68(s0)
                            // Load fa4 with "fa"
48
                            // Load fa5 with "fb"
      flw fa5, -72(s0)
49
50
      fadd.s fa5,fa4,fa5
                           // Add single precision
      fsw fa5, -76(s0)
                            // Store result at -76
51
                            // double da=5,db=6,dc=da+db; double precision
52
                            // Load the address of .LC2 into a5 \,
      lla a5,.LC2
53
      fld fa5,0(a5)
                            // Load double precision from that address
54
      fsd fa5,-88(s0)
                            // Initialize "da"
55
      lla a5,.LC3
                            // Same for "db"
56
      fld fa5,0(a5)
57
      fsd fa5, -96(s0)
58
                            // Load "da" into fa4
      fld fa4, -88(s0)
59
                            // Load "db" at fa5
      fld fa5, -96(s0)
60
                           // Add double precision
      fadd.d fa5,fa4,fa5
61
                            // Store result
      fsd fa5,-104(s0)
62
                            // return sc+ic+llc+fc+dc; Should be 55
63
                            // "sc" into a5
      1h a5, -22(s0)
64
                            // Sign extend it
      sext.w a5,a5
65
                           // "ic" goes into a4
      lw a4,-36(s0)
66
                            // Add both a,nd accumulate into a5
67
      addw a5,a4,a5
      sext.w a5,a5
                            // Sign extend result to 64 bits
68
      mv a4,a5
                            // Copy it to a4
69
                            // Load "llc" to a5
      1d a5,-64(s0)
70
      add a5,a4,a5
                            // Add accumulating into a5
71
```

<sup>&</sup>lt;sup>45</sup>Why does the compiler do this instead of loading everything as signed and doing a signed addition? Nobody knows, at least not me. Note that we are using the compiler without any optimizations, we will see later what happens when some of those are turned on.

In any case, the sequence of loading sign extended 16 bit data and making a 32 bit addition gives exactly the same results.

```
// Convert integer in a5 into float in fa4
       fcvt.s.l
                 fa4,a5
72
       flw fa5, -76(s0)
                            // Load "fc" into fa5
       fadd.s fa5,fa4,fa5
                            // Add single precision fa4 and fa5
74
       fcvt.d.s fa4,fa5
                            // Convert that result into double precision
75
                            // Load "dc" into fa5
       fld fa5,-104(s0)
76
       fadd.d fa5,fa4,fa5
                            // Add double precision fa4+fa5 -
ightarrow fa5
77
       fcvt.w.d a5,fa5,rtz // Truncate result into a45
78
       sext.w a5,a5
                            // Sign extend
79
       mv a0,a5
                            // Put result into the result register
80
                            // Start of epilogue -----
81
       ld s0,104(sp)
                            // Restore previous frame pointer
82
       addi
              sp, sp, 112
                            // Restore stacl
83
                            // Jump to return address
84
       jr ra
                            // End of code of "main"
       .size main, .-main // Compute size of main at assembly time
86
                          // New section: read only data
87
       .section .rodata
                            // Align to 4 byte boundary
88
       .align 2
89 .LCO:
       .word 1084227584
                            // 5.0 in single precision
90
       .align 2
91
92 .LC1:
       .word 1086324736
93
                            // 6.0 in single precision
                            // Align to 8 byte boundary
94
       .align 3
95 .LC2:
       .word 0
       .word 1075052544
                            // 5.0 in double precision
97
98
       .align 3
                            // 6.0 in double precision
99 . LC3:
       .word 0
100
       .word 1075314688
101
                            // End of module add.o GNU specific stuff follows
102
       .ident "GCC: (GNU) 11.3.0"
103
       .section
                 .note.GNU-stack,"",@progbits
104
```

This simple program allows us to see the instructions in action. How data is loaded from, and written to memory, how to convert from integer to floating point and vice versa, and how to add in several formats.

- Load from memory all integer data into the a5 register
- Once in memory, copy the data to its eventual destination.
- Target the result of the operations into a5, to save it into memory.

What happens with higher optimization levels? Trying with gcc -c -S -01 add.c we obtain:

WOW... there is nothing left even at the lowest optimization level. To avoid this we change the program like this:

```
8    double da=argc,db=6,dc=da+db;
9    return sc+ic+llc+fc+dc;
10 }
```

The compiler can't possibly know what "arge" will contain and will be forced to do the hard work.

This yields the following program:

#### Listing 1.19: add1.s

```
// argc is in a0 (first argument)
1 main:
                          // add argc + 6. Result in a5
     addiw a5,a0,6
2
             a5,a5,16
                          // 16 bit left shift of result
3
      slliw
                          // 16 bit right shift of result. "sa" is in a5
     sraiw
             a5,a5,16
4
                         // 32 bit add of argc and 6
     addiw
             a4,a0,6
5
             a5,a5,a4
                         // Accumulate addition into a5
6
      addw
      addi
             a4,a0,6
                          // Add 64 bits argc + 6 into a4
                          // Accumulate into a5
      add a5,a5,a4
                         // Convert sum sc+ic+llc to double
     fcvt.s.l fa5,a5
9
                         // Convert argc into float in fa4
               fa4,a0
10
      fcvt.s.w
                          // auipc instruction
     flw fa3,.LC0,a5
11
```

This instruction, that the assembly code of gcc represents as "lw" is actually the "auipc" instruction that was introduced to the specifications in 2014, version 2.0. "auipc" adds a 20 bit upper immediate to the program counter to form an address where the data will be loaded. This constant will be filled by the linker, that can establish the definitive distance between the program counter and the variable in question<sup>46</sup>.

If you look at the entry of "auipc" in the opcodes table you will find:

```
{"auipc",0,INSN_CLASS_I,"d,u",MATCH_AUIPC,MASK_AUIPC,match_opcode,0},
```

Now, looking at table §1.8 page 43 you will see that the 'u' letter means a 20 bit immediate will be supplied. Our label ".LC1" is precisely that.

But, I hear your question, how come that I see "lw" in the assembler source text and an "auipc" instruction gets written out ???

Well, that the magic of tiny-asm. It will be explained below, after we finish with this small program.

```
fadd.s fa4,fa4,fa3 // Add single precision: fa4 = fa4 + fa3
12
                           // fa4 contains argc in single precision
13
                           // fa3 contains 6
14
                          // Accumulate in fa5 that contains the sum of sc+ic
      fadd.s fa5,fa5,fa4
15
                          // Convert from single precision to double precision.
      fcvt.d.s fa5,fa5
16
      fcvt.d.w
                fa4,a0
                           // Convert argc to double precision
17
      fld fa3,.LC1,a5
                           // The same auipc instruction to acces 6.0 in double prec.
18
      fadd.d fa4,fa4,fa3 // Double precision add: fa4 = argc+6.0
19
      fadd.d fa5,fa5,fa4 // Add toaccumulator fa5
20
^{21}
      fcvt.w.d a0,fa5,rtz // Convert to integer
                           // sign extend
22
      sext.w a0,a0
                            // Done.
23
      ret.
      .size main, .-main
24
                .rodata.cst4,"aM",@progbits,4
      .section
25
      .align 2
26
27 .LCO:
      .word 1086324736
28
      .section
                 .rodata.cst8,"aM",@progbits,8
29
      .align 3
```

<sup>46</sup> Add Upper Immediate to **Program** Counter  $\rightarrow$  AUIPC.

```
31 .LC1:
32 .word 0
33 .word 1075314688
```

We see here what it means to optimize:

- The compiler keeps all data in registers, there isn't even a stack frame.
- More operations do actual calculations than loading or storing data from/to memory. In the unoptimized version of add.c we have only 15 out of 76 instructions that do arithmetic. In the optimized version we have 15 out of 33, mainly because there are so few loads and no stores
- Use of more advanced instructions

#### Load and store instructions in short

Table 1.15: Standard load and store operations

| Instruction        | Description                                       |
|--------------------|---------------------------------------------------|
| lb rd, imm12(rs1)  | Load 8 bits. Sign extension.                      |
| lb rd,lab          | Macro: Load 8 bits from address lab.              |
| sb rs2, imm12(rs1) | Store 8 bits at rs1+imm12                         |
| sb rs1, lab,rs2    | Macro: store 8 bits at lab's address. Contents of |
|                    | rs2 destroyed                                     |
| lbu rd, imm12(rs1) | Load 8 bits. Zero extension                       |
| lh rd, imm12(rs1)  | Load 16 bits Sign extension                       |
| sh rs2, imm12(rs1) | Store 16 bits                                     |
| lhu rd, imm12(rs1) | Load 16 bits. Zero extension                      |
| lw rd, imm12(rs1)  | Load 32 bits Sign extension                       |
| sw rs2, imm12(rs1) | Store 32 bits                                     |
| lwu rd, imm12(rs1) | Load 32 bits Zero extension                       |
| ld rd, imm12(rs1)  | 64 bit load                                       |
| sd rs2, imm12(rs1) | Store 64 bits at rs1+imm12                        |
| lui rd,imm20       | Load a 20 bit address constant into rd.           |
| auipc rd,imm20     | Adds the 20 bit immediate to the program          |
|                    | counter and stores the result in rd.              |

The imm12 is always sign extended.

# Addressing modes

• Absolute addressing.

```
lui a0, %hi(message)
addi a0, %lo(message)
```

The %hi and the %lo constructs mean the higher 20 and the lower 12 bits of the address.

• Relative addressing

```
auipc a0, %pcrel_hi(msg + 1)
addi a0, a0, %pcrel_lo(message)
```

• GOT (Global Object Table) relative addressing

```
.L1:
   auipc a0, %got_pcrel_hi(message)
   ld a0, %pcrel_lo(.L1)(a0)
```

Note that the last two are the same: either PC relative or GOT relative, the instructions

## Recognizing addressing modes

The assembler recognizes these keywords using tables of the following structure:

```
struct percent_op_match {
const char *str; // Name without the percentage sign
bfd_reloc_code_real_type reloc; // Relocation type invoked
};
const struct percent_op_match percent_op_utype[];
const struct percent_op_match percent_op_itype[];
const struct percent_op_match percent_op_stype[];
const struct percent_op_match percent_op_rtype[];
const struct percent_op_match percent_op_rtype[];
const struct percent_op_match percent_op_null[];
```

These tables will be used in the function parse\_relocation to recognize (or not) a relocation directive.

```
1 /* Return true if *STR points to a relocation operator. When returning true, move
2 * *STR over the operator and store its relocation code in *RELOC. Leave both *STR
3 * and *RELOC alone when returning false. */
4 bool parse_relocation(char **str,bfd_reloc_code_real_type * reloc,
5 const struct percent_op_match *percent_op)
```

This function will set up a pointer to the first table, and will scan each name in all the tables, assuming they are in consecutive order. The last "table" is a terminator with only zeroes. It is crucial then, that, unaware of this, you insert something in between those tables. That would totally screw up things...

parse\_relocation will be called when parsing an expression that should yield a small immediate constant of offset. Its single use will be in my\_getSmallExpression.

```
1 /* Parse string STR as a 16-bit relocatable operand. Store the expression in
2 * *EP and the relocation, if any, in RELOC. Return the number of relocation
3 * operators used (0 or 1).
4 *
5 * On exit, EXPR_PARSE_END points to the first character after the expression. */
6 size_t my_getSmallExpression(expressionS * ep, bfd_reloc_code_real_type * reloc,
7 char *str,const struct percent_op_match *percent_op)
```

Now, this function, my\_getSmallExpression will be called from two places:

- 1. my\_getOpcodeExpression, a function used in riscv\_ip.
- 2. riscv\_ip directly, and in an extensive fashion.



Figure 1.16: Who calls the parse\_relocationhn,vc=:m:= function

## 1.14.2 Digression: assembler macros

We have seen above that the expression flw fa3,.LC0,a5 gets translated into two instructions:

```
auipc a5,0x0
flw fa3,52(a5)
```

Looking at the opcode table, we find that there several entries for the "flw" instruction.

At line 7, we find an instruction whose flag field has the INSN\_MACRO set. In the md\_assemble function, we find the sequence:

```
if (insn.insn_mo >pinfo == INSN_MACRO)
   macro(&insn,&imm_expr,&imm_reloc);
else
   append_insn(&insn,&imm_expr,imm_reloc);
```

If this instruction is actually a macro, expand it, if not, append the new instruction. What does the macro procedure do?

- It decomposes its arguments into 4 parts: the destination register (rd), the two source registers (rs1 and rs2), and a mask. According to the mask, different actions are performed. In our case we have M\_LW, as we can see in line 7 of the opcodes listing above. In the same line we find that the function for matching the opcode is match\_never a function that will always fail, excluding that the macro will be understood as another opcode. 47
- Using the mask value, it dispatches in a long switch statement for each mask. In our case:

```
case M_LW:
    pcrel_load(rd,rd,imm_expr,"lw",
```

 $<sup>^{47}</sup>$ M\_LW is a member of an anonymous enumeration defined in asm.h

```
BFD_RELOC_RISCV_PCREL_HI20, BFD_RELOC_RISCV_PCREL_L012_I);
break;
```

pcrel\_load and its companion pcrel\_store call pcrel\_access with slightly different arguments:

Listing 1.20: pcrel load and store

And, to make a disgression within a disgression, long and explicit type names can be nice, but sometimes they can lead to *really* verbose code... What if we substitute in the code above the long names with something like Reloc?

Listing 1.21: perel load and store improved

```
void pcrel_load(int destreg,int tempreg,expressionS * ep,const char *lo_insn,
1
                     Reloc hi_reloc, Reloc lo_reloc)
     Is this code less lisible?
      Anyway, both functions call pcrel_acces 48
1 static void pcrel_access(int destreg,int tempreg,expressionS * ep,
2
                          const char *lo_insn,const char *lo_pattern,
                  bfd_reloc_code_real_type hi_reloc,bfd_reloc_code_real_type lo_reloc)
3
4 {
5
      ep2.X_op = O_symbol; // expression is a symbolic expression
      ep2.X_add_symbol = make_internal_label(); // Symbol to attach the relocation
      ep2.X_add_number = 0;
      macro_build(ep,"auipc","d,u",tempreg,hi_reloc); // First insn
9
      macro_build(&ep2,lo_insn,lo_pattern,destreg,tempreg,lo_reloc); // Second
10
11 }
```

pcrel\_access builds a symbolic expression and calls macro\_build twice. The first one to build the auipc instruction, and the second for the actual load using the temporary register. The function macro\_build receives as arguments:

- 1. An expression.
- 2. A name for the instruction to generate.
- 3. A format string that will be used, in a similar manner to printf, as a template for the extraction of the corresponding arguments from the rest.

<sup>48</sup>The problem with bfd\_reloc\_code\_real\_type (besides the fact that is a pain to type!) is that many of the words used do not convey any new information... Real type? Are other types "unreal"? What did they want to say?

In our case we give it first the .LCO label, the name of the first instruction that we want to generate ("auipc"), and a format string of 'd' and 'u'.

The meaning of those letters is as follows:

Table 1.16: Macro letter arguments

| Letter  | action                                                         |
|---------|----------------------------------------------------------------|
| ,V,     | Vector macro. It needs a further letter for fully specifying   |
|         | which action is needed.                                        |
| 'd'     | <pre>INSERT_OPERAND(RD,insn,va_arg(args,int)); continue;</pre> |
| 's'     | <pre>INSERT_OPERAND(RS1,insn,va_arg(args,int)); continue</pre> |
| 't'     | <pre>INSERT_OPERAND(RS2,insn,va_arg(args,int)); continue</pre> |
| 'q','u' | r=va_args(args,int); continue; "r" is the relocation           |
| and 'j' | type".                                                         |

Then, just before exiting, macro\_build will call: append\_insn(&insn,ep,r); Let's see the output of objdump when we ask for disassembly and relocations:

As expected, we have a relocation of 20 bits and another one for the next instruction for the lower 12 bits. There are also 'relax" relocations, that we will meet later, when we study relocations.

## 1.14.3 Subtraction

Replacing all additions with subtractions in our C source doesn't change much to the overall shape of the program. The subtraction instructions are:

```
• sub. 64 bit subtraction.

Syntax: sub rd,rs1,rs2

Operation: rd ← rs1 - rs2.
```

- The subw instruction does a 32 bit subtraction. Same syntax and operation as above.
- The fsub.s does a single precision subtraction Syntax: fsub.s fd,fs1,fs2
   Operation: fd ← fs1 - fs2
- The fsub.d instruction does a double precision subtraction. Same as above.

## 1 14 4 Comparisons

```
• slti. Set less than immediate. (Signed)
Syntax: slti rd,rs1,immediate
Operation: rd ← (rs1 < immediate) ? 1: 0
```

• sltiu Set less than immediate unsigned.

```
Syntax: sltiu rd, rs1, imm
```

```
sltu rd,rs1,rs2 Operation: rd \leftarrow (rs1 < rs2/imm ) ? 1 : 0
```

- The pseudo instruction SEQZ rd,rs sets rd to 1 if rs is equal to zero. This is actually an alias for sltiu rd,rs,1.
- flt.s and flt perform floating point comparisons for single and double precision floating point respectively.

```
Syntax: flt rd,fsrc1,fsrc2
Operation: rd ← fsrc1 < fsrc2) ? 1 : 0
rd is an integer register, fsrc1 and fsrc2 are floating point.
```

• feq and feq.s do an equality comparison.

```
Syntax: feq rd,fsrc1,fsrc2
Operation: rd ← (fsrc1 == fsrc2) ? 1 : 0
rd must be an integer register, fsrc1 and fsrc2 are floating point.
```

An instruction alias that uses subtraction is neg that is actually just sub rd,x0,rs1 i.e. subtract rs1 from zero.

# 1.14.5 Multiplication and Division

#### Multiplication

These instructions are present if the processor implements the 'M' extension.

- mul performs a 64 by 64 bits multiplication, returning the lower 64 bits.
- mulh performs a signed 64 bit by a signed 64 bit multiplication and returns the higher 64 bits of the result.
- mulhu multiplies unsigned by unsigned 64 bit quantities and returns the upper 64 bits.
- mulhsu multiplies a signed rs1 by an unsigned rs2 and returns the higher 64 bits. <sup>49</sup>
- mulw is a 32 bit multiplication. The lower 32 bits are returned, with sign extension.

## XuanTie-OpenC910

This processor features several new instructions for multiplication.

| Т | able 1. | 17: | Thead | Multip | lication | ext | ension | 1s |
|---|---------|-----|-------|--------|----------|-----|--------|----|

| Instruction | Operation                                   | Description               |
|-------------|---------------------------------------------|---------------------------|
| th.mula     | $rd \leftarrow rd + (rs1 \times rs2)$       | Accumulate in rd          |
| rd,rs1,rs2  |                                             |                           |
| th.mulah    | $t[0:31] \leftarrow rd + (rs1[0:15] \times$ | Accumulate with result of |
| rd,rs1,rs2  | rs2[0:15])                                  | 16 bit multiplication.    |
|             | $rd \leftarrow sign\_extend(t)$             |                           |

<sup>&</sup>lt;sup>49</sup>In a multiple precision context, this instruction can be used to multiply the higher 64 bits that contain the sign, with the lower 64 bits of the other multiplicand, that has no sign.

|                 | 1 ( 10 01)                                  | 1 1 1 1 1                  |
|-----------------|---------------------------------------------|----------------------------|
| ${	t th.mulaw}$ | $t[0:31] \leftarrow rd + (rs1[0:31] \times$ | Accumulate with result of  |
| rd,rs1,rs2      | rs2[0:31])                                  | 32 bit multiplication.     |
|                 | $rd \leftarrow sign\_extend(t)$             |                            |
| th.muls         | $rd \leftarrow rd - (rs1 \times rs2)$       | Subtract from rd the re-   |
| rd,rs1,rs2      |                                             | sult of the multiplication |
| th.mulsh        | $t[0:31] \leftarrow rd - (rs1[0:15] \times$ | Subtract from rd result of |
| rd,rs1,rs2      | rs2[0:15])                                  | 16 bit multiplication.     |
|                 | $rd \leftarrow sign\_extend(t)$             |                            |
| th.mulsw        | $t[0:31] \leftarrow rd - (rs1[0:31] \times$ | Subtract from rd result of |
| rd,rs1,rs2      | rs2[0:31])                                  | 32 bit multiplication.     |
|                 | $rd \leftarrow sign\_extend(t)$             |                            |

Table 1.17: Thead Multiplication extensions

These operations are encoded using a modified form of the "R" format. Here is the encoding for the mula instruction for instance: mula a0,a1,a2.

Figure 1.17: Modified C910 R Instruction layout



As you can see, the extra-7 field of the "R" format has been split into a 5+2 bit field. The meaning of those 2 bits is described in §1.14.14 page 96

## Division

A change of the standard allows now to implement processors that have multiplication but not division. For those that do feature division, we have:

• div features a signed 64 bit division with rounding towards zero. Syntax:

```
div rd,rs1,rs2
Operation:
rd ← rs1 / rs2
```

- divu Unsigned division. Syntax and mode of operation the same as DIV.
- rem Signed remainder Syntax: rem rd,rs1,rs2 Operation: rd ← rs1 % rs2
- remu Unsigned remainder. Same as REM but for unsigned data.
- remw and remuw 32 bit versions.

Division by zero returns a result with all bits set, without any trap. <sup>50</sup>

The riscv ISA doesn't provide an instruction for calculating the remainder and the division with only one division operation. The sequence: DIV[U] rdq, rs1, rs2; REM[U] rdr, rs1, rs2 ()where rdq can't be the same as rs1 or rs2) is proposed for optimization.

#### 1 14 6 Shifts

Operation Syntax slli rd, rsrc1, imm Shift left logical immediate. Shift right logical immediate (Shifts in zeros) srli rd, rsrc1, imm Shift right arithmetic (propagating the sign bit). srai rd,rsrc1,imm sll rd,rsrc1,rsrc2 Shift left logical (shifts in zeroes). As s11 but works on lower 32 bits. sllw rd,rsrc1,rsrc2 Shift right logical (shifts in zeroes). srl rd,rsrc1,rsrc2 srlw rd, rsrc1, rsrc2 As srl but works on lower 32 bits. Shift right arithmetic (propagating the sign bit). sra rd,rsrc1,rsrc2 As sra but works on lower 32 bits. sraw rd, rsrc1, rsrc2

Table 1.18: Standard shift operations

In all this instructions rsrc1 is the quantity to be shifted, and rsrc2 or imm contain the number of bits to shift.

#### 1.14.7 Control flow

## **Inconditional Jumps**

| Pseudo      | Base             | Operation                                   |
|-------------|------------------|---------------------------------------------|
| instruction | instruction      |                                             |
| j label     | jal x0 label     | Jump inconditional                          |
|             |                  | $pc \leftarrow pc + sign_extend(imm20 * 2)$ |
| jal fn      | jal x1,fn        | Call subroutine                             |
| jr register | jalr x0,register | Call function pointer in register           |

Table 1.19: Standard inconditional jumps

The jal instructions uses the 'j' instruction format (See §1.7.6 page 34). The offset immediate (in multiples of 2 bytes) is added to the current program counter value to form the target address. It has a reach of 1MB forward or backwards.

We considered raising exceptions on integer divide by zero, with these exceptions causing a trap in most execution environments. However, this would be the only arithmetic trap in the standard ISA (floating-point exceptions set flags and write default values, but do not cause traps) and would require language implementors to interact with the execution environment's trap handlers for this case. Further, where language standards mandate that a divide-by-zero exception must cause an immediate control flow change, only a single branch instruction needs to be added to each divide operation, and this branch instruction can be inserted after the divide and should normally be very predictably not taken, adding little runtime overhead. The value of all bits set is returned for both unsigned and signed divide by zero to simplify the divider circuitry.

The value of all 1s is both the natural value to return for unsigned divide, representing the largest unsigned number, and also the natural result for simple unsigned divider implementations. Signed division is often implemented using an unsigned division circuit and specifying the same overflow result simplifies the hardware.

<sup>&</sup>lt;sup>50</sup>The riscv standard justifies this with:

The indirect jumps through a register are **not** in multiples of two bytes, beware. The address must be the real address of the target.

## 1.14.8 Conditional expressions

Table 1.20: Standard conditional expressions

| Inst                 | Operation                                                        |
|----------------------|------------------------------------------------------------------|
| beq rs1, rs2, label  | if (rs1 = rs2) pc $\leftarrow$ pc+sign_extend(imm12 $<<$ 1)      |
| bge rs1, rs2, label  | if (rs1 $\geq$ rs2) pc $\leftarrow$ pc+sign_extend(imm12 $<<$ 1) |
| bgeu rs1, rs2, label | if (rs1 $\geq$ rs2) pc $\leftarrow$ pc+sign_extend(imm12 $<<$ 1) |
| blt rs1, rs2, label  | if (rs1 $\leq$ rs2) pc $\leftarrow$ pc+sign_extend(imm12 $<<$ 1) |
| bltu rs1, rs2, label | if (rs1 $\leq$ rs2) pc $\leftarrow$ pc+sign_extend(imm12 $<<$ 1) |
| bne rs1, rs2, label  | if (rs1 $\neq$ rs2) pc $\leftarrow$ pc+sign_extend(imm12 $<<$ 1) |

All these instructions have a range of  $\pm 4$ K.

Exercise 1: The instruction bgt is an alias. How would you build it from the other instructions?

Exercise 2: Write a small program that uses a conditional branch.

Exercise 3: Disassemble the program. What you see instead of bgt?

Exercise 4: How is the change achieved? Look at the source asm.c.

## 1.14.9 And, Or, Xor

Table 1.21: Standard boolean instructions

| Inst                | Operation                          |
|---------------------|------------------------------------|
| and rd,rsrc1,rsrc2  | $rd \leftarrow rsrc1 \land rsrc2$  |
| andi rd,rsrc1,imm12 | $rd \leftarrow rsrc1 \land imm12$  |
| or rd,rsrc1,rsrc2   | $rd \leftarrow rsrc1 \lor rsrc2$   |
| ori rd,rsrc1,imm12  | $rd \leftarrow rsrc1 \lor imm12$   |
| xor rd,rsrc1,rsrc2  | $rd \leftarrow rsrc1 \oplus rsrc2$ |
| xori rd,rsrc1,imm12 | $rd \leftarrow rsrc1 \oplus imm12$ |

Exercise 5: Use the XOR instruction to invert all bits in an integer register

# 1.14.10 Reading timers

The "Zinctr" extension prescribes at least 3 counters/timers that should be present in all implementations.

- Cycles. The rdcycle pseudo instruction reads the low XLEN bits of the cycle special register which holds the number of clock cycles executed by the processor core on which the hardware thread is running from an arbitrary start time somewhere in the past, probably, when the machine was powered on.
- Time. The rdtime instructions returns the wall clock time since start, sometime in the past.

• Instructions retired. The rdinstret instruction returns the number of instructions retired, i.e. executed (roughly) since some time in the past.

## Reading standard counters

Table 1.22: Counter reading

| Instruction  | Description                                           |
|--------------|-------------------------------------------------------|
| rdtime rd    | Reads a 64 bit timer counter                          |
| rdcycle rd   | Reads a 64 bit cycle counter                          |
| rdinstret rd | Reads a 64 bit counter for the number of instructions |
|              | retired, i.e. executed                                |

The rd placeholder represents a 64 bit register.

Exercise 6: Write a program in assembler to print these 3 counters.

Exercise 7: Try to verify that time corresponds to a time measure

## 1.14.11 CSR instructions

"CSR" stands for Control and Status Register. These registers are used primarily in the privileged part of the instruction set, but there are some uses in the unprivileged instructions (the subject of this book).

#### 1.14.12 Boolean instructions

These instructions correspond to the "Zbb". In the opcode table they have INISN\_CLASS\_ZBB in the class field. extension.

The instructions that work only in a 32 bit environment have been excluded.

The Sifive U74-MC supports the standard Zbb extension. The XuanTie-OpenC910 has two somehow similar instructions, "ff1" and "ff0".

Note: GCC doesn't recognize these instructions in machines using the U74 CPU. You have to force it by adding the (undocumented) option <code>-march=rv64gc\_zbb</code> to the compilation command line. These problems do not affect tiny-asm: it will generate the correct instructions without problems.

Table 1.23: Zbb boolean extension instructions

| Inst.           | Description                                                                            |
|-----------------|----------------------------------------------------------------------------------------|
| bclr rd,rs1,rs2 | Returns rs1 with a single bit cleared at the index specified in rs2.                   |
|                 | The index is read from the lower log2(XLEN) bits of rs2.                               |
| bclri rd,rs1,nr | Returns rs1 with a single bit cleared at the index specified in nr.                    |
|                 | The index is read from the lower log2(XLEN) bits of nr.                                |
| bext rd, rs1,   | Returns a single bit extracted from rs1 at the index specified in                      |
| rs2             | rs2. The index is read from the lower log2(XLEN) bits of rs2.                          |
| bexti rd, rs1,  | Returns a single bit extracted from rs1 at the index specified in                      |
| nr              | $ m nr. \ The \ index \ is \ read \ from \ the \ lower \ log2(XLEN) \ bits \ of \ nr.$ |
| binv rd, rs1,   | Returns rs1 with a single bit inverted at the index specified in rs2                   |
| rs2             |                                                                                        |
| binvi rd,       | Returns rs1 with a single bit inverted at the index specified in nr.                   |
| rs1,nr          |                                                                                        |

Table 1.23: Zbb boolean extension instructions

|                     | Table 1.26. Zbb boolean extension instructions                                                                                                                                                                                                                 |
|---------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| bset rd,<br>rs1,rs2 | Returns rs1 with a single bit set at the index specified in rs2.                                                                                                                                                                                               |
| bseti rd,<br>rs1,nr | Returns rs1 with a single bit set at the index specified in nr.                                                                                                                                                                                                |
| clmul<br>rd,rs1,rs2 | clmul produces the lower half of the $2\times XLEN$ carry-less product.                                                                                                                                                                                        |
| clmulh rd, rs1, rs2 | Produces the upper half of the $2\times$ XLEN carry-less product.                                                                                                                                                                                              |
| clmulr              | Produces bits $2 \times$ XLEN-2 to XLEN-1 of the $2 \times$ XLEN carry-less product. <sup>51</sup>                                                                                                                                                             |
| clz rd,rs1          | Counts the number of 0 bits before the first 1 bit, starting at the most significant bit and progressing to bit 0. If the input is 0, the output is 64. If the most-significant bit of the input is 1, the output is 0.                                        |
| clzw rd,rs1         | Counts the number of 0 bits before the first 1 bit, starting at bit 31 and progressing to bit 0. If the least-significant word is 0, the output is 32. If the most-significant bit of the word is 1, the output is 0.                                          |
| ctz rd,rs1          | Counts the number of 0 bits before the first 1 bit, starting at the least-significant bit and progressing to the most-significant bit. If the input is 0, the output is 64. If the least-significant bit of the input is 1, the output is 0.)                  |
| ctzw rd,rs1         | Counts the number of 0 bits before the first 1 bit, starting at the least-significant bit and progressing to the most-significant word. If the least significant word is 0, the output is 32. If the least significant bit of the input is 1, the output is 0. |
| cpop rd,rs1         | Counts the number of 1 bits in the source register. This operations is also known as "population count" or "Hamming weight".                                                                                                                                   |
| cpopw rd,rs1        | Counts the number of 1 bits in the least-significant word of the source register                                                                                                                                                                               |
| max rd,rs1,rs2      | Returns the larger of two signed integers.                                                                                                                                                                                                                     |
| maxu rd,rs1,rs2     | Returns the larger of two unsigned integers.                                                                                                                                                                                                                   |
| min rd,rs1,rs2      | Returns the smaller of two signed integers.                                                                                                                                                                                                                    |
| minu rd,rs1,rs2     | Returns the smaller of two unsigned integers.                                                                                                                                                                                                                  |
| orc.b rd,rs         | Combines the bits within each byte using bitwise logical OR. This                                                                                                                                                                                              |
|                     | sets the bits of each byte in the result rd to all zeros if no bit                                                                                                                                                                                             |
|                     | within the respective byte of rs is set, or to all ones if any bit                                                                                                                                                                                             |
|                     | within the respective byte of rs is set.                                                                                                                                                                                                                       |
| orn rd,rs1,rs2      | performs the bitwise logical OR operation between rs1 and the bitwise inversion of rs2. $rd \leftarrow rs1 \mid \sim rs2$                                                                                                                                      |
| rev8 rd,rs1         | Reverses the order of the bytes in rs1.                                                                                                                                                                                                                        |
| rol rd,rs1,rs2      | Rotate left. Performs a rotate left of rs1 by the amount in least-significant 6 bits of rs2.                                                                                                                                                                   |
| rolw rd,rs1,rs2     | Rotate left. Performs a rotate left of rs1 by the amount in least-significant 5 bits of rs2. The resulting 32 bit vvalue is sign extended to 64.                                                                                                               |

 $<sup>\</sup>overline{\phantom{a}^{51}}$  The clmulr instruction is used to accelerate CRC calculations. The r in the instruction's mnemonic stands for reversed, as the instruction is equivalent to bit-reversing the inputs, performing a clmul, then bit-reversing the output.

Table 1.23: Zbb boolean extension instructions

| ror rd,rs1,rs2  | Rotate right. Uses the least significant 6 bits of rs2 for the amount |
|-----------------|-----------------------------------------------------------------------|
|                 | to rotate.                                                            |
| rori rd,rs1,imm | Rotate right immediate. Uses the least significant 6 bits of imm      |
|                 | for the amount to rotate.                                             |
| sext.b rd,rs    | Sign-extends the least-significant byte in the source to 64 by copy-  |
|                 | ing the most-significant bit in the byte (i.e., bit 7) to all of the  |
|                 | more-significant bits.                                                |
| sext.h rd,rs    | Sign-extends the least-significant 16 bits in the source to 64 by     |
|                 | copying the most-significant bit in the byte (i.e., bit 7) to all of  |
|                 | the more-significant bits.                                            |
| sh1add          | Shifts rs1 left by 1 and adds it to rs2.                              |
| rd,rs1,rs2      |                                                                       |
| sh2add          | Shifts rs1 left by 2 and adds it to rs2.                              |
| rd,rs1,rs2      | ·                                                                     |
| sh1add_uw       | This instruction performs an 64 bit wide addition of two addends.     |
| rd,rs1,rs2      | The first addend is rs2. The second addend is the unsigned value      |
|                 | formed by extracting the least-significant word of rs1 and shifting   |
|                 | it left by 1 place.                                                   |
| sh2add_uw       | Same as above but the shift is by 2 places.                           |
| rd,rs1,rs2      |                                                                       |
| sh3add_uw       | Same as above but the shift is 3 places.                              |
| rd,rs1,rs2      |                                                                       |
| slli.uw rd,     | Takes the least-significant word of rs1, zero-extends it, and shifts  |
| rs1, imm6       | it left by the immediate.                                             |
| xnor rd, rs1,   | Performs the bit-wise exclusive-NOR operation on rs1 and rs2.         |
| rs2             | rd = ~(rs1 ^ rs2); <sup>52</sup>                                      |
|                 | ·                                                                     |

Exercise 8: Use the "max" instruction to calculate the absolute value of a signed integer.

# The Zbkb extension

Table 1.24: Zbkb instructions

| Inst.           | Description                                                         |
|-----------------|---------------------------------------------------------------------|
| pack rd,rs1,rs2 | Packs the XLEN/2-bit lower halves of rs1 and rs2 into rd, with      |
|                 | rs1 in the lower half and rs2 in the upper half.                    |
| packh           | Packs the least-significant bytes of rs1 and rs2 into the 16 least- |
| rd,rs1,rs2      | significant bits of rd, zero extending the rest of rd.              |
| packw           | packs the low 16 bits of rs1 and rs2 into the 32 least-significant  |
| rd,rs1,rs2      | bits of rd, sign extending the 32-bit result to the rest of rd.     |
| rvb rd,rs1      | Reverses the order of the bits in every byte of a register.         |
| xperm.b rd,     | The xperm.b instruction operates on bytes. The rs1 register con-    |
| rs1, rs2,       | tains a vector of 8 8-bit elements. The rs2 register contains a     |
|                 | vector of 8 8-bit indexes. The result is each element in rs2 re-    |
|                 | placed by the indexed element in rs1, or zero if the index into rs2 |
|                 | is out of bounds. This instruction is in the extension Zxbkx.       |

 $<sup>^{52}</sup>$ The XNOR operation of two inputs returns 1 if the two inputs are equal, zero otherwise.

xperm.n rd,
rs1, rs2

The xperm.n instruction operates on nibbles. The rs1 register
contains a vector of XLEN/4 4-bit elements. The rs2 register
contains a vector of XLEN/4 4-bit indexes. The result is each
element in rs2 replaced by the indexed element in rs1, or zero if
the index into rs2 is out of bounds. This instruction is in the
extension Zxbkx.

Zext.h rd, rs

This instruction zero-extends the least-significant halfword of the
source to XLEN by inserting 0's into all of the bits more significant
than 15.

Table 1.24: Zbkb instructions

# 1.14.13 Pause instruction

## Syntax:

pause

The pause instruction is a HINT that indicates the current hart's rate of instruction retirement should be temporarily reduced or paused. The duration of its effect must be bounded and may be zero. No state is changed. The standard says about this:

Software can use the PAUSE instruction to reduce energy consumption while executing spin-wait code sequences. Multithreaded cores might temporarily relinquish execution resources to other harts when PAUSE is executed. It is recommended that a PAUSE instruction generally be included in the code sequence for a spin-wait loop.

Exercise 9: Calculate how long takes a pause instruction in your machine

## 1.14.14 Floating point

Floating point operations are controlled with the status register, fcsr. It is a 32-bit read-/write register that selects the dynamic rounding mode for floating-point arithmetic operations and holds the accrued exception flags.

- Bit 0: NX Inexact
- Bit 1: UF Underflow
- Bit 2: OF Overflow
- Bit 3: DZ Divide by zero
- Bit 4: NV Invalid operation
- Bits 5-7: Rounding mode. This will be used when the instruction uses the dynamic rounding mode. See §1.26 page 98.
- Bits 8-31 Reserved.

The fcsr register can be read and written with the FRCSR and FSCSR instructions, which are assembler pseudo instructions, built on the underlying CSR access instructions.

The fields of the csr can also be accessed individually. The instruction frrm reads the rounding mode field. The instruction fsrm writes to it. In a similar fashion frflags and fsflags read and write to the flags field.

#### Syntax:

```
\begin{array}{lll} \text{frrm rd} & \text{rd} \leftarrow \text{ rounding mode} \\ \text{fsrm rd,rs1} & \text{rounding mode} \leftarrow \text{rs1, rd} \leftarrow \text{old rounding mode} \\ \text{frflags rd} & \text{rd} \leftarrow \text{flags} \\ \text{fsflags rd,rs1} & \text{flags} \leftarrow \text{rs1, rd} \leftarrow \text{old flags} \end{array}
```

Exercise 10: Write an assembler program to show the CSR flags in the console

Exercise 11: Write a subroutine that returns the flags of the CSR as a 32 bit integer

Floating point can have 4 possible precision settings: half (16 bits), single (32), double
(64) and quad(128). Most machines implement single and double precision, some implement
half precision, and (till now) none has implemented 128 bit precision.

## **Encodings**

Floating point instructions use a slightly modified "R" format. Bits 25 and 26 define a "format" field that is used to differenciate between half, single and double precision.

Figure 1.18: Modified R Instruction layout



In the figure above we have:

- Bits 0-6, the opcode, 83 (0x53).
- Bits 7-11, the destination, 10 (0xA)
- Bits 12-14, the rounding mode, 7 (Dynamic rounding mode)
- Bits 15-19, the first source register, 11 (a1)
- Bits 20-24, the second source register 12 (a2)
- Bits 25-26, The format, in this case 0, single precision
- Bits 27-31, The code for the operation, in this case 1, the code for FADD or FSUB.

Table 1.25: Format bits (Bits 25-26)

| Bits  | Instruction | Description            |
|-------|-------------|------------------------|
| value | mnemonic    |                        |
| 0 0   | S           | Single precision       |
| 0 1   | D           | Double precision       |
| 1 0   | Н           | Half precision         |
| 1 1   | Q           | 128 bit Quad precision |

Table 1.26: Rounding mode bits (Bits 12-14)

| Bits  | Mode     | Description                  |
|-------|----------|------------------------------|
| value | mnemonic |                              |
| 000   | RNE      | Round to nearest             |
| 001   | RTZ      | Round to zero.               |
| 010   | RDN      | Round down towards $-\infty$ |
| 011   | RUP      | Round up towards $+\infty$   |

| 100 | RMM      | Round to nearest. Ties towards max magnitude                                                           |
|-----|----------|--------------------------------------------------------------------------------------------------------|
| 101 | Reserved |                                                                                                        |
| 110 | Reserved |                                                                                                        |
| 111 | DYN      | If in the instruction, selects dynamic rounding mode. If in the rounding mode register, it is reserved |

Table 1.26: Rounding mode bits (Bits 12-14)

The recognition of the rounding modes for an instruction is done in the case for the letter 'm', in the riscv\_ip function. It uses the riscv\_rm table of rounding modes.

## Floating point instructions

Single precision floating point is called extension "F", double "D" and half "H". There is provision for a "Q" extension for 128 bit numbers but none of the machines I have used implements that yet.

In the instructions below, an adddress is formed by adding the contents of the source register with a sign extended imm12.

| Instruction        | Description                                       |
|--------------------|---------------------------------------------------|
| flw frd,imm12(fs1) | Load single precision data from address at into   |
|                    | frd.                                              |
| fsw fs2,imm12(rs1) | Store single precision data from fs2 at address   |
| fld frd,imm12(fs1) | Load double precision data from address at into   |
|                    | frd                                               |
| fsd fs2,imm12(rs1) | Store double precision data from fs2 at address   |
| flh frd,imm12(fs1) | Load half precision data from address at into frd |
| fsh fs2,imm12(rs1) | Store half precision data from fs2 at address     |

Table 1.27: Floating point load/store instructions

In the instructions above the data will be moved without any changes.

Table 1.28: Floating point arithmetic instructions

| Instruction                | Description                                       |
|----------------------------|---------------------------------------------------|
| fadd.{h s d} frd,frs1,frs2 | Add. $frd \leftarrow frs1 + frs2$                 |
| fsub.{h s d} frd,frs1,frs2 | Subtraction. $frd \leftarrow frs1 - frs2$         |
| fmul.{h s d} frd,frs1,frs2 | Multiplication. $frd \leftarrow frs1 \times frs2$ |
| fdiv.{h s d} frd,frs1,frs2 | Division. $frd \leftarrow frs1 \div frs2$         |

|                                           | _                                             |
|-------------------------------------------|-----------------------------------------------|
| $fmadd.\{h s d\}$                         | Fused multiply add.                           |
| fd,fs1,fs2,fs3                            | $fd \leftarrow (fs1 \times fs2) + fs3$        |
| $fmsub.\{h s d\}$                         | Fused multiply subtract.                      |
| fd,fs1,fs2,fs3                            | $fd \leftarrow (fs1 \times fs2) - fs3$        |
| $fnmadd.\{h s d\}$                        | Fused negative multiply add simple precision. |
| fd,fs1,fs2,fs3                            | $fd \leftarrow (-fs1 \times fs2) - fs3^{53}$  |
| fnmsub. $\{h \mid s \mid d\}$ fs1,fs2,fs3 | Fused negative subtraction simple precision.  |
|                                           | $fs1 \leftarrow (-fs1 \times fs2) + fs3$      |

Table 1.28: Floating point arithmetic instructions

Table 1.29: Floating point square root, min, max instructions

| Instruction                  | Description                                                       |
|------------------------------|-------------------------------------------------------------------|
| fsqrt.{h s d} rd,rs1         | Square root. $rd \leftarrow \sqrt{rs1}$                           |
| M                            | inimum/Maximum                                                    |
| fmin.{h s d} rd,rs1,rs2      | Minimum of two inputs.rd← rs1 <rs2 :<="" ?="" rs1="" td=""></rs2> |
|                              | rs2                                                               |
| $fmax.{h s d} rd,rs1,rs2$    | Maximum of two inputs.rd← rs1 <rs2 ?="" rs2<="" td=""></rs2>      |
|                              | : rs1                                                             |
| fsgnj.{s d h} fd,fs1,fs2     | Sign injection of fs2 into fs1.                                   |
|                              | $fd[xlen-1] \leftarrow rs2[xlen-1]$                               |
|                              | $fd[0xlen-2] \leftarrow rs1[0xlen-2]$                             |
| $fsgnjn.{s d h}} fd,fs1,fs2$ | Sign injection of neg(fs2) into fs1.                              |
|                              | $\int fd[xlen-1] \leftarrow ! rs2[xlen-1]$                        |
|                              | $fd[0xlen-2] \leftarrow rs1[0xlen-2]$                             |
| $fclass.{h s d} rd,fs1$      | Classify fs1, returning a classification in rd, that              |
|                              | must be an integer register. The bits in the result               |
|                              | are explained in table §1.30, page 100. Only one                  |
|                              | bit will be set.                                                  |

Table 1.30: fclass results

| Bit number | Meaning             |
|------------|---------------------|
| 0          | $-\infty$           |
| 1          | rs1 < 0             |
| 2          | subnormal $rs1 < 0$ |
| 3          | $rs1 \equiv -0$     |
| 4          | $rs1 \equiv 0$      |
| 5          | subnormal $rs1 > 0$ |
| 6          | rs1 > 0             |

 $<sup>^{53}</sup>$  The official riscv manual acknowledges that fnmadd is a  $\mathbf{misnomer}$ . They try to justify this error with:

In my opinion, this "explanation" doesn't explain why this misnomer is maintained...

The FNMSUB and FNMADD instructions are counter intuitively named, owing to the naming of the corresponding instructions in MIPS-IV. The MIPS instructions were defined to negate the sum, rather than negating the product as the RISC-V instructions do, so the naming scheme was more rational at the time. The two definitions differ with respect to signed-zero results. The RISC-V definition matches the behavior of the x86 and ARM fused multiply-add instructions, but unfortunately the RISC-V FNMSUB and FNMADD instruction names are swapped compared to x86 and ARM.

RISC-V Unprivileged ISA V20191214-draft page 77  $\,$ 

Table 1.30: fclass results

| 7 | $+\infty$     |
|---|---------------|
| 8 | signaling NAN |
| 9 | quiet NAN     |

## Conversions

Table 1.31: Floating point conversion instructions

| Instruction                    | Description                                         |
|--------------------------------|-----------------------------------------------------|
| fcvt.w.s rd,rs1                | Converts a single-precision floating-point number   |
|                                | to a signed 32-bit integer. Sign-extends the 32-bit |
|                                | result to the destination register width.           |
| fcvt.s.w rd,rs1                | Converts a signed 32-bit integer to a single-       |
|                                | precision floating-point number                     |
| fcvt.wu.s rd,rs1               | Converts a single-precision floating-point number   |
|                                | to an unsigned 32-bit integer. Sign-extends the     |
|                                | 32-bit result to the destination register width.    |
| fcvt.s.wu rd,rs1               | Converts a unsigned 32-bit integer to a single-     |
|                                | precision floating-point number                     |
| fcvt.l.s rd,rs1                | Converts a single-precision floating-point number   |
|                                | to a signed 64-bit integer.                         |
| fcvt.s.l rd,rs1                | Converts a signed 64-bit integer to a single-       |
|                                | precision floating-point number                     |
| fcvt.lu.s rd,rs1               | Converts a single-precision floating-point number   |
|                                | to an unsigned 64-bit integer.                      |
| fcvt.s.lu rd,rs1               | Converts a unsigned 64-bit integer to a single-     |
|                                | precision floating-point number                     |
| Other precisions               |                                                     |
| $fcvt.{l w lu wu}.{s d h}$     | Convert floating point to integer in the different  |
|                                | sizes and precisions                                |
| $fcvt.\{s d h\}.\{1 lu w wu\}$ | Convert integer to floating point in the different  |
|                                | sizes and precisions                                |
| fcvt.{h s d}.{h s d}           | Convert between different floating point formats.   |

Table 1.32: Move to/from integer registers

| Instruction    | Description                                                                                                                                                                                                                                                      |
|----------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| fmv.x.w rd,rs1 | Moves the single-precision value in floating-point register rs1represented in IEEE 754-2008 encoding to the lower 32 bits of integer register rd. The higher 32 bits of the destination register are filled with copies of the floating-point number's sign bit. |
| fmv.w.x        | Moves the single-precision value encoded in IEEE 754-2008 standard encoding from the lower 32 bits of integer register rs1 to the floating-point register rd.                                                                                                    |

Absent from the table of instructions above are the ones introduced in 2023: the "Zfa" extension, that will make possible to load some immediates into fp registers, minimum/max-

imum operations with NANs and others. <sup>54</sup>

#### Comparisons

Floating-point compare instructions (feq, flt, fle) perform the specified comparison between floating-point registers (rs1 = rs2, rs1 < rs2, rs1  $\leq$  rs2) writing 1 to the integer register rd if the condition holds, and 0 otherwise.

InstructionDescriptionfeq. {h|s|d} rd,fs1,fs2Equality.  $rd \leftarrow (fs1 = fs2)$ flt. {h|s|d} rd,fs1,fs2Less than comparison.  $rd \leftarrow (fs1 < fs2)$ fle. {h|s|d} rd,fs1,fs2Less equal comparison.  $rd \leftarrow (fs1 \le fs2)$ 

Table 1.33: Floating point comparison instructions

flt. $\{h \mid s \mid d\}$  and fle. $\{h \mid s \mid d\}$  perform signaling comparisons: they set the invalid operation exception flag if either input is NaN. feq performs a quiet comparison: it only sets the invalid operation exception flag if either input is a *signaling* NaN. For all three instructions, the result is 0 if either operand is NaN.

#### 1.14.15 Atomic instructions

These instructions read, modify, and write memory atomically to provide synchronisation across several RISC-V harts in the same memory space. The theoretical model whereupon riscv is based is described in the paper **Memory consistency and event ordering in scalable shared-memory multiprocessors**. In *Proceedings of the 17th Annual International Symposium on Computer Architecture*, pages 15–26, 1990.

Complex atomic memory operations on a single memory word or doubleword are performed with the load-reserved (lr) and store-conditional (sc) instructions.

| Instruction              | Description                                             |
|--------------------------|---------------------------------------------------------|
| lr.{d w}.aqrl rd,rs1     | Loads a double or a single word from the address        |
|                          | in rs1, places the sign-extended value in rd, and       |
|                          | registers a reservation set, a set of bytes that sub-   |
|                          | sumes the bytes in the addressed double word.           |
|                          | For lr.w the value is extended to 64 bits.              |
| sc.{d w}.aqrl rd,rs1,rs2 | Conditionally writes a word in rs2 to the address       |
|                          | in rs1: the sc. {w d} succeeds only if the reserva-     |
|                          | tion is still valid and the reservation set contains    |
|                          | the bytes being written. If it succeeds, the in-        |
|                          | struction writes the word in rs2 to memory, and         |
|                          | it writes zero to rd. If it fails, the instruction does |
|                          | not write to memory, and it writes a nonzero value      |
|                          | to rd. Regardless of success or failure, executing      |
|                          | an sc instruction invalidates any reservation held      |
|                          | by this hart.                                           |

Table 1.34: load reserved/store conditional instructions

<sup>&</sup>lt;sup>54</sup>They are not supported in tiny-asm, nor do they have any implementation in actual hardware yet.

#### The aq and rl bits

The riscv specifications explain:

To follow more closely the specifications in the paper cited above for release consistency, each atomic instruction has two bits, aq and rl, used to specify additional memory ordering constraints as viewed by other riscv harts. The bits order accesses to one of the two address domains, memory or I/O, depending on which address domain the atomic instruction is accessing.

If both bits are clear, no additional ordering constraints are imposed on the atomic memory operation. If only the aq bit is set, the atomic memory operation is treated as an acquire access, i.e., no following memory operations on this RISC-V hart can be observed to take place before the acquire memory operation.

If only the rl bit is set, the atomic memory operation is treated as a release access, i.e., the release memory operation cannot be observed to take place before any earlier memory operations on this RISC-V hart. If both the aq and rl bits are set, the atomic memory operation is sequentially consistent and cannot be observed to happen before any earlier memory operations or after any later memory operations in the same riscv hart and to the same address domain. <sup>55</sup>

These bits can be specified by writing their state after the instruction, for instance:

```
amoxor.d rd,rs1,rs2,aq
amoxor.d rd,rs1,rs2,aqrl
amoxor.d rd,rs1,rs2,rl
```

## **Encodings**

New bits are: bit 26, for aq and bit 25, for r1. The width of the instruction is written in bits 12 to 14. A value of 3, for instance, means  $2^3 \rightarrow 8$ .

Figure 1.19: Atomic instructions layout



Table 1.35: Atomic Memory Operation instructions

| Instruction                  | Description                                        |
|------------------------------|----------------------------------------------------|
| $amoswap.{w d}[.aq aqrl rl]$ | Loads a data value from the address in rs1, place  |
| rd,rs2,(rs1)                 | the value into register rd, swaps the loaded value |
|                              | and the original value in rs2, then stores the re- |
|                              | sult back to the address in rs1                    |

<sup>&</sup>lt;sup>55</sup>§10.1, page 55

amoadd. {w|d}[.aq|aqrl|rl] Loads a data value from the address in rs1 , place rd, rs2, (rs1) the value into register rd, adds the loaded value and the original value in rs2, then stores the result back to the address in rs1 amoand. {w|d}[.aq|aqrl|rl] Loads a data value from the address in rs1, place the value into register rd , does and AND operrd, rs2, (rs1) ation between the loaded value and the original value in rs2, then stores the result back to the address in rs1 Loads a data value from the address in rs1, place amoor. {w|d}[.aq|aqrl|rl] the value into register rd , does an OR operation rd, rs2, (rs1) between the loaded value and the original value in rs2, then stores the result back to the address in rs1 amoxor.{w|d}[.aq|aqrl|rl] Loads a data value from the address in rs1, place rd, rs2, (rs1) the value into register rd, does an XOR operation between the loaded value and the original value in rs2, then stores the result back to the address in rs1 Loads a data value from the address in rs1, place amomax[u].{w|d}[.aq|aqr1|r1] rd, rs2, (rs1) the value into register rd, finds the signed (or the unsigned) maximum value between the loaded value and the original value in rs2, then stores the result back to the address in rs1 amomin[u].{w|d}[.aq|aqrl|rl] Loads a data value from the address in rs1, place rd, rs2, (rs1) the value into register rd, finds the signed (or unsigned) minimum value between the loaded value and the original value in rs2, then stors the result

Table 1.35: Atomic Memory Operation instructions

#### An example

To show one of those instructions in action, we write a small program. The results can only be traced in the debugger since there is no call to printf.

back to the address in rs1

```
1
       .globl main
2 main:
       addi sp,sp,-32
3
                                  // Boilerplate code for building the stack frame
       sd ra,8(sp)
4
       sd s0,0(sp)
5
6
                                 // Put 123 into t1
       li t1,123
                                 //\ \mathit{Store}\ \mathit{it}\ \mathit{at}\ \mathit{the}\ \mathit{bottom}\ \mathit{of}\ \mathit{the}\ \mathit{stack}
       sd t1,(sp)
8
       li t2,47
                                 // Put 47 in t2
       amoadd.d t3,t2,0(sp) // Atomic Memory Operation
10
11
       ld t4,(sp)
                                 // Load the result into t4
12
13
       1d ra,8(sp)
                                 // Boiler plate code to destroy the stack frame
14
       ld s0,0(sp)
15
       add sp,sp,32
16
```

We follow the execution in the debugger

```
star64: ~/tiny-asm$ gdb a.out
GNU gdb (GDB) 11.2
... the usual stuff from gdb not shown
Reading symbols from a.out...
(gdb) b main
                                    // Set a breakpoint at main
Breakpoint 1 at 0x640
(gdb) display/i $pc
                                    // Disassemble each instruction and show it
1: x/i $pc
<error: No registers.>
                                    // ... Normal, we haven't started yet
(gdb) run
                                    // Let's go
Starting program: /home/jacob/tiny-asm/a.out
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/libthread_db.so.1".
Breakpoint 1, 0x0000002aaaaaa640 in main ()
1: x/i $pc
=> 0x2aaaaaa640 <main+24>: amoadd.d t3,t2,(sp) // GDB stops BEFORE the instruction
    is executed
(gdb) print $t3
$1 = 274743607392
                                    // t3 contains nonsense
(gdb) print $t2
$2 = 47
                                    // As it should
(gdb) nexti
                                    // Execute next instruction
0x0000002aaaaaa644 in main ()
1: x/i $pc
=> 0x2aaaaaa644 <main+28>: ld t4,0(sp)
(gdb) print $t3
                                    // Contains the old value at the stack bottom
$3 = 123
(gdb) nexti
0x0000002aaaaaa648 in main ()
1: x/i $pc
=> 0x2aaaaaa648 <main+32>: ld ra,8(sp)
(gdb) print $t4
$4 = 170
                                    // The stack bottom contains 123 + 47
(gdb)
```

Exercise 12: Change the amound by amoswap. What are the values of t2, t3 and t4 after the operation?

## 1.15 Cryptographic Extensions

There are two main branches for these specifications: scalar and vector opcodes. Only scalar opcodes are supported in tiny-asm for the time being. Even then, there isn't any easily available board that implements them as of this writing (2023).

The scalar opcodes have two sides too, of course: encryption (extension Zkne) and decription (Zknd).

| Instruction             | Description                                                                              |
|-------------------------|------------------------------------------------------------------------------------------|
| AES                     |                                                                                          |
| aes64ks1i rd, rs1, rcon | Key schedule 1 decryption. Implements the rotation, SubBytes and Round Constant addition |
|                         | steps.                                                                                   |

Table 1.36: Atomic Memory Operation instructions

Table 1.36: Atomic Memory Operation instructions

|                       | v 1                                                                     |
|-----------------------|-------------------------------------------------------------------------|
| aes64ks2 rd, rs1, rs2 | Key schedule 2 decryption. Implements the remaining xor operations.     |
|                       |                                                                         |
| aes64im rd, rs1       | Applies the inverse MixColumns transformation                           |
|                       | to two columns of the state array, packed into a                        |
|                       | single 64-bit register. It is used in decryption to                     |
|                       | create the inverse cipher KeySchedule, according                        |
|                       | to the equivalent inverse cipher construction.                          |
| aes64esm rd, rs1, rs2 | Performs the (Inverse) SubBytes, ShiftRows and                          |
|                       | Mix-Columns Transformations for encryption.                             |
| aes64es rd, rs1, rs2  | Perform the (Inverse) SubBytes and ShiftRows                            |
|                       | Transformations.It is used for the last round of                        |
|                       | encryption only.                                                        |
| aes64dsm rd, rs1, rs2 | SubBytes, ShiftRows and Mix-Columns Transfor-                           |
|                       | mation for decryption.                                                  |
| aes64ds rd, rs1, rs2  | SubBytes and ShiftRows Transformations. It is                           |
|                       | used for the last round of decryption only                              |
| Hash functions        |                                                                         |
| sha256sig0 rd,rs1     | Implements the Sigma0 transformation function                           |
|                       | as used in the SHA2-256 hash function <sup>56</sup>                     |
| sha256sig1 rd,rs1     | Implements the Sigmal transformation function                           |
|                       | as used in the SHA2-256 hash function.                                  |
| sha256sum0 rd,rs1     | Implements the Sum0 transformation function                             |
|                       | as used in the SHA2-256 hash function (Section                          |
|                       | 4.1.2).                                                                 |
| sha256sum1 rd,rs1     | Implements the Sum1 transformation function                             |
|                       | as used in the SHA2-256 hash function (Section                          |
|                       | $ 4.1.2\rangle$ .                                                       |
| sha512sig0 rd,rs1,rs2 | Implements the Sigma0 transformation, as used                           |
| 5                     | in the SHA2-512 hash function (Section 4.1.3).                          |
| sha512sigl rd,rs1,rs2 | Implements the Sigmal transformation, as used                           |
| <b>5</b> , ,          | in the SHA2-512 hash function (Section 4.1.3).                          |
| sha512sum0 rd,rs1     | Implements the Sum0 transformation function                             |
| •                     | as used in the SHA2-512 hash function (Section                          |
|                       | 4.1.3).                                                                 |
|                       | $rd \leftarrow ror64(rs1, 28) \oplus ror64(rs1, 34)$                    |
| sha512sum1 rd,rs1     | Implements the Sum1 transformation function                             |
| •                     | as used in the SHA2-512 hash function (Section                          |
|                       | 4.1.3).                                                                 |
|                       | $rd \leftarrow ror64(rs1,14) \oplus ror64(rs1,18) \oplus ror64(rs1,41)$ |
| sm3p0 rd,rs1          | P0 transformation function as used in the SM3                           |
| 1 ,                   | hash function.                                                          |
|                       | 57                                                                      |
| sm3p1 rd,rs1          | P1 transformation function as used in the SM3                           |
| <u>r,</u>             | hash function.                                                          |
|                       | IIGGII I GIIO UIOII.                                                    |

 $<sup>\</sup>overline{\begin{tabular}{l} 56\, NIST, "Secure Hash Standard (SHS)."} \end{tabular} Federal Information Processing Standards Publication FIPS 180-4, Aug. 2015, [Online]. Available: doi.org. \\ 57" GB/T 32905-2016: SM3 Cryptographic Hash Algorithm." Also GM/T 0004-2012. Standardization Administration of China, Aug. 2016 Available at gmbz.org$ 

Implements a T-tables in hardware style approach to accelerating the SM4 Key Schedule. A byte is extracted from rs2 based on bs, to which the SBox and linear layer transforms are applied, before the result is XOR'd with rs1 and written back to rd. bs is a 2 bit constant that is multiplied by 8, and represents the number of bits to

Table 1.36: Atomic Memory Operation instructions

 $sbin \leftarrow (rs2[0..31] >> bs*8)[0..8]$ 

shift right to select the first byte of rs2.

|                        | Accelerates the block encrypt/decrypt operation      |
|------------------------|------------------------------------------------------|
| sm4ed rd, rs1, rs2, bs | of the SM4 block cipher                              |
|                        | Implements a T-tables in hardware style ap-          |
|                        | proach to accelerating the SM4 round function.       |
|                        | A byte is extracted from rs2 based on bs, to which   |
|                        | the SBox and linear layer transforms are applied,    |
|                        | before the result is XOR'd with rs1 and written      |
|                        | back to rd.                                          |
|                        | The bs parameter is encoded in bits 30 and 31.       |
|                        | The two bits are multiplied by 8, and represent      |
|                        | the shift amount for the algorithm.                  |
| xperm4 rd,rs1,rs2      | The xperm4 instruction operates on nibbles. The      |
|                        | rs1 register contains a vector of 16 4-bit elements, |
|                        | two per byte. The rs2 register contains a vector     |

# The xperm4 instruction operates on nibbles. The rs1 register contains a vector of 16 4-bit elements, two per byte. The rs2 register contains a vector of 16 4-bit indexes. The result is each element in rs2 replaced by the indexed element in rs1, or zero if the index into rs2 is out of bounds. xperm8 rd,rs1,rs2 The xperm4 instruction operates on nibbles. The rs1 register contains a vector of 8 8-bit elements. The rs2 register contains a vector of 8

The rs1 register contains a vector of 8 8-bit elements. The rs2 register contains a vector of 8 8-bit indexes. The result is each element in rs2 replaced by the indexed element in rs1, or zero if the index is out of bounds.

# 1.16 Instructions specific to the Thead processor

All these instructions are prefixed with the letters "th.".

Table 1.37: Thead instructions

| Instruction              | Description                                   |
|--------------------------|-----------------------------------------------|
| th.addsl rd,rs1,rs2,imm2 | $rd \leftarrow rs1 + (rs2 << imm2)$           |
|                          | Add with shifted register.                    |
| th.ext rd,rs1,imm1,imm2  | $rd \leftarrow rs1[imm1:imm2].$               |
|                          | Extract bits imm1 to imm2 with sign extension |
| th.extu rd,rs1,imm1,imm2 | $rd \leftarrow rs1[imm1:imm2].$               |
|                          | Extract bits imm1 to imm2 with zero extension |

Table 1.37: Thead instructions

| Table                                                                                                     | 1.57: I nead instructions                                                                                                                                                                                                                                                      |
|-----------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| th.ff0 rd,rs                                                                                              | Finds the first bit with the value of 0 from the highest bit of rs1 and writes the result back into the rd register. If the highest bit of rs1 is 0, the result 0 is returned. If all the bits in rs1 are 1, the result 64 is returned.                                        |
| th.ff1 rd,rs                                                                                              | Finds the first bit with the value of 1 from the highest bit of rs1 and writes the index of this bit back into rd. If the highest bit of rs1 is 1, the result 0 is returned. If all the bits in rs1 are 1, the result 64 is returned.                                          |
| th.lbia rd,(rs1),imm5,imm2 Post increment                                                                 | Load byte with post increment of rs1. The destination register is filled with the sign extended contents of mem[rs1]. Then, rs1 is incremented (or decremented) by imm5 « imm2                                                                                                 |
| th.lbuia rd,rs1,imm5,imm2 Post increment unsigned                                                         | Load byte unsigned with post increment of rs1. The destination register is filled with the zero extended contents of mem[rs1]. Then, rs1 is incremented (or decremented) by imm5 « imm2                                                                                        |
| th.lbib rd,rs1,imm5,imm2                                                                                  | Pre-increment load byte. Increments rs1 by imm5 « imm2. Then, loads a sign extended byte into rd from mem[rs1]                                                                                                                                                                 |
| th.lhia rd, (rs1), imm5, imm2. Post-increment. th.lhib rd, (rs1), imm5, imm2. Pre-increment.              | Loads sign extended half-word into rd. After the load, rs1 is incremented by imm5 « imm2.  Loads sign extended half-word into rd. Before the load, rs1 is incremented by imm5 « imm2.                                                                                          |
| th.lhuia rd, (rs1), imm5,<br>imm2. Post-increment.<br>th.lhuib rd, (rs1), imm5,<br>imm2. Pre-increment.   | Loads zero extended half-word into rd.After the load, rs1 is incremented by imm5 « imm2.  Loads zero extended half-word into rd.Before the load, rs1 is incremented by imm5 « imm2.                                                                                            |
| th.lwia rd, (rs1), imm5, imm2. Post-increment. th.lwib rd, (rs1), imm5,                                   | Loads sign extended word into rd. After the load, rs1 is incremented by imm5 « imm2.  Loads sign extended word into rd. Before the load,                                                                                                                                       |
| <pre>imm2. Pre-increment. th.lwuia rd, (rs1), imm5, imm2. Post-increment. th.lwuib rd, (rs1), imm5,</pre> | rs1 is incremented by imm5 « imm2.  Loads zero extended word into rd.After the load, rs1 is incremented by imm5 « imm2.  Loads zero extended word into rd.Before the load,                                                                                                     |
| imm2. Pre-increment. th.lwd rd,imm7(rs1)                                                                  | rs1 is incremented by imm5 « imm2.  Load two sign extended words into two consecutive registers. The rd register receives the first 32 bits, sign extended, and the register rd+1 receives the sign extended bits 32 to 63.                                                    |
| th.ldia rd, (rs1), imm5,<br>imm2 Post-increment<br>th.ldib rd, (rs1), imm5,                               | Loads 64 bits from *(rs1) into rd. Then increments rs1 by a sign extended (imm5 « imm2)  Increments rs1 by a sign extended (imm5 « imm2).                                                                                                                                      |
| th.rev rd,rs1                                                                                             | Then loads 64 bits from *(rs1) into rd.  Reverses the bytes in rs1.  rd[7] $\leftarrow$ rs[0] rd[6] $\leftarrow$ rs[1] rd[5] $\leftarrow$ rs[2]  rd[4] $\leftarrow$ rs[3] rd[3] $\leftarrow$ rs[4] rd[2] $\leftarrow$ rs[5]  rd[1] $\leftarrow$ rs[6] rd[0] $\leftarrow$ rs[7] |

Table 1.37: Thead instructions

| th.revw rd,rs1                        | Reverses the bytes in lower word of rs1.                                                                                                                              |
|---------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| on.icvw id,ibi                        | $rd[3] \leftarrow rs[0] rd[2] \leftarrow rs[1] rd[1] \leftarrow rs[2]$                                                                                                |
|                                       | $  \operatorname{rd}[0] \leftarrow \operatorname{rs}[0]  \operatorname{rd}[2] \leftarrow \operatorname{rs}[1]  \operatorname{rd}[1] \leftarrow \operatorname{rs}[2] $ |
| th tat nd na1 imm6                    |                                                                                                                                                                       |
| th.tst rd,rs1,imm6                    | rd contains the bit at position imm6 of rs1.                                                                                                                          |
| th.tstnbz rd, rs1                     | Tests for a zero byte in rs1. Each byte of rd will be                                                                                                                 |
|                                       | either 0xff (the corresponding byte is zero) or 0 (the                                                                                                                |
| ·                                     | corresponding byte is different than zero)                                                                                                                            |
| th.lbia rd,(rs1),imm5,imm2            | Post-increment.                                                                                                                                                       |
|                                       | $signExt(rd \leftarrow mem[rs1]);$                                                                                                                                    |
|                                       | $rs1 \leftarrow rs1 + imm5 << imm2$                                                                                                                                   |
|                                       | rd and rs1 must be different registers.                                                                                                                               |
| th.lbib rd,(rs1),imm5,imm2            | Pre-increment.                                                                                                                                                        |
|                                       | $rs1 \leftarrow rs1 + imm5 << imm2;$                                                                                                                                  |
|                                       | $signExt(rd \leftarrow mem[rs1])$                                                                                                                                     |
|                                       | rd and rs1 must be different registers.                                                                                                                               |
| th.lbuia rd,(rs1),imm5,imm2           | Post-increment.                                                                                                                                                       |
| , , , , ,                             | $zeroExt(rd \leftarrow mem[rs1]);$                                                                                                                                    |
|                                       | $rs1 \leftarrow rs1 + imm5 < < imm2$                                                                                                                                  |
|                                       | rd and rs1 must be different registers.                                                                                                                               |
| th.lbuib rd,(rs1),imm5,imm2           | Pre-increment.                                                                                                                                                        |
| 011.12.12. 24, (22.1, 1               | $rs1 \leftarrow rs1 + imm5 << imm2;$                                                                                                                                  |
|                                       | $zeroExt(rd \leftarrow mem[rs1])$                                                                                                                                     |
|                                       | rd and rs1 must be different registers.                                                                                                                               |
| th.ldd rd1,rd2, (rs1),imm2            | Load pair of registers.                                                                                                                                               |
| on. raa rar, raz, (151), ramz         | $address \leftarrow rs1 + zero \ extend(imm2 << 4)$                                                                                                                   |
|                                       | $rd1 \leftarrow mem[address + 7 : address]$                                                                                                                           |
|                                       | $rd2 \leftarrow mem[address + 15 : address + 8]$                                                                                                                      |
|                                       |                                                                                                                                                                       |
| th.ldia rd,(rs1),imm5,imm2            | Load byte with post increment                                                                                                                                         |
| ,                                     | $rd \leftarrow signExt(mem[rs1 + 7 : rs1])$                                                                                                                           |
|                                       | $rs1 \leftarrow rs1 + signExt(imm5 << imm2)$                                                                                                                          |
| th.ldib rd,(rs1),imm5,imm2            | Load byte with pre increment                                                                                                                                          |
| 511. Talb Ta, (151), 1111110, 1111112 | $rs1 \leftarrow rs1 + signExt(imm5 << imm2) \ rd \leftarrow$                                                                                                          |
|                                       | signExt(mem[rs1 + 7 : rs1])                                                                                                                                           |
|                                       |                                                                                                                                                                       |
| th.lhia rd,(rs1),imm5,imm2            | Load half word with post increment                                                                                                                                    |
| 511.1111d 1d, (151), 1111110, 1111112 | $rd \leftarrow signExt(mem[rs1 + 1 : rs1]) \ rs1 \leftarrow rs1 +$                                                                                                    |
|                                       | signExt(imm5 << imm2)                                                                                                                                                 |
| th.lhib rd,(rs1),imm5,imm2            | Load half word with pre increment                                                                                                                                     |
| cii. iiiib id, (isi), iiiiiis, iiiiiz | $rd \leftarrow signExt(mem[rs1 + 1 : rs1])$                                                                                                                           |
|                                       |                                                                                                                                                                       |
| th lhuin rd (mai) immE immo           | $rs1 \leftarrow rs1 + signExt(imm5 << imm2)$                                                                                                                          |
| th.lhuia rd,(rs1),imm5,imm2           | Load half word with post increment and zero extend.                                                                                                                   |
|                                       | $rd \leftarrow zeroExt(mem[rs1+1:rs1])$                                                                                                                               |
| #h 1h:h mJ (4) : 5 : 0                | $rs1 \leftarrow rs1 + signExt(imm5 << imm2)$                                                                                                                          |
| th.lhib rd,(rs1),imm5,imm2            | Load half word with pre increment and zero extend.                                                                                                                    |
|                                       | $rd \leftarrow zeroExt(mem[rs1+1:rs1])$                                                                                                                               |
|                                       | $rs1 \leftarrow rs1 + signExt(imm5 << imm2)$                                                                                                                          |
| th.lrb rd,rs1,imm2                    | Load sign extended byte with shifted register.                                                                                                                        |
|                                       | $rd \leftarrow signExt(mem[rs1 + (rs2 << imm2)])$                                                                                                                     |

Table 1.37: Thead instructions

| 20020                       | 1.01. Though instructions                                    |
|-----------------------------|--------------------------------------------------------------|
| th.lrbu rd,rs1,imm2         | Load zero extended byte with shifted register.               |
|                             | $rd \leftarrow zeroExt(mem[rs1 + (rs2 << imm2)])$            |
| th.lrd rd,rs1,imm2          | Load double word with shifted register.                      |
|                             | $rd \leftarrow mem[rs1 + (rs2 << imm2)]$                     |
| th.lrh rd,rs1,imm2          | Load half word with sign extend and shifted register.        |
|                             | $rd \leftarrow mem[rs1 + (rs2 << imm2)]$                     |
| th.lrhu rd,rs1,imm2         | Load half word with zero extend and shifted register.        |
|                             | $rd \leftarrow mem[rs1 + (rs2 << imm2)]$                     |
| th.lrw rd,rs1,imm2          | Load word with sign extend and shifted register.             |
|                             | $rd \leftarrow mem[rs1 + (rs2 << imm2)]$                     |
| th.lrwu rd,rs1,imm2         | Load word with zero extend and shifted register.             |
|                             | $rd \leftarrow mem[rs1 + (rs2 << imm2)]$                     |
| th.lurb rd,rs1,rs2,imm2     | Load byte, shift it, then sign extend result.                |
|                             | $   rd \leftarrow signExt(mem[rs1 + zeroExt(rs2[0:31])] <<$  |
|                             | imm2)                                                        |
| th.lurbu rd,rs1,rs2,imm2    | Load byte, shift it, then zero extend result.                |
|                             | $   rd \leftarrow zeroExt(mem[rs1 + zeroExt(rs2[0:31])] <<$  |
|                             | imm2)                                                        |
| th.lurd rd,rs1,rs2,imm2     | Load double word, shift the result.                          |
|                             | $rd \leftarrow mem[rs1 + zeroExt(rs2[0:31])] << imm2$        |
| th.lurh rd,rs1,rs2,imm2     | Load half word, shift the result.                            |
|                             | $rd \leftarrow signExt(mem[rs1 + zeroExt(rs2[0:31])] <<$     |
|                             | imm2)                                                        |
| th.lurhu rd,rs1,rs2,imm2    | Load half word, shift the result.                            |
|                             | $   rd \leftarrow zeroExt(mem[rs1 + zeroExt(rs2[0:31])] <<$  |
|                             | imm2)                                                        |
| th.lurw rd,rs1,rs2,imm2     | Load word, shift the result.                                 |
|                             | $   rd \leftarrow signExt(mem[rs1 + zeroExt(rs2[0:31])] <<$  |
|                             | imm2)                                                        |
| th.lurwu rd1,rd2,(rs1),imm2 | Load word register pair unsigned.                            |
|                             | $address \leftarrow rs1 + zeroExt(imm2 << 3)$                |
|                             | $rd1 \leftarrow signExt(mem[address + 3: address])$          |
|                             | $rd2 \leftarrow signExt(mem[address + 7: address + 4])$      |
| th.lwd rd,rs1,(rs2),imm2    |                                                              |
| th.mula rd,r1,r2            | Multiplies $r1 \times r2$ and adds the sign extended result  |
|                             | to rd. The result is then stored in rd.                      |
| th.mulah rd,r1,r2           | Multiplies the lower 16 bits of each $r1 \times r2$ and adds |
|                             | the sign extended result to the lower 16 bits of rd.         |
|                             | The result is then stored in rd.                             |
| th.mulaw rd,r1,r2           | Multiplies the lower 32 bits of each $r1 \times r2$ and adds |
|                             | the sign extended result to the lower 32 bits of rd.         |
|                             | The result is then stored in rd.                             |
| th.mulsw rd,r1,r2           | Multiplies the lower 32 bits of each $r1 \times r2$ and sub- |
|                             | tracts the sign extended result to the lower 32 bits         |
|                             | of rd. The result is then stored in rd.                      |
| th.mveqz rd,r1,r2           | Assigns r1 to rd if r2 is equal to zero. Otherwise rd        |
|                             | remains unchanged                                            |
| th.mvneqz rd,r1,r2          | Assigns r1 to rd if r2 is not equal to zero. Otherwise       |
|                             | rd remains unchanged                                         |
|                             | •                                                            |

3 4

5

9

10 11 }

Table 1.37: Thead instructions

| th.srri rd, rs1, imm6  | Shifts the original value of rs1 to the right by the amount specified, then rotates the result one bit to the right. |
|------------------------|----------------------------------------------------------------------------------------------------------------------|
| th.srriw rd, rs1, imm5 | Shifts the lower 32 bits of rs1 to the right, then rotates the result one bit to the right in 32 bits.               |

```
Exercise 13: Calculate \ ceil(log2(x))
     This function in C calculates \lceil log_2(x) \rceil:
static unsigned int bfd_log2(unsigned long x)
2 {
      unsigned int result = 0;
      if (x \le 1) return result;
          ++result;
      while ((x \geq 1) \neq 0);
      return result;
```

Do the same in 3 assembler instructions.

## Pseudo instructions 1.17

The accepted policy under risc-v is the opposite to the ARM64 assembler. Under ARM64, the assembler will issue an error if an instruction alias expands to more than one instruction. Here, it is quite the opposite, the assembler (and above all, the linker) is responsible for expanding high level macros.

Table 1.38: Pseudo instructions

| Pseudo              | Base instruction                    | Meaning                     |
|---------------------|-------------------------------------|-----------------------------|
| beqz rs, offset     | beq rs, x0, offset                  | Branch if $=$ zero          |
| bgez rs, offset     | bge rs, x0, offset                  | Branch if ≥zero             |
| bgt rs, rt, offset  | blt rt, rs, offset                  | Branch if >                 |
| bgtu rs, rt, offset | bltu rt, rs, offset                 | Branch if >, unsigned       |
| bgtz rs, offset     | blt x0, rs, offset                  | Branch if > zero            |
| ble rs, rt, offset  | bge rt, rs, offset                  | Branch if $\leq$            |
| bleu rs, rt, offset | bgeu rt, rs, offset                 | Branch if $\leq$ , unsigned |
| blez rs, offset     | bge x0, rs, offset                  | Branch if $\leq$ zero       |
| bltz rs, offset     | blt rs, x0, offset                  | Branch if < zero            |
| bnez rs, offset     | bne rs, x0, offset                  | Branch if $\neq$ zero       |
| call offset         | <pre>auipc x1, offset[31:12];</pre> | Call far-away subrou-       |
|                     | jalr x1, x1, offset[11:0]           | tine                        |
| fabs.d rd, rs       | fsgnjx.d rd, rs, rs                 | Double-precision abso-      |
|                     |                                     | lute value. Just an         |
|                     |                                     | alias.                      |
| fabs.s rd, rs       | fsgnjx.s rd, rs, rs                 | Single-precision abso-      |
|                     |                                     | lute value                  |
| fence               | fence iorw, iorw                    | Fence on all memory         |
|                     |                                     | and I/O                     |

Table 1.38: Pseudo instructions

|                                | Table 1.30. I seudo instructions    |                                |
|--------------------------------|-------------------------------------|--------------------------------|
| <pre>fl{w d} rd, symbol,</pre> | auipc rt, symbol[31:12];            | Floating-point load            |
| rt                             | fl{w d} rd, symbol[11:0](rt)        | global                         |
| fmv.d rd, rs                   | fsgnj.d rd, rs, rs                  | Copy double-precision          |
|                                |                                     | register                       |
| fmv.s rd, rs                   | fsgnj.s rd, rs, rs                  | Copy single-precision register |
| fneg.d rd, rs                  | fsgnjn.d rd, rs, rs                 | Double-precision               |
|                                |                                     | negate                         |
| fneg.s rd, rs                  | fsgnjn.s rd, rs, rs                 | Single-precision negate        |
| fs{w d} rd,symbol,             | auipc rt, symbol[31:12];            | Floating-point store           |
| rt                             | fs{w d} rd, symbol[11:0](rt)        | global                         |
| j offset                       | jal x0, offset                      | Jump                           |
| jal offset                     | jal x1, offset                      | Jump and link                  |
| jalr rs jalr                   | x1, rs, 0                           | Jump and link register         |
| jr rs                          | jalr x0, rs, 0                      | Jump register                  |
| $\frac{3}{1\{b h w d\}}$ rd,   | auipc rd, symbol[31:12];            | Load global                    |
| symbol                         | 1{b h w d} rd,                      |                                |
| J                              | symbol[11:0](rd)                    |                                |
| la rd, symbol                  | auipc rd, symbol@GOT[31:12];        | Load address With              |
|                                | lw d rd, symbol@GOT[11:0](rd)       | option pic                     |
| la rd, symbol                  | auipc rd, symbol[31:12];            | Load address With              |
| ia ia, symbol                  | addi rd, rd, symbol[11:0]           | option nopic (Default)         |
| li rd, immediate               | Myriad sequences                    | Load immediate                 |
| lla rd, symbol                 | auipc rd, symbol[31:12];            | Load local address             |
| ila id, symbol                 | addi rd, rd, symbol[31.12],         | Load local address             |
| mv rd, rs                      | addi rd, rs, 0                      | Copy register                  |
|                                | sub rd, x0, rs                      | Two's complement               |
| neg rd, rs                     |                                     | Two's complement               |
| negw rd, rs                    | subw rd, x0, rs                     | word                           |
| nop                            | addi x0, x0, 0                      | No operation                   |
| not rd, rs                     | xori rd, rs, -1                     | Ones' complement               |
| pause                          | fence w, 0                          | PAUSE hint                     |
| ret                            | jalr x0, x1, 0                      | Return from subrou-            |
|                                |                                     | tine                           |
| $s\{b h w d\}$ rd,             | <pre>auipc rt, symbol[31:12];</pre> | Store global                   |
| symbol, rt                     | $s{b h w d} rd,$                    |                                |
|                                | symbol[11:0](rt)                    |                                |
| seqz rd, rs                    | sltiu rd, rs, 1                     | Set if = zero                  |
| sext.b rd, rs                  | slli rd, rs, XLEN - 8;              | Sign extend byte It            |
|                                | srai rd, rd, XLEN - 8               | will expand to an-             |
|                                |                                     | other instruction se-          |
|                                |                                     | quence when B exten-           |
|                                |                                     | sion is available              |
| sext.h rd, rs                  | slli rd, rs, XLEN - 16;             | Sign extend half word          |
|                                | srai rd, rd, XLEN - 16              | It will expand to an-          |
|                                |                                     | other instruction se-          |
|                                |                                     | quence when B exten-           |
|                                |                                     | sion is available              |
| sext.w rd, rs                  | addiw rd, rs, 0                     | Sign extend word               |
| sgtz rd, rs                    | slt rd, x0, rs                      | Set if > zero                  |
|                                |                                     | ·                              |

| sltz rd, rs   | slt rd, rs, x0                      | Set if < zero           |
|---------------|-------------------------------------|-------------------------|
| snez rd, rs   | sltu rd, x0, rs                     | Set if $\neq$ zero      |
| tail offset   | <pre>auipc x6, offset[31:12];</pre> | Tail call far-away sub- |
|               | jalr x0, x6, offset[11:0]           | routine.                |
| zext.b rd, rs | andi rd, rs, 255                    | Zero extend byte        |
| zext.h rd, rs | slli rd, rs, XLEN - 16;             | Zero extend half word   |
|               | srli rd, rd, XLEN - 16              | It will expand to an-   |
|               |                                     | other instruction se-   |
|               |                                     | quence when B exten-    |
|               |                                     | sion is available       |
| zext.w rd, rs | slli rd, rs, XLEN - 32;             | Zero extend word It     |
|               | srli rd, rd, XLEN - 32              | will expand to an-      |
|               |                                     | other instruction se-   |
|               |                                     | quence when B exten-    |
|               |                                     | sion is available       |

Table 1.38: Pseudo instructions

Exercise 14: Mismatch between source and disassembly. Explain Consider this program:

When we assemble this with gcc -o addr.o addr.s we obtain an object file. When we disassemble it, we obtain:

Explain why this mismatch.

## 1.18 Interfacing with high level languages

- The first 8 arguments of a function, if they fit in 64 bits, are passed in the registers a0-a7.
- When more than 8 arguments are present, they are passed in the stack.
- Scalars that are  $2 \times 64$  bits wide are passed in a pair of argument registers, with the low-order 64 bits in the lower-numbered register and the high-order 64 bits in the higher-numbered register. Scalars wider than 128 bits are passed by reference. In C, the address of a copy is passed.
- Floating point arguments (16, 32 and 64 bits) are passed in the registers f10 to f17.
- Structures with two floating point fields (complex numbers, for instance) are passed using two floating points registers. If there isn't two floating point regs available they are passed using the integer protocol.
- Functions with variable number of arguments use the integer protocol.
- Values up to 128 bits are returned using the a0-a1 register pair, or the f0,f1 registers for floating point.

## 1.19 The full opcode table

Symbols used in the opcode table:

| $\operatorname{Sign}$ | Description                               |
|-----------------------|-------------------------------------------|
| $\oplus$              | Xor                                       |
| A                     | Not and                                   |
| ±<br>+                | Sign extension                            |
|                       | Zero extension                            |
| $\nabla$              | Macro.                                    |
| $\nabla$              | Alias                                     |
| $\sim$                | Rotate right                              |
| $op_u$                | Unsigned variant of op, for example $<_u$ |
| reg[idx]              | Bit idx of reg                            |
| reg[nm]               | Bits n through m inclusive of reg         |
| rm                    | Rounding mode                             |
| $\mathtt{uim} n$      | unsigned immediate with $n$ bits          |
| $\verb"imm" n$        | immediate with $n$ bits                   |
| $\mathcal{H}$         | Hypervisor instruction                    |
| $\approx$             | Rounding mode                             |

## Notes:

- $\bullet$  The symbol  $\leftarrow$  is used for assignment. = means equality.
- If the rounding mode is not explicitly mentioned, the rounding mode stored in the floating point control register is used.

| Instruction name          | Extens.  | Abstract   | Very short                            |
|---------------------------|----------|------------|---------------------------------------|
| and syntax                | required | parameters | description                           |
| add sp,sp,imm6            | C        | Cc,Cc,CL   | $sp \leftarrow sp + imm6 << 4$        |
| add rd',sp,imm8           | C        | Ct,Cc,CK   |                                       |
| add rd,rd,rs2'            | C        | d,CU,CV    |                                       |
| add rd,rd,imm6            | C        | d,CU,Co    |                                       |
| add rd,rs1',rd            | C        | d,CV,CU    |                                       |
| add rd,x0,rs2'            | C        | d,Cz,CV    |                                       |
| add rd,rs1,rs2            | I        | d,s,j      |                                       |
| add rd,rs1,rs2            | I        | d,s,t      |                                       |
| add rd,rs1,rs2,%tprel_add | I        | d,s,t,1    |                                       |
| add.uw                    | Zba      | d,s,t      |                                       |
| addi                      | C        | Cc,Cc,CL   |                                       |
| addi                      | C        | Ct,Cc,CK   |                                       |
| addi rd,rd,imm6           | С        | d,CU,Cj    | $rd \leftarrow rd + imm6$             |
| addi                      | С        | d,CU,z     |                                       |
| addi                      | С        | d,CV,z     |                                       |
| addi                      | C        | d,Cz,Co    |                                       |
| addi rd,rs1,imm12         | I        | d,s,j      | $rd \leftarrow rs1 + imm12$           |
| addiw rd,rs1,imm12        | С        | d,CU,Co    | $rd \leftarrow \pm rs1_{031} + imm12$ |
| addiw rd,rs,imm12         | I        | d,s,j      | $rd \leftarrow \pm rs_{031} + imm12$  |
| addw                      | С        | Cs,Ct,Cw   |                                       |
| addw                      | C        | Cs,Cw,Ct   |                                       |
| addw                      | C        | d,CU,Co    |                                       |
| addw rd,rs1,imm12         | I        | d,s,j      | $rd \leftarrow rs1 + imm12$           |

| T                             | Part           | Al-ma      | Variable in the second                     |
|-------------------------------|----------------|------------|--------------------------------------------|
| Instruction name              | Extens.        | Abstract   | Very short                                 |
| and syntax                    | required       | •          | description                                |
| add rd,rs1,rs2w               | I              | d,s,t      | $rd \leftarrow rs1 + rs2$                  |
| aes64ds rd, rs1, rs2          | Zknd           | d,s,t      | Round: InvShiftRows,                       |
|                               |                |            | InvSubBytes                                |
| aes64dsm rd, rs1, rs2         | Zknd           | d,s,t      | Round: InvShiftRows,                       |
|                               |                |            | InvSubBytes,                               |
|                               |                |            | InvMixColumns                              |
| aes64es rd, rs1, rs2          | Zkne           | d,s,t      | Round: ShiftRows,                          |
|                               |                |            | SubBytes                                   |
| aes64esm rd, rs1, rs2         | Zkne           | d,s,t      | Round: ShiftRows,                          |
|                               | <b>5</b> 1     |            | SubBytes, MixColumns                       |
| aes64im rd, rs1               | Zknd           | d,s        | KeySchedule:                               |
|                               | <b>5</b> 2 2 1 |            | InvMixColumns for Decrypt                  |
| aes64ks1i rd, rs1, rcon       | Zknd           | d,s,Y      | KeySchedule: SubBytes,                     |
|                               | Zkne           | 1 .        | Rotate, Round Const                        |
| aes64ks2 rd, rs1, rs2         | Zknd           | d,s,t      | KeySchedule: XOR                           |
| amanda d and                  | Zkne           | 4 + O(-)   | summation                                  |
| amoadd.d rd,rs2,(rs1),        | A              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 + rd                  |
|                               | Δ.             | d,t,0(s)   | rd  rd  *(rs1) *(rs1)  rs2 +               |
| amoadd.d.aq rd,rs2,(rs1)      | A              | a,t,0(s)   | rd←*(rs1) *(rs1)←rs2 + rd. Acquire.        |
|                               | A              | d,t,0(s)   | rd. Acquire.<br>rd←*(rs1) *(rs1)←rs2 +     |
| amoadd.d.aqrl<br>rd,rs2,(rs1) | A              | a,t,0(s)   | rd←*(rs1) *(rs1)←rs2 + rd. Acquire/release |
| amoadd.d.rl rd,rs2,(rs1)      | A              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 +                     |
| amoadd.d.ff fd,f52,(f5f)      | A              | u, c, o(s) | rd. Release                                |
| amoadd.w rd,rs2,(rs1)         | A              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 +                     |
| amoadd.w 1d,152,(151)         | A              | u, c, o(s) | rd. 32 bit.                                |
| amoadd.w.aq                   | A              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 +                     |
| amoudaaq                      |                | 4,0,0(5)   | rd. 32 bit acquire                         |
| amoadd.w.aqrl                 | Α              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 +                     |
| rd,rs2,(rs1)                  |                | ,-,-(-,    | rd.Acquire/Release                         |
| amoadd.w.rl rd,rs2,(rs1)      | A              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 +                     |
|                               |                |            | rd. Release.                               |
| amoand.d rd,rs2,(rs1)         | Α              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 ∧                     |
| , , , ,                       |                |            | rd                                         |
| amoand.d.aq rd,rs2,(rs1)      | A              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 ∧                     |
| • • •                         |                |            | rd. Acquire                                |
| amoand.d.aqrl                 | A              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 ∧                     |
| rd,rs2,(rs1)                  |                |            | rd. Acquire/Release                        |
| amoand.d.rl rd,rs2,(rs1)      | A              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 ∧                     |
|                               |                |            | rd. Release                                |
| amoand.w rd,rs2,(rs1)         | A              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 ∧                     |
|                               |                |            | rd. 32 bit                                 |
| amoand.w.aq rd,rs2,(rs1)      | A              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 ∧                     |
|                               |                |            | rd                                         |
| amoand.w.aqrl                 | A              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 ∧                     |
| rd,rs2,(rs1)                  |                |            | rd.Acquire/Release                         |
| amoand.w.rl rd,rs2,(rs1)      | A              | d,t,0(s)   | rd←*(rs1) *(rs1)←rs2 ∧                     |
|                               |                |            | rd. Release. 32 bits                       |
| amomax.d                      | A              | d,t,0(s)   | rd←*(rs1)                                  |
|                               |                |            | $*(rs1) \leftarrow max(rs1,rs2)$           |
| amomax.d.aq                   | A              | d,t,0(s)   |                                            |
| amomax.d.aqrl                 | A              | d,t,0(s)   |                                            |
| amomax.d.rl                   | A              | d,t,0(s)   |                                            |
| amomax.w                      | A              | d,t,0(s)   |                                            |
|                               |                |            | 1                                          |

| Instruction name      | Extens.  | Abstract   | Very short                               |
|-----------------------|----------|------------|------------------------------------------|
| and syntax            | required | parameters | description                              |
|                       | -        | -          | description                              |
| amomax.w.aq           | A        | d,t,0(s)   |                                          |
| amomax.w.aqrl         | A        | d,t,0(s)   |                                          |
| amomax.w.rl amomaxu.d | A        | d,t,0(s)   |                                          |
|                       | A        | d,t,0(s)   |                                          |
| amomaxu.d.aq          | A        | d,t,0(s)   |                                          |
| amomaxu.d.aqrl        | A        | d,t,0(s)   |                                          |
| amomaxu.d.rl          | A        | d,t,0(s)   |                                          |
| amomaxu.w             | A        | d,t,0(s)   |                                          |
| amomaxu.w.aq          | A        | d,t,0(s)   |                                          |
| amomaxu.w.aqrl        | A        | d,t,0(s)   |                                          |
| amomaxu.w.rl          | A        | d,t,0(s)   |                                          |
| amomin.d              | A        | d,t,0(s)   |                                          |
| amomin.d.aq           | A        | d,t,0(s)   |                                          |
| amomin.d.aqrl         | A        | d,t,0(s)   |                                          |
| amomin.d.rl           | A        | d,t,0(s)   |                                          |
| amomin.w              | A        | d,t,0(s)   |                                          |
| amomin.w.aq           | A        | d,t,0(s)   |                                          |
| amomin.w.aqrl         | A        | d,t,0(s)   |                                          |
| amomin.w.rl           | Α        | d,t,0(s)   |                                          |
| amominu.d             | Α        | d,t,0(s)   |                                          |
| amominu.d.aq          | A        | d,t,0(s)   |                                          |
| amominu.d.aqrl        | A        | d,t,0(s)   |                                          |
| amominu.d.rl          | A        | d,t,0(s)   |                                          |
| amominu.w             | A        | d,t,0(s)   |                                          |
| amominu.w.aq          | A        | d,t,0(s)   |                                          |
| amominu.w.aqrl        | A        | d,t,0(s)   |                                          |
| amominu.w.rl          | A        | d,t,0(s)   |                                          |
| amoor.d               | A        | d,t,0(s)   |                                          |
| amoor.d.aq            | A        | d,t,0(s)   |                                          |
| amoor.d.aqrl          | A        | d,t,0(s)   |                                          |
| amoor.d.rl            | A        | d,t,0(s)   |                                          |
| amoor.w               | A        | d,t,0(s)   |                                          |
| amoor.w.aq            | A        | d,t,0(s)   |                                          |
| amoor.w.aqrl          | A        | d,t,0(s)   |                                          |
| amoor.w.rl            | A        | d,t,0(s)   |                                          |
| amoswap.d             | A        | d,t,0(s)   |                                          |
| amoswap.d.aq          | A        | d,t,0(s)   |                                          |
| amoswap.d.aqrl        | A        | d,t,0(s)   |                                          |
| amoswap.d.rl          | A        | d,t,0(s)   |                                          |
| amoswap.w             | A        | d,t,0(s)   |                                          |
| amoswap.w.aq          | A        | d,t,0(s)   |                                          |
| amoswap.w.aqrl        | A        | d,t,0(s)   |                                          |
| amoswap.w.rl          | A        | d,t,0(s)   |                                          |
| amoxor.d              | A        | d,t,0(s)   | $rd \leftarrow mem[rs1]$                 |
|                       |          |            | $mem[rs1] \leftarrow rd \oplus rs2$      |
| amoxor.d.aq           | A        | d,t,0(s)   | $rd \leftarrow mem[rs1]$                 |
|                       |          |            | $\mid mem[rs1] \leftarrow rd \oplus rs2$ |
|                       |          |            | acquire *(rs1)                           |
| amoxor.d.aqrl         | A        | d,t,0(s)   | $rd \leftarrow mem[rs1]$                 |
| rd,rs2,(rs1)          |          |            | $\mid mem[rs1] \leftarrow rd \oplus rs2$ |
|                       |          |            | acquire/release *(rs1)                   |

| Instruction name   | Extens.  | Abstract      | Vory short                                               |
|--------------------|----------|---------------|----------------------------------------------------------|
| and syntax         | required | parameters    | Very short description                                   |
|                    | -        | 1             | <u>-</u>                                                 |
| amoxor.d.rl        | A        | d,t,0(s)      | $rd \leftarrow mem[rs1]$                                 |
|                    |          |               | $mem[rs1] \leftarrow rd \oplus rs2$                      |
|                    |          | 1 . 0( )      | release *(rs1)                                           |
| amoxor.w           | A        | d,t,0(s)      | $rd \leftarrow mem[rs1]_{032}$                           |
|                    | •        | 1 . 0( )      | $mem[rs1]_{032} \leftarrow rd \oplus rs2$                |
| amoxor.w.aq        | A        | d,t,0(s)      | $rd \leftarrow mem[rs1]_{032}$                           |
|                    |          |               | $mem[rs1]_{032} \leftarrow rd \oplus rs2$ acquire *(rs1) |
| omorrow tr o and   | A        | d,t,0(s)      | $rd \leftarrow mem[rs1_{032}]$                           |
| amoxor.w.aqrl      | н        | u, t, 0(s)    | $   mem[rs1]_{032} \leftarrow rd \oplus rs2 $            |
|                    |          |               | acquire/release *(rs1)                                   |
| amoxor.w.rl        | A        | d,t,0(s)      | $rd \leftarrow mem[rs1]_{032}$                           |
| amoxor.w.rr        | А        | u, t, 0(s)    | $mem[rs1]_{032} \leftarrow rd \oplus rs2$                |
|                    |          |               | release *(rs1)                                           |
| and                | C        | Cs,Ct,Cw      | Teleabe (IBI)                                            |
| and                | C        | Cs,Cw,Co      |                                                          |
| and                | C        | Cs,Cw,Ct      |                                                          |
| and rd,rs1,imm12   | I        | d,s,j         | $rd \leftarrow rs1 \wedge imm12$                         |
| and rd,rs1,rs2     | I        | d,s,t         | $rd \leftarrow rs1 \land rs2$                            |
| andi               | C        | Cs,Cw,Co      | 76 ( 761/(762                                            |
| andi               | I        | d,s,j         |                                                          |
| andn rd,rs1,imm12  | Zbb      | d,s,j         | $rd \leftarrow rs1   mm12$                               |
| andi 14,151,1mm12  | Zbkb     | α,υ, <u>J</u> | 76 ( 7517( 011011012                                     |
| andn rd,rs1,rs2    | Zbb I    | d,s,t         | $rd \leftarrow rs1 \wedge rs2$                           |
| andir ru, rbr, rbz | Zbkb     | α,υ,υ         | 74 ( 7317(732                                            |
| auipc rd,imm20     | I        | d,u           | $rd \leftarrow pc + imm20$                               |
| bclr rd,rs1,im6    | Zbs      | d,s,>         | $rd \leftarrow (rs1 \land \sim (1 << im6))$              |
| bclr               | Zbs      | d,s,t         |                                                          |
| bclri              | Zbs      | d,s,>         |                                                          |
| beq                | С        | Cs,Cz,Cp      |                                                          |
| beq                | I        | s,t,p         |                                                          |
| begz rs,lab        | С        | Cs,Cp         | $pc \leftarrow pc + lab \times (rs1 = rs2)$              |
| beqz rs,lab        | I        | s,p           | $pc \leftarrow pc + lab \times (rs = rs2)$               |
| bext               | Zbs      | d,s,>         |                                                          |
| bext               | Zbs      | d,s,t         |                                                          |
| bexti              | Zbs      | d,s,>         |                                                          |
| bge                | I        | s,t,p         |                                                          |
| bgeu               | I        | s,t,p         |                                                          |
| bgez               | I        | s,p           |                                                          |
| bgt rs1,rs2,lab    | I        | t,s,p         | $pc \leftarrow pc + lab \times (rs2 < rs1)$              |
| bgtu r1,r2,lab     | I        | t,s,p         | $pc \leftarrow pc + lab \times (r2 <_u r1)$              |
| bgtz r1,lab        | I        | t,p           | $pc \leftarrow pc + lab \times (0 < r1)$                 |
| binv rd,rs1,imm6   | Zbs      | d,s,>         | alias for binvi                                          |
| binv rd,rs1,rs2    | Zbs      | d,s,t         | $rd \leftarrow (\sim rs1[rs2]) \lor \sim (1 <<$          |
|                    |          | , ,           | $ rs2\rangle$                                            |
| binvi rd,rs1,imm6  | Zbs      | d,s,>         | $rd \leftarrow (\sim rs1[imm6]) \lor \sim$               |
|                    |          |               | $(1 << imm6)^{-58}$                                      |
| ble                | I        | t,s,p         |                                                          |
| bleu               | I        | t,s,p         |                                                          |
| blez               | I        | t,p           |                                                          |
| blt rs1,rs2,lab    | I        | s,t,p         | $pc \leftarrow pc + lab \times (rs1 < rs2)$              |
| bltu r1,r2,lab     | I        | s,t,p         | $pc \leftarrow pc + lab \times (r1 <_u r2)$              |
|                    |          | · ·•          | /                                                        |

 $<sup>^{58}\</sup>mathrm{rd}$  is assigned the value of rs1 with the bit at position imm6 inverted

|                    |          |                  | 1,,                                                    |
|--------------------|----------|------------------|--------------------------------------------------------|
| Instruction name   | Extens.  | Abstract         | Very short                                             |
| and syntax         | required | parameters       | description                                            |
| bltz rs1,lab       | I        | s,p              | $pc \leftarrow pc + lab \times (rs1 < 0)$              |
| bne rs1,rs2,label  | C        | Cs,Cz,Cp         | $pc \leftarrow pc + label \times (rs1 \neq rs2)$       |
| bne rs1,rs2,label  | I        | s,t,p            | $pc \leftarrow pc + label \times (rs1 \neq rs2)$       |
| bnez rs,label      | C        | Cs,Cp            | $\nabla \ pc \leftarrow pc + label \times (rs \neq 0)$ |
| bnez rs, label     | I        | s,p              | $\nabla \ pc \leftarrow pc + label \times (rs \neq 0)$ |
| brev8 rd, rs       | Zbkb     | d,s              | Reverse bits in byte.                                  |
|                    |          |                  | for (i=0;i<4;i++)                                      |
|                    |          |                  | $rd[i]_{07} \leftarrow rs[i]_{70}$                     |
| bset rd,rs,imm6    | Zbs      | d,s,>            | Alias of bseti                                         |
| bset rd,rs1,rs2    | Zbs      | d,s,t            | Set bit:                                               |
|                    |          |                  | $rd \leftarrow rs$                                     |
|                    |          | •                | $rd[rs2] \leftarrow 1$                                 |
| bseti rd,rs1,imm6  | Zbs      | d,s,>            | Set bit:                                               |
|                    |          |                  | $rd \leftarrow rs$                                     |
|                    |          | d,CV             | $rd[imm6] \leftarrow 1$                                |
| c.add<br>c.addi    | C        |                  |                                                        |
|                    | C        | d,Co             |                                                        |
| c.addi16sp         | C        | Cc,CL            |                                                        |
| c.addi4spn         | C        | Ct,Cc,CK         |                                                        |
| c.addiw            | C        | d,Co             |                                                        |
| c.addw             | C        | Cs,Ct            |                                                        |
| c.and              | C        | Cs,Ct            |                                                        |
| c.andi             | C        | Cs,Co            |                                                        |
| c.beqz             | C        | Cs,Cp            |                                                        |
| c.bnez             | C        | Cs,Cp            |                                                        |
| c.ebreak           | C        |                  |                                                        |
| c.fld              | D&C      | CD,C1(Cs)        |                                                        |
| c.fldsp            | D&C      | D,Cn(Cc)         |                                                        |
| c.flw              | F&C      | CD,Ck(Cs)        |                                                        |
| c.flwsp            | F&C      | D,Cm(Cc)         |                                                        |
| c.fsd              | D&C      | CD,Cl(Cs)        |                                                        |
| c.fsdsp            | D&C      | CT,CN(Cc)        |                                                        |
| c.fsw              | F&C      | CD,Ck(Cs)        |                                                        |
| c.fswsp            | F&C      | CT,CM(Cc)        |                                                        |
| c.j                | C        | Ca               |                                                        |
| c.jalr             |          | d                |                                                        |
| c.jr               | C        | d<br>Ct,Cl(Cs)   |                                                        |
| c.ld               | C        |                  |                                                        |
| c.ldsp             | C        | d,Cn(Cc)<br>d,Co |                                                        |
| c.li               | C        |                  |                                                        |
| c.lui              | C        | d,Cu             |                                                        |
| c.lw               |          | Ct,Ck(Cs)        |                                                        |
| c.lwsp             | C        | d,Cm(Cc)         |                                                        |
| c.mv               | C        | d,CV             |                                                        |
| c.nop              | C        |                  |                                                        |
| c.nop              |          | Cj<br>Ca Ct      |                                                        |
| c.or               | C        | Cs,Ct            |                                                        |
| c.sd               | C        | Ct,Cl(Cs)        | mom[on   wime < 2]                                     |
| c.sdsp rs,uim6(sp) | C        | CV,CN(Cc)        | $mem[sp + uim6 << 3] \leftarrow rs$                    |
| c.slli             | C        | d,C>             |                                                        |
| c.slli64           | C        | d<br>C= C>       |                                                        |
| c.srai             | C        | Cs,C>            |                                                        |
| c.srai64           | C        | Cs               |                                                        |

| Instruction name | Extens.       | Abstract            | Very short             |
|------------------|---------------|---------------------|------------------------|
| and syntax       | required      |                     | description            |
| -                |               |                     | description            |
| c.srli           | C             | Cs,C>               |                        |
|                  | C             |                     |                        |
| c.sub            | C             | Cs,Ct               |                        |
|                  | C             | Cs,Ct               |                        |
| C.SW             | C             | Ct,Ck(Cs) CV,CM(Cc) |                        |
| c.swsp           | C             |                     |                        |
| c.unimp          | C             |                     |                        |
| c.xor            |               | Cs,Ct               |                        |
|                  | I             | C                   |                        |
| call             | I<br>Zi aham  | d,c                 |                        |
| cbo.clean        | Zicbom        | 0(s)                |                        |
| cbo.flush        | Zicbom        | 0(s)                |                        |
| cbo.inval        | Zicbom        | 0(s)                |                        |
| cbo.zero         | Zicboz        | 0(s)                |                        |
| clmul            | Zbc  <br>Zbkc | d,s,t               |                        |
| clmulh           | Zbc  <br>Zbkc | d,s,t               |                        |
| clmulr           | Zbc           | d,s,t               |                        |
| clz              | Zbb           | d,s                 |                        |
| clzw             | Zbb           | d,s                 |                        |
| срор             | Zbb           | d,s                 |                        |
| срорш            | Zbb           | d,s                 |                        |
| csrc csreg,x0    | Zicsr         | E,Z                 | Clears bits in control |
| obio obiog, no   | 21001         | 2,2                 | reg                    |
| csrc creg,rs1    | Zicsr         | E,s                 |                        |
| csrci creg,imm12 | Zicsr         | E,Z                 | Clears bits of the     |
| 3,               |               |                     | control reg            |
| csrr             | Zicsr         | d,E                 |                        |
| csrrc            | Zicsr         | d,E,Z               |                        |
| csrrc            | Zicsr         | d,E,s               |                        |
| csrrci           | Zicsr         | d,E,Z               |                        |
| csrrs            | Zicsr         | d,E,Z               |                        |
| csrrs            | Zicsr         | d,E,s               |                        |
| csrrsi           | Zicsr         | d,E,Z               |                        |
| csrrw            | Zicsr         | d,E,Z               |                        |
| csrrw            | Zicsr         | d,E,s               |                        |
| csrrwi           | Zicsr         | d,E,Z               |                        |
| csrs             | Zicsr         | E,Z                 |                        |
| csrs             | Zicsr         | E,s                 |                        |
| csrsi            | Zicsr         | E,Z                 |                        |
| csrw             | Zicsr         | E,Z                 |                        |
| csrw             | Zicsr         | E,s                 |                        |
| csrwi            | Zicsr         | E,Z                 |                        |
| ctz              | Zbb           | d,s                 |                        |
| ctzw             | Zbb           | d,s                 |                        |
| div              | M             | d,s,t               |                        |
| divu             | M             | d,s,t               |                        |
| divuw            | M             | d,s,t               |                        |
| divw             | M             | d,s,t               |                        |
| dret             | I             |                     |                        |
| ebreak           | C             |                     |                        |
| ebreak           | I             |                     |                        |
|                  | _             |                     | 1                      |

|           |          |            | Very short                            |
|-----------|----------|------------|---------------------------------------|
|           | required | parameters | description                           |
| ecall     | I        |            | <u> </u>                              |
|           | D        | D,U        | $rd \leftarrow (rs < 0)? - rs : rd$   |
| · ·       | Zfh      | D,U        | $rd \leftarrow (rs < 0)? - rs : rd$   |
| · · ·     | Q        | D,U        | $rd \leftarrow (rs < 0)? - rs : rd$   |
|           | F        | D,U        | $rd \leftarrow (rs < 0)? - rs : rd$   |
| · ·       | D        | D,S,T      | 14 (13 < 0): 13.14                    |
|           | D        | D,S,T,m    | $rd \leftarrow rs1 + rs2$             |
|           | Zfh      | D,S,T      | $rd \leftarrow rs1 + rs2$             |
|           | Zfh      | D,S,T,m    | $rd \leftarrow rs1 + rs2, \approx rm$ |
|           | Q        | D,S,T      | 74 ( 751   752,70 771                 |
|           | Q        | D,S,T,m    |                                       |
| -         | F        | D,S,T      |                                       |
|           | F        | D,S,T,m    |                                       |
|           | D        | d,S        |                                       |
|           | Zfh      | d,S        |                                       |
|           | Q        | d,S        |                                       |
| •         | F        | d,S        |                                       |
|           | Zfhmin&D | D,S        |                                       |
| ·         | D        | D,s        | $frd \leftarrow rs1$                  |
|           | D        | D,s,m      | $frd \leftarrow rs1 \approx rm$       |
|           | D        | D,s        | J14 \ 131 ~ 1111                      |
|           | D        | D,s,m      |                                       |
|           | Q        | D,S,m      |                                       |
| •         | Q        | D,S,m      |                                       |
| •         | D        | D,S,m      |                                       |
|           | D        | D,s        |                                       |
|           | D        | D,s        |                                       |
|           | Zfhmin&D | D,S        |                                       |
|           | Zfhmin&D | D,S,m      |                                       |
|           | Zfh      | D,s        |                                       |
|           | Zfh      | D,s,m      |                                       |
|           | Zfh      | D,s        |                                       |
|           | Zfh      | D,s,m      |                                       |
|           | Zfhmin&Q | D,S        |                                       |
| •         | Zfhmin   | D,S,m      |                                       |
| 1         | Zfhmin   | D,S        |                                       |
|           |          | D,S,m      |                                       |
|           | Zfh      | D,s        |                                       |
|           | Zfh      | D,s,m      |                                       |
|           | Zfh      | D,s        |                                       |
|           | Zfh      | D,s,m      |                                       |
|           | D        | d,S        |                                       |
|           | D        | d,S,m      |                                       |
|           | Zfh      | d,S        |                                       |
|           | Zfh      | d,S,m      |                                       |
|           | Q        | d,S        |                                       |
| _         | Q        | d,S,m      |                                       |
|           | F        | d,S        |                                       |
| fcvt.l.s  | F        | d,S,m      |                                       |
| fcvt.lu.d | D        | d,S        |                                       |
| fcvt.lu.d | D        | d,S,m      |                                       |
| fcvt.lu.h | Zfh      | d,S        |                                       |
| fcvt.lu.h | Zfh      | d,S,m      |                                       |

| And syntax                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | Instruction name  | Extens.  | Abstract | Very short                              |
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------|----------|----------|-----------------------------------------|
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |                   |          |          |                                         |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | ·                 |          |          | describition                            |
| fcvt.lu.s         F         d,S,m           fcvt.q.d         Q         D,S           fcvt.q.h         Zfhmin&Q         D,S           fcvt.q.h         Q         D,s           fcvt.q.l         Q         D,s,m           fcvt.q.lu         Q         D,s,m           fcvt.q.u         Q         D,s,m           fcvt.q.s         Q         D,S           fcvt.q.w         Q         D,S           fcvt.q.w         Q         D,S           fcvt.s.d         D         D,S           fcvt.s.d         D         D,S           fcvt.s.d         D         D,S           fcvt.s.l         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.q         Q         D,S           fcvt.s.q         Q         D,S           fcvt.s.q         Q         D,S,m           fcvt.s.w         F         D,s,m           fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S           fcvt.w.d         D                                                                                                                                                                       | _                 | •        |          |                                         |
| fcvt.q.d         Q         D,S           fcvt.q.d         Q         D,S           fcvt.q.h         Zfhmin&Q         D,S           fcvt.q.1         Q         D,s,m           fcvt.q.lu         Q         D,s,m           fcvt.q.lu         Q         D,s,m           fcvt.q.w         Q         D,S           fcvt.s.d         D         D,S,m           fcvt.s.d         D         D,S,m           fcvt.s.d         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.q         Q         D,S           fcvt.s.q         Q         D,S,m           fcvt.s.q         Q         D,S,m           fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.w.d         D<                                                                                                                                                                 | <del>-</del>      |          |          |                                         |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |                   |          |          |                                         |
| fcvt.q.h         Zfhmin&Q         D,S           fcvt.q.1         Q         D,s,m           fcvt.q.1         Q         D,s,m           fcvt.q.lu         Q         D,s,m           fcvt.q.w         Q         D,s           fcvt.q.w         Q         D,s           fcvt.q.w         Q         D,s           fcvt.s.d         D         D,S,m           fcvt.s.d         D         D,S,m           fcvt.s.d         D         D,S,m           fcvt.s.d         F         D,s           fcvt.s.lu         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.q         Q         D,S,m           fcvt.s.q         Q         D,S,m           fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S           fcvt.w.d         D         d,S,m           fcvt.w.q         Q         d,S,m           fcvt.w.q                                                                                                                                                                   |                   |          |          |                                         |
| $ \begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | _                 |          |          |                                         |
| fcvt.q.lu         Q         D,s,m           fcvt.q.lu         Q         D,s,m           fcvt.q.s         Q         D,s,m           fcvt.q.w         Q         D,s           fcvt.q.wu         Q         D,s           fcvt.s.d         D         D,S           fcvt.s.d         D         D,S,m           fcvt.s.d         D         D,S,m           fcvt.s.d         F         D,s,m           fcvt.s.l         F         D,s,m           fcvt.s.l         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.q         Q         D,S           fcvt.s.q         Q         D,S,m           fcvt.s.w         F         D,s,m           fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S,m           fcvt.w.h                                                                                                                                                                   |                   |          |          |                                         |
| fcvt.q.lu         Q         D,s,m           fcvt.q.s         Q         D,s,m           fcvt.q.w         Q         D,s           fcvt.q.wu         Q         D,s           fcvt.s.d         D         D,S,m           fcvt.s.d         D         D,S,m           fcvt.s.d         F         D,s           fcvt.s.l         F         D,s           fcvt.s.l         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.q         Q         D,S,m           fcvt.s.q         Q         D,S,m           fcvt.s.w         F         D,s           fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S,m           fcvt.w.h         rd,fs         rd[032] ← fs[016]           fcvt.w.q         Q         d,S,m           fc                                                                                                                                                            |                   | -        |          |                                         |
| $ \begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |                   |          |          |                                         |
| $ \begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |                   |          |          |                                         |
| fcvt.q.w         Q         D,s           fcvt.s.d         D         D,S           fcvt.s.d         D         D,S,m           fcvt.s.d         D         D,S,m           fcvt.s.h         D         D,S,m           fcvt.s.l         F         D,s           fcvt.s.l         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.q         Q         D,S,m           fcvt.s.q         Q         D,S,m           fcvt.s.w         F         D,s,m           fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S,m           fcvt.w.d         D         d,S,m           fcvt.w.h rd,fs         Zfh         d,S           fcvt.w.q         Q         d,S,m           fcvt.w.q         Q         d,S,m           fcvt.w.q         Q         d,S,m           fcvt.w.s         F         d,S           fcvt.w.d                                                                                                                                                                  | -                 | -        |          |                                         |
| $ \begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | _                 |          |          |                                         |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | _                 | -        | D,s      |                                         |
| fcvt.s.d         D         D,S,m $fd_{[032]} \leftarrow fs_{[016]}$ fcvt.s.l         F         D,s $fd_{[032]} \leftarrow fs_{[016]}$ fcvt.s.l         F         D,s,m $fcvt.s.m$ fcvt.s.lu         F         D,s,m $fcvt.s.m$ fcvt.s.q         Q         D,S,m $fcvt.s.m$ fcvt.s.q         Q         D,S,m $fcvt.s.m$ fcvt.s.wu         F         D,s,m $fcvt.s.m$ fcvt.s.wu         F         D,s,m $fcvt.s.m$ fcvt.s.wu         F         D,s,m $fcvt.s.m$ fcvt.w.d         D         d,S,m $fclour.s.m$ fcvt.w.d         D         d,S,m $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.q         Q         d,S,m $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.q         Q         d,S,m $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.q         Q                                                                                                                                                                                                                            |                   | Q        |          |                                         |
| fcvt.s.h fd,fs         Zfhmin         D,S $fd_{[032]} \leftarrow fs_{[016]}$ fcvt.s.l         F         D,s           fcvt.s.lu         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.q         Q         D,S,m           fcvt.s.q         Q         D,S,m           fcvt.s.w         F         D,s           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S,m           fcvt.w.q         Q         d,S,m           fcvt.w.s         F         d,S,m           fcvt.w.s         F         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.d         Q                                                                                                                                                          |                   | D        | D,S      |                                         |
| fcvt.s.1         F         D,s           fcvt.s.1u         F         D,s,m           fcvt.s.lu         F         D,s,m           fcvt.s.q         Q         D,S,m           fcvt.s.q         Q         D,S,m           fcvt.s.w         F         D,s           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S           fcvt.w.d         D         d,S           fcvt.w.d.p         Q         d,S,m           fcvt.w.d         Q         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.b <td></td> <td></td> <td></td> <td></td>                                                                                                                   |                   |          |          |                                         |
| fcvt.s.lu         F         D,s           fcvt.s.lu         F         D,s           fcvt.s.q         Q         D,S           fcvt.s.q         Q         D,S,m           fcvt.s.w         F         D,s           fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S           fcvt.w.d         D         d,S,m           fcvt.w.h rd,fs,rm         Zfh         d,S         rd[0.32] ← fs[0.16] ≈ r.           fcvt.w.q         Q         d,S,m         rd[0.32] ← fs[0.16] ≈ r.         rc[0.32] ← fs[0.16] ≈ r.           fcvt.w.q         Q         d,S,m         rd[0.32] ← fs[0.16] ≈ r.         rc[0.32] ← fs[0.16] ≈ r.           fcvt.w.q         Q         d,S,m         rd[0.32] ← fs[0.16] ≈ r.         rc[0.32] ← fs[0.16] ≈ r.           fcvt.w.q         Q         d,S,m         rd[0.32] ← fs[0.16] ≈ r.         rc[0.32] ← fs[0.16] ≈ r.           fcvt.w.s         F         d,S         rc[0.32] ← fs[0.16] ≈ r.         rc[0.32] ← fs[0.16] ≈ r.           fcvt.w.a         Q         d,S,m         rc[0.32] ←               | ·                 |          |          | $fd_{[032]} \leftarrow fs_{[016]}$      |
| fcvt.s.lu         F         D,s,m           fcvt.s.q         Q         D,S,m           fcvt.s.w         F         D,s,m           fcvt.s.w         F         D,s,m           fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         D         d,S,m           fcvt.w.d         D         d,S,m           fcvt.w.h rd,fs         Zfh         d,S,m           fcvt.w.q         Q         d,S,m           fcvt.w.q         Q         d,S,m           fcvt.w.q         Q         d,S,m           fcvt.w.s         F         d,S           fcvt.w.s         F         d,S,m           fcvt.w.d         D         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.d         Q         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S           fcvt.wu.s         F         d,S,m           fdiv.d                                                                                                                                                        |                   |          |          |                                         |
| fcvt.s.q         Q         D,s,m           fcvt.s.q         Q         D,S           fcvt.s.w         F         D,s           fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S           fcvt.w.d         D         d,S,m           fcvt.w.h rd,fs         Zfh         d,S,m           fcvt.w.h rd,fs,rm         Zfh         d,S,m           fcvt.w.q         Q         d,S,m           fcvt.w.q         Q         d,S,m           fcvt.w.s         F         d,S           fcvt.w.s         F         d,S,m           fcvt.w.d         D         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S,m           fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T,m <td< td=""><td></td><td>F</td><td>D,s,m</td><td></td></td<>                                                                                                      |                   | F        | D,s,m    |                                         |
| fcvt.s.q         Q         D,S,m           fcvt.s.w         F         D,s           fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S,m           fcvt.w.d         D         d,S,m           fcvt.w.h rd,fs,rm         Zfh         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.d         Q         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.q         Q         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.s         F         d,S         rd[032] ← fs[016] ≈ r.           fcvt.w.a         F         d,S<                                                                   | fcvt.s.lu         | F        |          |                                         |
| fcvt.s.q         Q         D,S,m           fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S           fcvt.w.d         D         d,S,m           fcvt.w.d         D         d,S,m           fcvt.w.h rd,fs         Zfh         d,S,m         rd[032] ← fs[016] ≈ r           fcvt.w.d         D         d,S,m         rd[032] ← fs[016] ≈ r           fcvt.w.q         Q         d,S,m         rd[032] ← fs[016] ≈ r </td <td>fcvt.s.lu</td> <td>F</td> <td>D,s,m</td> <td></td> | fcvt.s.lu         | F        | D,s,m    |                                         |
| fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S           fcvt.w.d         D         d,S,m           fcvt.w.d         D         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.d         D         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.d         Q         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.d         Q         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.q         Q         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.d                                                   | fcvt.s.q          | Q        | D,S      |                                         |
| fcvt.s.w         F         D,s,m           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S           fcvt.w.d         D         d,S,m           fcvt.w.h rd,fs         Zfh         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.h rd,fs,rm         Zfh         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.q         Q         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.s         F         d,S         re           fcvt.w.s         F         d,S         re           fcvt.w.s         F         d,S         re           fcvt.wu.d         Q <td>fcvt.s.q</td> <td>Q</td> <td>D,S,m</td> <td></td>   | fcvt.s.q          | Q        | D,S,m    |                                         |
| fcvt.s.wu         F         D,s           fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S           fcvt.w.h rd,fs         Zfh         d,S         rd[032] ← fs[016] ≈ r.           fcvt.w.h rd,fs,rm         Zfh         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.h rd,fs,rm         Zfh         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.q         Q         d,S,m         rd[032] ← fs[016] ≈ r.           fcvt.w.s         F         d,S         r.           fcvt.w.s         F         d,S         r.           fcvt.w.s         F         d,S         r.           fcvt.wu.d         D         d,S,m         r.           fcvt.wu.q         Q         d,S,m         r.           fcvt.wu.s <t< td=""><td>fcvt.s.w</td><td>F</td><td>D,s</td><td></td></t<>  | fcvt.s.w          | F        | D,s      |                                         |
| fcvt.s.wu         F         D,s,m           fcvt.w.d         D         d,S           fcvt.w.h rd,fs         Zfh         d,S $rd[032] \leftarrow fs[016]$ fcvt.w.h rd,fs,rm         Zfh         d,S,m $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.q         Q         d,S,m $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.s         F         d,S $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.q         Q         d,S,m $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.s         F         d,S $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.s         F         d,S $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.s         F         d,S $rd[032] \leftarrow fs[016] \approx r$                                                                       | fcvt.s.w          | F        | D,s,m    |                                         |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | fcvt.s.wu         | F        | D,s      |                                         |
| fcvt.w.d         D         d,S,m $rd[032] \leftarrow fs[016]$ fcvt.w.h rd,fs         Zfh         d,S,m $rd[032] \leftarrow fs[016]$ fcvt.w.q         Q         d,S,m $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.s         F         d,S $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.s         F         d,S,m $rd[032] \leftarrow fs[016] \approx r$ fcvt.wu.d         D         d,S $rd[032] \leftarrow fs[032] \approx r$ fcvt.wu.d         Q <t< td=""><td>fcvt.s.wu</td><td>F</td><td>D,s,m</td><td></td></t<>           | fcvt.s.wu         | F        | D,s,m    |                                         |
| fcvt.w.h rd,fs         Zfh         d,S $rd[032] \leftarrow fs[016]$ fcvt.w.h rd,fs,rm         Zfh         d,S,m $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.q         Q         d,S,m           fcvt.w.q         Q         d,S,m           fcvt.w.s         F         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S           fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T,m           fdiv.s         F         D,S,T,m           force         I         P,Q                                                                                                                                                                                                                                                      |                   | D        | d,S      |                                         |
| fcvt.w.h rd,fs,rm         Zfh         d,S,m $rd[032] \leftarrow fs[016] \approx r$ fcvt.w.q         Q         d,S,m           fcvt.w.s         F         d,S,m           fcvt.w.s         F         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T,m           fdiv.s         F         D,S,T,m           force         I          V fence iorw,iorw                                                                                                                                                                                                                                                            | fcvt.w.d          | D        | d,S,m    |                                         |
| fcvt.w.q         Q         d,S,m           fcvt.w.s         F         d,S,m           fcvt.w.s         F         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S,m           fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T,m           fdiv.s         F         D,S,T,m           fence         I          V fence iorw,iorw                                                                                                                                                                                                                                                                                                                                                                                          | fcvt.w.h rd,fs    | Zfh      | d,S      |                                         |
| fcvt.w.q         Q         d,S,m           fcvt.w.s         F         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S,m           fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T,m           fdiv.s         F         D,S,T,m           fence         I          V fence iorw,iorw                                                                                                                                                                                                                                                                                                                                                                                                                                     | fcvt.w.h rd,fs,rm | Zfh      | d,S,m    | $rd[032] \leftarrow fs[016] \approx rm$ |
| fcvt.w.s         F         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S,m           fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T,m           fdiv.s         F         D,S,T,m           fence         I          V fence iorw,iorw                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |                   | Q        | d,S      |                                         |
| fcvt.w.s         F         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.d         D         d,S,m           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S,m           fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          V fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                              | fcvt.w.q          | Q        | d,S,m    |                                         |
| fcvt.wu.d         D         d,S           fcvt.wu.d         D         d,S,m           fcvt.wu.h         Zfh         d,S           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S,m           fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          V fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                | fcvt.w.s          | F        | d,S      |                                         |
| fcvt.wu.d         D         d,S,m           fcvt.wu.h         Zfh         d,S           fcvt.wu.h         Zfh         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S           fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          ∇ fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | fcvt.w.s          | F        | d,S,m    |                                         |
| fcvt.wu.h         Zfh         d,S           fcvt.wu.q         Q         d,S,m           fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S           fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          ∇ fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | fcvt.wu.d         | D        | d,S      |                                         |
| fcvt.wu.h         Zfh         d,S,m           fcvt.wu.q         Q         d,S           fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          ∇ fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | fcvt.wu.d         |          |          |                                         |
| fcvt.wu.q       Q       d,S         fcvt.wu.q       Q       d,S,m         fcvt.wu.s       F       d,S,m         fdiv.d       D       D,S,T         fdiv.d       D       D,S,T,m         fdiv.h       Zfh       D,S,T         fdiv.h       Zfh       D,S,T,m         fdiv.q       Q       D,S,T         fdiv.q       Q       D,S,T,m         fdiv.s       F       D,S,T         fdiv.s       F       D,S,T,m         fence       I        ∇ fence iorw,iorw         fence       I       P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | fcvt.wu.h         | Zfh      | d,S      |                                         |
| fcvt.wu.q         Q         d,S,m           fcvt.wu.s         F         d,S,m           fdv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T           fdiv.h         Zfh         D,S,T           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          V fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | fcvt.wu.h         | Zfh      | d,S,m    |                                         |
| fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          V fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |                   | Q        | d,S      |                                         |
| fcvt.wu.s         F         d,S,m           fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T           fdiv.q         Q         D,S,T           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          V fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |                   | Q        |          |                                         |
| fdiv.d         D         D,S,T           fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          V fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | fcvt.wu.s         | F        | d,S      |                                         |
| fdiv.d         D         D,S,T,m           fdiv.h         Zfh         D,S,T           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          ∇ fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | fcvt.wu.s         | F        | d,S,m    |                                         |
| fdiv.h         Zfh         D,S,T           fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          ∇ fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |                   | D        |          |                                         |
| fdiv.h         Zfh         D,S,T,m           fdiv.q         Q         D,S,T           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          ∇ fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | fdiv.d            | D        | D,S,T,m  |                                         |
| fdiv.q         Q         D,S,T           fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          ∇ fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |                   | Zfh      |          |                                         |
| fdiv.q         Q         D,S,T,m           fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          ∇ fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |                   | Zfh      |          |                                         |
| fdiv.s         F         D,S,T           fdiv.s         F         D,S,T,m           fence         I          ∇ fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | _                 | Q        |          |                                         |
| fdiv.s         F         D,S,T,m           fence         I          ∇ fence iorw,iorw           fence         I         P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | fdiv.q            | Q        |          |                                         |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |                   | F        | D,S,T    |                                         |
| fence I P,Q                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | fdiv.s            | F        | D,S,T,m  |                                         |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | fence             | I        |          | $\nabla$ fence iorw,iorw                |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |                   | I        | P,Q      |                                         |
| fence.i Zifencei                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | fence.i           | Zifencei |          |                                         |

| Instruction name        | Extens.  | Abstract         | Very short                    |
|-------------------------|----------|------------------|-------------------------------|
| and syntax              | required | parameters       | description                   |
| fence.tso               | I        |                  | 1                             |
| feq.d                   | D        | d,S,T            |                               |
| feq.h                   | Zfh      | d,S,T            |                               |
| feq.q                   | Q        | d,S,T            |                               |
| feq.s                   | F        | d,S,T            |                               |
| fge.d                   | D        | d,T,S            |                               |
| fge.h                   | Zfh      | d,T,S            |                               |
| fge.q                   | Q        | d,T,S            |                               |
| fge.s                   | F        | d,T,S            |                               |
| fgt.d                   | D        | d,T,S            |                               |
| fgt.h                   | Zfh      | d,T,S            |                               |
| fgt.q                   | Q        | d,T,S            |                               |
| fgt.s                   | F        | d,T,S            |                               |
| fld                     | D        | D,A,s            |                               |
| fld                     | D        | D,o(s)           |                               |
| fld                     | D&C      | CD,C1(Cs)        |                               |
| fld                     | D&C      | D,Cn(Cc)         |                               |
| fle.d                   | D&C      | d,S,T            |                               |
| fle.h                   | Zfh      | d,S,T            |                               |
| fle.q                   | Q        | d,S,T            |                               |
| fle.s                   | F        | d,S,T            |                               |
| flh                     | Zfhmin   | D,A,s            |                               |
| flh                     | Zfhmin   | D,o(s)           |                               |
| flq                     | Q        | D,A,s            |                               |
| flq                     | Q        | D,o(s)           |                               |
| flt.d                   | D        | d,S,T            |                               |
| flt.h                   | Zfh      | d,S,T            |                               |
| flt.q                   | Q        | d,S,T            |                               |
| flt.s                   | F        | d,S,T            |                               |
| flw                     | F        | D,A,s            |                               |
| flw                     | F        | D,o(s)           |                               |
| fmadd.d fd,fs1,fs2, fs3 | D        | D,S,T,R          |                               |
| fmadd.d fd,fs1,fs2,     | D        | D,S,T,R,m        |                               |
| fs3,rm                  |          | D, D, I, II, III |                               |
| fmadd.h fd,fs1,fs2, fs3 | Zfh      | D,S,T,R          |                               |
| fmadd.h fd,fs1,fs2,     | Zfh      | D,S,T,R,m        |                               |
| fs3,rm                  | 2111     | D, D, I, II, III |                               |
| fmadd.q fd,fs1,fs2, fs3 | Q        | D,S,T,R          |                               |
| fmadd.q fd,fs1,fs2,     | Q        | D,S,T,R,m        |                               |
| fs3,rm                  | 4        | υ,υ,1,16,III     |                               |
| fmadd.s fd,fs1,fs2, fs3 | F        | D,S,T,R          |                               |
| fmadd.s fd,fs1,fs2, fs3 | F        | D,S,T,R,m        |                               |
| fmax.d fd,fs1,fs2       | D        | D,S,T,K,M        | $fd \leftarrow fs1\lceil rs2$ |
| fmax.h fd,fs1,fs2       | Zfh      | D,S,T            | J.W. 101   102                |
| fmax.q fd,fs1,fs2       | Q        | D,S,T            |                               |
| fmax.s fd,fs1,fs2       | F        | D,S,T            |                               |
| fmin.d fd,fs1,fs2       | D        | D,S,T            |                               |
| fmin.h fd,fs1,fs2       | Zfh      | D,S,T            |                               |
| fmin.q fd,fs1,fs2       |          | D,S,T            |                               |
| fmin.s fd,fs1,fs2       | Q<br>F   | D,S,T            |                               |
| fmsub.d fd,fs1,fs2, fs3 | D        | D,S,T,R          |                               |
| fmsub.d fd,fs1,fs2, fs3 | D        | D,S,T,R,m        |                               |
| fs3,rm                  | -        | ν,ν,1,16,III     |                               |
| 150,1111                |          |                  |                               |

| Instruction name               | Extens.    | Abstract   | Very short  |
|--------------------------------|------------|------------|-------------|
| and syntax                     | required   | parameters | description |
| fmsub.h fd,fs1,fs2, fs3        | Zfh        | D,S,T,R    |             |
| fmsub.h fd,fs1,fs2,            | Zfh        | D,S,T,R,m  |             |
| fs3,rm                         |            |            |             |
| fmsub.q fd,fs1,fs2, fs3        | Q          | D,S,T,R    |             |
| fmsub.q fd,fs1,fs2,            | Q          | D,S,T,R,m  |             |
| fs3,rm                         | _          |            |             |
| fmsub.s fd,fs1,fs2, fs3        | F          | D,S,T,R    |             |
| fmsub.s fd,fs1,fs2,            | F          | D,S,T,R,m  |             |
| fs3,rm<br>fmul.d fd,fs1,fs2    | D          | D,S,T      |             |
| fmul.d fd,fs1,fs2              | D          | D,S,T,m    |             |
| fmul.h fd,fs1,fs2              | Zfh        | D,S,T      |             |
| fmul.h fd,fs1,fs2              | Zfh        | D,S,T,m    |             |
| fmul.q fd,fs1,fs2              | Q          | D,S,T      |             |
| fmul.q fd,fs1,fs2              | Q          | D,S,T,m    |             |
| fmul.s fd,fs1,fs2              | F          | D,S,T      |             |
| fmul.s fd,fs1,fs2              | F          | D,S,T,m    |             |
| fmv.d fd,fs1                   | D          | D,U        |             |
| fmv.d.x fd,fs1                 | D          | D,s        |             |
| fmv.h fd,fs1                   | Zfh        | D,U        |             |
| fmv.h.x fd,fs1                 | Zfhmin     | D,s        |             |
| fmv.q                          | Q          | D,U        |             |
| fmv.s                          | F          | D,U        |             |
| fmv.s.x                        | F          | D,s        |             |
| fmv.w.x                        | F          | D,s        |             |
| fmv.x.d                        | D          | d,S        |             |
| fmv.x.h                        | Zfhmin     | d,S        |             |
| fmv.x.s                        | F          | d,S        |             |
| fmv.x.w                        | F          | d,S        |             |
| fneg.d                         | D          | D,U        |             |
| fneg.h                         | Zfh        | D,U        |             |
| fneg.q                         | Q          | D,U        |             |
| fneg.s                         | F          | D,U        |             |
| fnmadd.d fd,fs1,fs2, fs3       | D          | D,S,T,R    |             |
| fnmadd.d fd,fs1,fs2,           | D          | D,S,T,R,m  |             |
| fs3,rm                         | 7.0        | D G        |             |
| fnmadd.h fd,fs1,fs2, fs3       | Zfh<br>Zfh | D,S,T,R    |             |
| fnmadd.h fd,fs1,fs2,<br>fs3,rm | Zfh        | D,S,T,R,m  |             |
| fnmadd.q fd,fs1,fs2, fs3       | Q          | D,S,T,R    |             |
| fnmadd.q fd,fs1,fs2,           | Q          | D,S,T,R,m  |             |
| fs3,rm                         | ,          | ,, ,,      |             |
| fnmadd.s fd,fs1,fs2, fs3       | F          | D,S,T,R    |             |
| fnmadd.s fd,fs1,fs2,           | F          | D,S,T,R,m  |             |
| fs3,rm                         |            |            |             |
| fnmsub.d fd,fs1,fs2, fs3       | D          | D,S,T,R    |             |
| fnmsub.d fd,fs1,fs2,           | D          | D,S,T,R,m  |             |
| fs3,rm                         |            |            |             |
| fnmsub.h fd,fs1,fs2, fs3       | Zfh        | D,S,T,R    |             |
| fnmsub.h fd,fs1,fs2,           | Zfh        | D,S,T,R,m  |             |
| fs3,rm                         |            |            |             |
| fnmsub.q fd,fs1,fs2, fs3       | Q          | D,S,T,R    |             |

| Instruction name         | Extens.  | Abstract   | Very short  |
|--------------------------|----------|------------|-------------|
| and syntax               | required | parameters | description |
| •                        |          | _          | description |
| fnmsub.q fd,fs1,fs2,     | Q        | D,S,T,R,m  |             |
| fs3,rm                   | -        | D G E D    |             |
| fnmsub.s fd,fs1,fs2, fs3 | F        | D,S,T,R    |             |
| fnmsub.s fd,fs1,fs2,     | F        | D,S,T,R,m  |             |
| fs3,rm                   | P        | 3          |             |
| frcsr rd                 | F        | d          |             |
| frflags rd<br>frrm rd    | F        | d          |             |
|                          | F        | d          |             |
| frsr rd                  | F        | d          |             |
| fscsr rd,rs              | F        | d,s        |             |
| fscsr rd                 | F        | S          |             |
| fsd fs2,imm12(rs1)       | D        | T,A,s      |             |
| fsd                      | D        | T,q(s)     |             |
| fsd                      | D&C      | CD,C1(Cs)  |             |
| fsd                      | D&C      | CT,CN(Cc)  |             |
| fsflags                  | F        | d,s        |             |
| fsflags                  | F        | S          |             |
| fsflagsi                 | F        | Z          |             |
| fsflagsi                 | F        | d,Z        |             |
| fsgnj.d                  | D        | D,S,T      |             |
| fsgnj.h                  | Zfh      | D,S,T      |             |
| fsgnj.q                  | Q        | D,S,T      |             |
| fsgnj.s                  | F        | D,S,T      |             |
| fsgnjn.d                 | D        | D,S,T      |             |
| fsgnjn.h                 | Zfh      | D,S,T      |             |
| fsgnjn.q                 | Q        | D,S,T      |             |
| fsgnjn.s                 | F        | D,S,T      |             |
| fsgnjx.d                 | D        | D,S,T      |             |
| fsgnjx.h                 | Zfh      | D,S,T      |             |
| fsgnjx.q                 | Q        | D,S,T      |             |
| fsgnjx.s                 | F        | D,S,T      |             |
| fsh                      | Zfhmin   | T,A,s      |             |
| fsh                      | Zfhmin   | T,q(s)     |             |
| fsq                      | Q        | T,A,s      |             |
| fsq                      | Q        | T,q(s)     |             |
| fsqrt.d                  | D        | D,S        |             |
| fsqrt.d                  | D        | D,S,m      |             |
| fsqrt.h                  | Zfh      | D,S        |             |
| fsqrt.h                  | Zfh      | D,S,m      |             |
| fsqrt.q                  | Q        | D,S        |             |
| fsqrt.q                  | Q        | D,S,m      |             |
| fsqrt.s                  | F        | D,S        |             |
| fsqrt.s                  | F        | D,S,m      |             |
| fsrm                     | F        | d,s        |             |
| fsrm                     | F        | s          |             |
| fsrmi                    | F        | Z          |             |
| fsrmi                    | F        | d,Z        |             |
| fssr                     | F        | d,s        |             |
| fssr                     | F        | s          |             |
| fsub.d frd,frs1,frs2     | D        | D,S,T      |             |
| fsub.d frd,frs1,frs2     | D        | D,S,T,m    |             |
| fsub.hi frd,frs1,frs2    | Zfh      | D,S,T      |             |
| fsub.h frd,frs1,frs2     | Zfh      | D,S,T,m    |             |
|                          |          |            |             |

| Instruction name                             | Extens.     | Abstract         | Very short                                              |
|----------------------------------------------|-------------|------------------|---------------------------------------------------------|
| and syntax                                   | required    | parameters       | description                                             |
| -                                            | <del></del> | -                | description                                             |
| fsub.q frd,frs1,frs2                         | Q           | D,S,T            |                                                         |
| fsub.q frd,frs1,frs2                         | Q<br>F      | D,S,T,m          |                                                         |
| fsub.s frd,frs1,frs2<br>fsub.s frd,frs1,frs2 | F           | D,S,T<br>D,S,T,m |                                                         |
| fsw fsw                                      | F           |                  |                                                         |
| fsw                                          | F           | T,A,s            |                                                         |
|                                              |             | T,q(s)           |                                                         |
| hfence.gvma                                  | H           |                  |                                                         |
| hfence.gvma                                  |             | S                |                                                         |
| hfence.gvma<br>hfence.vvma                   | H           | s,t<br>          |                                                         |
| hfence.vvma                                  | Н           |                  |                                                         |
|                                              |             | S                |                                                         |
| hfence.vvma                                  | H           | s,t              |                                                         |
| hinval.gvma                                  | svinval     | s,t              |                                                         |
| hinval.vvma                                  | svinval     | s,t              | 2/ 1 / 1 [0. 7]                                         |
| hlv.b rd,0(rs1)                              | H           | d,0(s)           | $\mathcal{H} rd \leftarrow \pm (rs1)[07]$               |
| hlv.bu rd,0(rs1)                             | H           | d,0(s)           | $\mathcal{H} \ rd \leftarrow_u (rs1)[07]$ $\mathcal{H}$ |
| hlv.d                                        | H           | d,0(s)           | · ·                                                     |
| hlv.h rd,0(rs1)                              | H           | d,0(s)           | $\mathcal{H} rd \leftarrow (rs1)[015]$                  |
| hlv.hu rd,0(rs1)                             | H           | d,0(s)           | $\mathcal{H} rd \leftarrow_u (rs1)[015]$                |
| hlv.w rd,0(rs1)                              | H           | d,0(s)           | $\mathcal{H} rd \leftarrow (rs1)[031]$                  |
| hlv.wu rd,0(rs1)                             | H           | d,0(s)           | $\mathcal{H} \ rd \leftarrow_u (rs1)[031]$              |
| hlvx.hu                                      | H           | d,0(s)           |                                                         |
| hlvx.wu                                      | H           | d,0(s)           |                                                         |
| hret                                         | I           |                  | 44 [ 9] . 1[9 2]                                        |
| hsv.b rs1,0(rs2)                             | H           | t,0(s)           | $\mathcal{H} \ mem[rs2] \leftarrow rs1[07]$             |
| hsv.d rs1,0(rs2)                             | H           | t,0(s)           | H [ 0] 1[0.15]                                          |
| hsv.h rs1,0(rs2)                             | H           | t,0(s)           | $\mathcal{H} \ mem[rs2] \leftarrow rs1[015]$            |
| hsv.w rs1,0(rs2)                             | H           | t,0(s)           | $\mathcal{H} \ mem[rs2] \leftarrow rs1[031]$            |
| j                                            | C           | Ca               |                                                         |
| j                                            | I           | a                |                                                         |
| jal                                          | I           | a                |                                                         |
| jal                                          | I           | d,a              |                                                         |
| jalr                                         | C           | d                |                                                         |
| jalr                                         | I           | d,o(s)           |                                                         |
| jalr                                         | I           | d,s              |                                                         |
| jalr                                         | I           | d,s,j            |                                                         |
| jalr                                         | I           | o(s)             |                                                         |
| jalr                                         | I           | S                |                                                         |
| jalr                                         | I           | s,j              |                                                         |
| jr                                           | C           | d                |                                                         |
| jr                                           | I           | o(s)             |                                                         |
| jr                                           | I           | S                |                                                         |
| jr                                           | I           | s,j              |                                                         |
| jump                                         | I           | c,s              |                                                         |
| la                                           | I           | d,B              |                                                         |
| la.tls.gd                                    | I           | d,A              | 7 1 0 7 7                                               |
| la.tls.ie rd,symbol                          | I           | d,A              | $\nabla rd \leftarrow \&symbol$                         |
| lb rd,lab                                    | I           | d,A              | $\nabla rd \leftarrow mem[lab][07]$                     |
| lb rd,imm12(rs1)                             | I           | d,o(s)           | $rd \leftarrow mem[rs1 + imm12][07]$                    |
| 1bu                                          | I           | d,A              |                                                         |
| 1bu                                          | I           | d,o(s)           |                                                         |
| 1d                                           | C           | Ct,Cl(Cs)        |                                                         |
| ld                                           | C           | d,Cn(Cc)         |                                                         |

| Instruction name               | Extens.  | Abstract         | Very short                                                                       |
|--------------------------------|----------|------------------|----------------------------------------------------------------------------------|
| and syntax                     | required | parameters       | description                                                                      |
| ld                             | I        | d,A              |                                                                                  |
| ld                             | I        | d,o(s)           |                                                                                  |
| lh rd,lab                      | I        | d,A              | $\nabla rd \leftarrow mem[lab]_{015}$                                            |
| lh rd,im12(rs1)                | I        | d,o(s)           | $rd \leftarrow mem[tao]_{015}$ $rd \leftarrow \pm mem[rs1 + im12]_{015}$         |
| lhu rd,im12(rs1)               | I        | ·                | $rd \leftarrow \pm mem[rs1 + im12]_{015}$ $rd \leftarrow +mem[rs1 + im12]_{015}$ |
| lhu                            | I        | d,A<br>d,o(s)    | $ra \leftarrow +mem[rs1 + tm12]015$                                              |
| li                             | C        | d,Co             |                                                                                  |
| li                             | C        | d,Cv             |                                                                                  |
| li                             | I        | d,I              |                                                                                  |
| li                             | I        |                  |                                                                                  |
| lla                            | I        | d,j<br>d,B       |                                                                                  |
| lr.d                           | A        | d,0(s)           |                                                                                  |
|                                |          | d,0(s)           |                                                                                  |
| lr.d.aq<br>lr.d.aqrl           | A        | •                |                                                                                  |
| lr.d.aqri                      |          | d,0(s)           |                                                                                  |
|                                | A        | d,0(s)           |                                                                                  |
| lr.w                           | A        | d,0(s)<br>d,0(s) |                                                                                  |
| lr.w.aq                        |          |                  |                                                                                  |
| lr.w.aqrl<br>lr.w.rl           | A        | d,0(s)<br>d,0(s) |                                                                                  |
|                                | C        |                  |                                                                                  |
| lui                            |          | d,Cu             |                                                                                  |
|                                | I        | d,u              |                                                                                  |
| lw                             | C        | Ct,Ck(Cs)        |                                                                                  |
|                                | C        | d,Cm(Cc)         | 7 md / 1 m am [1 a k]                                                            |
| lw rd,lab                      | I        | d,A              | $\nabla rd \leftarrow \pm mem[lab]_{032}$                                        |
| lw rd,im12(rs1)                | I        | d,o(s)           | $rd \leftarrow \pm mem[rs1 + im12]_{032}$                                        |
| lwu rd,lab<br>lwu rd,im12(rs1) | I        | d,A              |                                                                                  |
|                                | Zbb      | d,o(s)           | $rd \leftarrow mem[rs1 + im12]_{032}$ $rd \leftarrow (rs1 < rs2) ? rs2 : rs1$    |
| max rd,rs1,rs2                 |          | d,s,t            |                                                                                  |
| maxu rd,r1,r2                  | Zbb      | d,s,t            | $rd \leftarrow (r1 <_u r2) ? r2 : r1$ $rd \leftarrow (r1 <_u r2) ? r1 : r2$      |
| min rd,r1,r2                   | Zbb      | d,s,t            |                                                                                  |
| minu rd,r1,r2                  | Zbb      | d,s,t            | $rd \leftarrow (r1 <_u r2) ? r1 : r2$                                            |
| move rd,rs1                    | C        | d,CV             | $\nabla rd \leftarrow rs1$                                                       |
| move rd,rs1                    | I        | d,s<br>          | $\nabla rd \leftarrow rs1$                                                       |
| mret                           |          |                  | Return from exception                                                            |
| mul rd,rs1,rs2                 | Zmmul    | d,s,t            | $rd \leftarrow rs1 \times rs2$                                                   |
| mulh rd,rs1,rs2                | Zmmul    | d,s,t            | $rd \leftarrow (rs1 \times rs2)_{63127}$                                         |
| mulhsu rd,rs1,rs2u             | Zmmul    | d,s,t            | Signed rs1 × unsigned                                                            |
|                                |          |                  | rs2u                                                                             |
| mulhu                          | Zmmul    | de+              | $rd \leftarrow (rs1 \times rs2u)_{63127}$                                        |
|                                | Zmmul    | d,s,t            |                                                                                  |
| mulw                           |          | d,s,t            |                                                                                  |
| mv                             | C        | d,CV             |                                                                                  |
| mv                             | I        | d,s              |                                                                                  |
| neg                            | I        | d,t              |                                                                                  |
| negw                           | I        | d,t<br>          |                                                                                  |
| nop                            | C        |                  |                                                                                  |
| nop                            |          |                  |                                                                                  |
| not                            | I        | d,s              |                                                                                  |
| or                             | C        | Cs,Ct,Cw         |                                                                                  |
| or                             | C        | Cs,Cw,Ct         |                                                                                  |
| or                             | I        | d,s,j            |                                                                                  |
| or                             | I        | d,s,t            |                                                                                  |
| orc.b                          | Zbb      | d,s              |                                                                                  |

| Amount                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | Instruction name                      | Evtona  | Abatract | Vory short                            |
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------|---------|----------|---------------------------------------|
| ori         I         d,s,j           orn         Zbb   Zbkb         d,s,t           Zbkb   Zbkb         d,s,t           pack rd,rs1,rs2         Zbkb         d,s,t           pack rd,rs1,rs2         Zbkb         d,s,t           pack rd,rs1,rs2         Zbkb         d,s,t           pause         Zichop         Wif(s)           prefetch.i imm12(rs1)         Zicbop         Wif(s)           prefetch.v imm12(rs1)         Zicbop         Wif(s)           prefetch.w imm12(rs1)         I         d           redycle         I         d           remail         I         d <tr< td=""><td></td><td>Extens.</td><td>Abstract</td><td>Very short</td></tr<> |                                       | Extens. | Abstract | Very short                            |
| Orn                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | ·                                     | •       |          | description                           |
| Zbkb                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |                                       |         |          |                                       |
| Pack rd,rs1,rs2                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | orn                                   | · .     | a,s,t    |                                       |
| packh rd,rs1,rs2                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | nook nd na1 na0                       |         | d a +    |                                       |
| Packw rd,rs1,rs2   Zbkb   d,s,t                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |                                       |         |          |                                       |
| Pause   Zichinpause   Prefetch.imm12(rs1)   Zichop   Wif(s)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |                                       |         |          |                                       |
| Prefetch.i imm12(rs1)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | -                                     |         |          |                                       |
| prefetch.r imm12(rs1)         Zicbop         Wif(s)           prefetch.w imm12(rs1)         Zicbop         Wif(s)           rdcycle         I         d           rdcycleh         I         d           rdinstret         I         d           rdinstreth         I         d           retmude         M         d,s,t           remm         M         d,s,t           remm         M         d,s,t           remw         M         d,s,t           rew         C            ret         C            ret         C            ret         I         d,s           Zbbl         d,s,t           Zbbl         d,s,t           Zbbl         d,s,t           Zbbl         d,s,t           Zbbl         d,s,           Zbbl         d,s,           Zbkb                                                                                                                                                                                                                     |                                       |         |          |                                       |
| prefetch.w imm12(rs1)         Zicbop rdcycle         Wif(s)           rdcycleh         I         d           rdcycleh         I         d           rddinstreth         I         d           rdinstreth         I         d           rdtimeh         I         d           rem         M         d,s,t           rem         M         d,s,t           remu         M         d,s,t           remw         M         d,s,t           remw         M         d,s,t           ret         C            ret         I            rev8         Zbb l         d,s           Zbkb         d,s,t            rev8         Zbb l         d,s,t           Zbkb         d,s,t            rol         Zbb l         d,s,t           Zbkb         d,s,>            ror         Zbb l         d,s,>           Zbkb         d,s,<                                                                                                                                                                                                                                                                                                                                                                      |                                       | -       |          |                                       |
| rdcycle                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |                                       | -       |          |                                       |
| rdcycleh   I   d   d   d   d   d   d   d   d   d                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |                                       |         |          |                                       |
| rdinstreth         I         d           rdinstreth         I         d           rdtime         I         d           rdtimeh         I         d           remu         M         d,s,t           ret         C            ret         C            ret         I            rev8         Zbb I         d,s           Zbkb         d,s,t            row         Zbk I         d,s,t           Zbkb         d,s,t            Zbkb         d,s,t            Zbkb         d,s,            ror         Zbb I         d,s,            Zbkb         d,s,            rori         Zbb I         d,s,                                                                                                                                                                                                                                                                                                          |                                       |         |          |                                       |
| rdinstreth                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | · · · · · · · · · · · · · · · · · · · |         |          |                                       |
| rdtime         I         d           rettimeh         I         d           rem         M         d,s,t           remuw         M         d,s,t           remw         M         d,s,t           ret         C            ret         I            rev8         Zbb   d,s            Zbkb         Zbb   d,s            rev8         Zbb   d,s            rev8         Zbb   d,s            rev8         Zbb   d,s            rev8         Zbb   d,s            row         Zbb   d,s,t            Zbkb         d,s,t            row         Zbb   d,s,t            Zbkb                                                                                                                                                                                                                                                                                                                                                                                                 |                                       |         |          |                                       |
| rem         M         d,s,t           remu         M         d,s,t           remuw         M         d,s,t           remw         M         d,s,t           ret         C            ret         I            rev8         Zbb   d,s         Zbb   d,s           rev8         Zbb   d,s,t         Zbb   d,s,t           rol         Zbb   d,s,t         Zbb   d,s,t           rolw         Zbb   d,s,t         Zbb   d,s,t           ror         Zbb   d,s,t         Zbb   d,s,t           zbb   d,s,t         Zbb   d,s,t         Zbb   d,s,t                                                              |                                       |         |          |                                       |
| rem         M         d,s,t           remu         M         d,s,t           remw         M         d,s,t           ret         C            ret         I            rev8         Zbb   d,s         Zbb   d,s           rev8         Zbb   d,s,t         Zbb   d,s,t           rol         Zbb   d,s,t         Zbb   d,s,t           rolw         Zbb   d,s,t         Zbb   d,s,t           ror         Zbb   d,s,t         Zbb   d,s,t           zbkb         d,s,t         Zbb   d,s,t           rori         Zbb   d,s,t         Zbkb   d,s,t           rorw         Zbb   d,s,t         Zbkb   d,s,t           rory         Zbb   d,s,t         Zbkb   d,s,t           rorw         Zbb   d,s,                                                                               |                                       |         |          |                                       |
| remu         M         d,s,t           remw         M         d,s,t           ret         C            ret         I            rev8         Zbb   Zbkb         d,s           rev8         Zbb   Zbkb         d,s,t           rol         Zbb   Zbkb         d,s,t           rol         Zbb   Zbkb         d,s,t           ror         Zbb   Zbkb         d,s,t           ror         Zbb   Zbkb         d,s,t           ror         Zbb   Zbkb         d,s,t           rori         Zbb   Zbkb         d,s,           roriw         Zbb   Zbkb         d,s,           rorw         Zbb   Zbkb         d,s,           rorw         Zbb   Zbkb         d,s,           rorw         Zbb   Zbkb         d,s,           rorw         Zbb   Zbkb         d,s,           sb rs1,var,rs2         I         t,A,s         V auipc + sb mem[var]o, 7 ← rs1o, 7           sb rd, imm12(rs1)         I         t,q(s)         rd ← mem[rs1 + imm12]o, 7           sbreak         I          sbreak           sc.d.aq         A         d,t,O(s)         column           sc.d.aq<                                                                                                                   |                                       |         |          |                                       |
| remuw         M         d,s,t           ret         C            ret         I            rev8         Zbb   d,s         Zbkb           rev8         Zbk   d,s         Zbkb           rol         Zbk   d,s,t         Zbkb           rol         Zbb   d,s,t         Zbkb           ror         Zbb   d,s,t         Zbkb           ror         Zbb   d,s,t         Zbkb           rori         Zbb   d,s,t         Zbkb           rori         Zbb   d,s,t         Zbkb           rorw         Zbb   d,s,<                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |                                       |         |          |                                       |
| remw         M         d,s,t           ret         I            rev8         Zbb   d,s         Zbkb           rev8         Zbb   d,s         Zbkb           rol         Zbb   d,s,t         Zbkb           rolw         Zbb   d,s,t         Zbkb           ror         Zbb   d,s,>         Zbkb           ror         Zbb   d,s,>         Zbkb           rori         Zbb   d,s,<                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |                                       |         |          |                                       |
| ret         C            rev8         Zbb   d,s           rev8         Zbb   d,s           rev8         Zbb   d,s           rol         Zbb   d,s,t           Zbkb         d,s,t           rolw         Zbb   d,s,t           Zbkb         d,s,t           ror         Zbb   d,s,t           Zbkb         d,s,t           rori         Zbb   d,s,t           Zbkb         d,s,           roriw         Zbb   d,s,           Zbkb         d,s,           rorw         Zbb   d,s,           Zbkb         rorw           zbrank         d,s,           zbkb         rorw           zbrank         c           rorw         zbrank           c         rorw <td></td> <td></td> <td></td> <td></td>                                                                                                                                                                     |                                       |         |          |                                       |
| ret         I            rev8         Zbb   Zbkb         d,s           rev8         Zbb   Zbkb         d,s           rol         Zbb   Zbkb         d,s,t           rolw         Zbb   Zbkb         d,s,t           ror         Zbb   Zbkb         d,s,>           ror         Zbb   Zbkb         d,s,>           rori         Zbb   Zbkb         d,s,           roriw         Zbb   Zbkb         d,s,           rorw         Zbb   Zbkb         d,s,           rorw         Zbb   Zbkb         d,s,           rorw         Zbb   Zbkb         d,s,           sb rs1,var,rs2         I         t,A,s         Vauipc + sb mem[var]o,7 ← rs1o,7           sb rd, imm12(rs1)         I         t,q(s)         rd ← mem[rs1 + imm12]o,7           sbreak         C          sc.d.aq         A         d,t,0(s)           sc.d.aq         A         d,t,0(s)         sc.d.aq         A         d,t,0(s)         sc.d.aq           sc.d.al         A         d,t,0(s)         sc.d.al         A         d,t,0(s)         sc.d.al                                                                                                                                                                |                                       |         |          |                                       |
| rev8       Zbb   Zbkb       d,s         rev8       Zbb   Zbkb       d,s         rol       Zbb   Zbkb       d,s,t         rolw       Zbb   Zbkb       d,s,t         ror       Zbb   Zbkb       d,s,*         ror       Zbb   Zbkb       d,s,*         rori       Zbb   Zbkb       d,s,*         roriw       Zbb   Zbkb       d,s,*         rorw       Zbb   Zbkb       d,s,*         rorw       Zbb   Zbkb       d,s,*         rorw       Zbb   Zbkb       d,s,*         rorw       Zbb   Zbkb       d,s,*         sb rs1,var,rs2       I       t,A,s       Vauipc + sb mem[var] <sub>0.7</sub> ← rs1 <sub>0.7</sub> sb rd, imm12(rs1)       I       t,q(s)       rd ← mem[rs1 + imm12] <sub>0.7</sub> sbreak       C        sc.d.aq       A       d,t,0(s)         sc.d.aq       A       d,t,0(s)       sc.d.aqrl       A       d,t,0(s)         sc.d.rl       A       d,t,0(s)       sc.w       sc.w                                                                                                                                                                                                                                                                                     |                                       |         |          |                                       |
| Tev8       Zbb   Zbb   Zbkb       d,s         rol       Zbb   Zbkb       d,s,t         rolw       Zbb   Zbkb       d,s,t         ror       Zbb   Zbkb       d,s,>         ror       Zbb   Zbkb       d,s,>         rori       Zbb   Zbkb       d,s,>         roriw       Zbb   Zbkb       d,s,         rorw       Zbb   Zbkb       d,s,         rorw       Zbb   Zbkb       d,s,t         rorw       Zbb   Zbkb       d,s,t         sb rs1,var,rs2       I       t,A,s       ∇ auipc + sb mem[var] <sub>07</sub> ← rs1 <sub>07</sub> sb rd, imm12(rs1)       I       t,q(s)       rd ← mem[rs1 + imm12] <sub>07</sub> sbreak       C          sbreak       I          sc.d. aq       A       d,t,0(s)         sc.d.aq       A       d,t,0(s)         sc.d.aq       A       d,t,0(s)         sc.d.rl       A       d,t,0(s)                                                                                                                                                                                                                                                                                                                                                                |                                       |         |          |                                       |
| rev8       Zbb   Zbkb       d,s,t         rol       Zbb   Zbkb       d,s,t         rolw       Zbb   Zbkb       d,s,t         ror       Zbb   Zbkb       d,s,>         ror       Zbb   Zbkb       d,s,>         rori       Zbb   Zbkb       d,s,<                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | rev8                                  | · .     | d,s      |                                       |
| Tol       Zbkb       d,s,t         Zbkb       d,s,t         Tolw       Zbb   d,s,t         Zbkb       d,s,>         Tor       Zbb   d,s,>         Zbkb       d,s,+         Tori       Zbb   d,s,         Zbkb       d,s,         Toriw       Zbb   d,s,         Zbkb       d,s,         Torw       Zbb   d,s,         Zbkb       d,s,t         Zbkb       Torw         Zbb   d,s,t       Torw         Zbb   d,s,t       Torw         Zbb   d,s,t       Torw         Zbkb       Torw         Sb rs1,var,rs2       I       t,A,s       ∇ auipc + sb         mem[var]_0.7 ← rs1_07       rsb rd, imm12(rs1)       I       t,q(s)       rd ← mem[rs1 + imm12]_07         sbreak       C        sc.d.aq       A       d,t,0(s)         sc.d.aq       A       d,t,0(s)       sc.d.aq         sc.d.aq       A       d,t,0(s)       sc.d.aq         sc.d.r1       A       d,t,0(s)       sc.d.r1         sc.w       A       d,t,0(s)                                                                                                                                                                                                                                                              |                                       |         | •        |                                       |
| rol       Zbb   Zbkb       d,s,t         rolw       Zbb   Zbkb       d,s,t         ror       Zbb   Zbkb       d,s,>         ror       Zbb   Zbkb       d,s, >         rori       Zbb   Zbkb       d,s, >         roriw       Zbb   Zbkb       d,s, <                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | rev8                                  | ·       | d,s      |                                       |
| Tolw                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |                                       |         | 1 .      |                                       |
| rolw         Zbb   Zbkb         d,s,t           ror         Zbb   d,s,> Zbkb         d,s,t           ror         Zbb   d,s,t         Zbkb           rori         Zbb   d,s,< Zbkb                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | rol                                   |         | d,s,t    |                                       |
| Zbkb       Zbkb         ror       Zbb   d,s,> Zbkb         ror       Zbb   d,s,t Zbkb         rori       Zbb   d,s,< Zbkb                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |                                       |         |          |                                       |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | rolw                                  |         | d,s,t    |                                       |
| Zbkb       Zbkb         ror       Zbkb       d,s,t         zbkb       d,s,>         zbkb       d,s,         roriw       Zbb   d,s,         zbkb       d,s,         rorw       Zbb   d,s,         zbkb       d,s,t         zbkb       sb rs1,var,rs2       I t,A,s       V auipc + sb mem[var] <sub>07</sub> ← rs1 <sub>07</sub> sb rd, imm12(rs1)       I t,q(s)       rd ← mem[rs1 + imm12] <sub>07</sub> sbreak       C       sbreak         sc.d       A d,t,0(s)         sc.d.aq       A d,t,0(s)         sc.d.aqrl       A d,t,0(s)         sc.d.rl       A d,t,0(s)         sc.w       A d,t,0(s)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |                                       |         | 1 - >    |                                       |
| ror       Zbb   Zbkb       d,s,t         rori       Zbb   d,s,       Zbkb         roriw       Zbb   d,s,       Zbkb         rorw       Zbb   d,s,       Zbkb         rorw       Zbb   d,s,t       Zbkb         rorw       Zbb   d,s,t       Zbkb         sb rs1,var,rs2       I       t,A,s       V auipc + sb mem[var] <sub>07</sub> ← rs1 <sub>07</sub> sb rd, imm12(rs1)       I       t,q(s)       rd ← mem[rs1 + imm12] <sub>07</sub> sbreak       C          sbreak       I          sc.d       A       d,t,0(s)         sc.d.aq       A       d,t,0(s)         sc.d.aqrl       A       d,t,0(s)         sc.d.rl       A       d,t,0(s)         sc.w       A       d,t,0(s)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | ror                                   |         | a,s,>    |                                       |
| Zbkb                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |                                       |         | 3 - 4    |                                       |
| rori       Zbb   d,s, > Zbkb         roriw       Zbb   d,s, < Zbkb                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        | ror                                   |         | a,s,t    |                                       |
| Zbkb                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |                                       |         | 3 - >    |                                       |
| roriw       Zbb   d,s,         rorw       Zbb   d,s,         zbkb       d,s,t         rorw       Zbb   d,s,t         zbkb       zbkb         sb rs1,var,rs2       I       t,A,s $\sqrt{\text{auipc + sb}}$ mem[var]_07 $\leftarrow rs1_{07}$ sb rd, imm12(rs1)       I       t,q(s) $rd \leftarrow mem[rs1 + imm12]_{07}$ sbreak       C          sc.d       A       d,t,0(s)         sc.d.aq       A       d,t,0(s)         sc.d.aqrl       A       d,t,0(s)         sc.d.rl       A       d,t,0(s)         sc.w       A       d,t,0(s)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | 1011                                  |         | α, ε, >  |                                       |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | rorit                                 |         | d a /    |                                       |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | TOLIW                                 |         | α, ε, <  |                                       |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | roru                                  |         | dec      |                                       |
| rorw       Zbb   Zbkb       d,s,t         sb rs1,var,rs2       I       t,A,s $\nabla$ auipc + sb $mem[var]_{07} \leftarrow rs1_{07}$ sb rd, imm12(rs1)       I       t,q(s) $rd \leftarrow mem[rs1 + imm12]_{07}$ sbreak       C          sbreak       I          sc.d       A       d,t,0(s)         sc.d.aq       A       d,t,0(s)         sc.d.aqrl       A       d,t,0(s)         sc.d.rl       A       d,t,0(s)         sc.w       A       d,t,0(s)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | TOTW                                  |         | u,s,\    |                                       |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | rory                                  |         | d s t    |                                       |
| $\begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | TOTM                                  |         | u,s,t    |                                       |
| $\begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | sh rel war rel                        |         | + A c    | ∇ auinc + sh                          |
| sbreak         C            sbreak         I            sc.d         A         d,t,0(s)           sc.d.aq         A         d,t,0(s)           sc.d.aqrl         A         d,t,0(s)           sc.d.rl         A         d,t,0(s)           sc.w         A         d,t,0(s)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |                                       | 1       |          | $mem[var]_{07} \leftarrow rs1_{07}$   |
| sbreak         I            sc.d         A         d,t,0(s)           sc.d.aq         A         d,t,0(s)           sc.d.aqrl         A         d,t,0(s)           sc.d.rl         A         d,t,0(s)           sc.w         A         d,t,0(s)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |                                       |         |          | $rd \leftarrow mem[rs1 + imm12]_{07}$ |
| sc.d     A     d,t,0(s)       sc.d.aq     A     d,t,0(s)       sc.d.aqrl     A     d,t,0(s)       sc.d.rl     A     d,t,0(s)       sc.w     A     d,t,0(s)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |                                       |         |          |                                       |
| sc.d.aq     A     d,t,0(s)       sc.d.aqrl     A     d,t,0(s)       sc.d.rl     A     d,t,0(s)       sc.w     A     d,t,0(s)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |                                       | I       |          |                                       |
| sc.d.aqrl       A       d,t,0(s)         sc.d.rl       A       d,t,0(s)         sc.w       A       d,t,0(s)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | sc.d                                  | A       |          |                                       |
| sc.d.rl         A         d,t,0(s)           sc.w         A         d,t,0(s)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              | sc.d.aq                               | A       |          |                                       |
| sc.w A d,t,0(s)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | sc.d.aqrl                             | A       |          |                                       |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |                                       | A       |          |                                       |
| sc.w.aq                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   | SC.W                                  | A       |          |                                       |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | sc.w.aq                               | A       | d,t,0(s) |                                       |

| Sc.w.aqr                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   | Instruction name  | Extens. | Abstract | Very short                   |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------|---------|----------|------------------------------|
| Sc.w.aqr1                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |                   |         |          | •                            |
| scall         I            sd         C         CV,CN(Cc)           sd         C         CV,CN(Cc)           sd         I         t,A,s           sd         I         t,Q(s)           sed         I         t,Q(s)           set         I         d,s           sext.b         I         d,s           sext.h         I         d,s           sext.w         C         d,CU           sext.w         I         d,s           sfence.inval.ir         svinval            sfence.wm         I         s           sfence.wm         I         s           sfence.wm         I         s,t           sfence.wma         I         s,t           sfence.wma         I         s,t           sfence.wma         I         s,t                                                                                                                                                                                                                                      | -                 | -       | -        | description                  |
| Scall                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | _                 |         |          |                              |
| sd         C         CV,CN(Cc)           sd         C         Ct,C1(Cs)           sd         I         t,A,s           sd         I         t,Q(s)           seq2         I         d,s           sext.b         I         d,s           sext.b         Zbb         d,s           sext.h         I         d,s           sext.w         C         d,CU           sext.w         I         d,s           sext.w         I         I         d,s           sext.e         I         I         d,s           sfence.uma         I         I         S                                                                                                                                                                                                            |                   |         |          |                              |
| sd         C         Ct,Cl(Cs)           sd         I         t,A,s           sd         I         t,q(s)           sed         I         t,q(s)           sext.b         I         d,s           sext.b         Zbb         d,s           sext.h         I         d,s           sext.w         C         d,CU           sext.w         I         d,s           sfence.wn         I            sfence.wm         I         s.           sfence.wm         I         s,t           sfence.wma         I         s,t           sfence.winval         svinval            sgte         I         d,t,s           sgt         I         d,t,s           sgt         I         d,t,s           sgt         I         d,t,s           sh         I         t,q(s)           sh                                                                                                                                                                                                                                |                   |         |          |                              |
| sd         I         t,A,s           sed         I         t,q(s)           sext.b         I         d,s           sext.b         Zbb         d,s           sext.h         I         d,s           sext.h         Zbb         d,s           sext.w         C         d,CU           sext.w         I         d,s           sext.w         I         d,s           sfence.inval.ir         svinval            sfence.vm         I            sfence.vm         I         s           sfence.vma         I         s           sfence.vma         I         s,t           sferce.vma         I         s,t           sferce.vma         I         s,t           sferce.vma         I         t,t,                                                                                                                                                                                               |                   |         |          |                              |
| sed         I         t,q(s)           sext.b         I         d,s           sext.b         Zbb         d,s           sext.h         I         d,s           sext.w         C         d,CU           sext.w         C         d,CU           sext.w         I         d,s           sfence.vma         I            sfence.vma         I         s,t           sfence.wna         I         s,t                                                                                                                                                                                                         |                   |         | · ·      |                              |
| sext.b         I         d,s           sext.b         I         d,s           sext.h         I         d,s           sext.h         Zbb         d,s           sext.w         C         d,CU           sext.w         I         d,s           sext.w         I         d,s           sext.w         I         d,s           sext.w         I         d,s           sfence.inval.ir         svinval            sfence.vm         I            sfence.vm         I            sfence.vma         I         s,t           sfence.wna         I         s,t           sfence.wnival          sfence.winval           sgt         I         d,t,s           sgtu         I         d,t,s           sgtu         I         d,t,s           sgtu         I         d,t           shrs1,symb,rs2         I         t,A,s           sh         I         t,q(s)           shladd.uw         Zba         d,s,t           shladd.uw         Zba         d,s,t           sh3add.uw         Zba <t< td=""><td></td><td></td><td></td><td></td></t<>                                                                                                                                                                      |                   |         |          |                              |
| Sext.b                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | sd                |         | _        |                              |
| sext.b         Zbb         d,s           sext.h         I         d,s           sext.w         C         d,CU           sext.w         I         d,s           sext.w         I         d,s           sext.w         I         d,s           sfence.inval.ir         svinval            sfence.vm         I         s           sfence.vma         I         s           sfence.vma         I         s,t           sfence.vma         I         s,t           sfence.w.inval         svinval            sfence.w.inval         svinval            sfence.w.inval         svinval            sfence.w.inval         I         s,t           sfence.w.inval         I         d,t,t                                                                                                                                                                                |                   |         |          |                              |
| sext.h         I         d,s           sext.h         Zbb         d,s           sext.w         I         d,s           sext.w         I         d,s           sfence.inval.ir         svinval            sfence.vm         I            sfence.vma         I            sfence.vma         I         s,t           sfence.vma         I         s,t           sfence.winval         I         d,t,s           sgt         I         d,t,s           sgt         I         d,t,s           <                                                                                                                                                                                      |                   |         |          |                              |
| sext.w         Zbb         d,s           sext.w         C         d,CU           sext.w         I         d,s           sfence.inval.ir         svinval            sfence.vm         I            sfence.vm         I         s           sfence.vma         I         s           sfence.winval         I         s,t           sgt         I         d,t,s           sgt         I         d,t,s                                                                                                                                                                                   |                   |         | -        |                              |
| sext.w         C         d,CU           sext.w         I         d,s           sfence.inval.ir         svinval            sfence.vm         I            sfence.vm         I         s           sfence.waa         I         s,t           sfence.winval         svinval            sgtnce.winval         svinval            sgtu         I         d,t,s           sgtu         I         d,t,s           sgtu         I         t,q(s)           shrs1,symb,rs2         I         t,A,s         ∨           sh         I         t,q(s)           sh1add         Zba         d,s,t         sh2ada           sh2add.uw         Zba         d,s,t         sh2ada           sh2add.uw         Zba         d,s,t         sh2ada           sh3add         Zba         d,s,t         sh256sig0 rd,rs1         Zknh         d,s           sh256sig0 rd,rs1         Zknh         d,s         sh3af2sig1 rd,rs1         Zknh         d,s           sha512sig1 rd,rs1,rs2         Zknh         d,s         sh3af12sig1 rd,rs1,rs2         Zknh         d,s           sha512s                                                                                                                                      | sext.h            | I       | d,s      |                              |
| sext.w         I         d,s           sfence.inval.ir         svinval            sfence.vm         I            sfence.vm         I         s           sfence.vma         I         s           sfence.vma         I         s,t           sfence.w.inval         svinval            sgt         I         d,t,s           sgtu         I         d,t,s           sgtz         I         d,t,s           sgtz         I         t,A,s           sh         I         t,q(s)           shladd         Zba         d,s,t           shladd.uw         Zba         d,s,t           sh2add.uw         Zba         d,s,t           sh3add.uw         Zba         d,s,t           sh3add.uw         Zba         d,s,t           sh3a256sig0 rd,rs1         Zknh         d,s           sh256sig1 rd,rs1         Zknh         d,s           sh256sig1 rd,rs1         Zknh         d,s           sh3512sig0 rd,rs1,rs2         Zknh         d,s           sh3512sig1 rd,rs1,rs2         Zknh         d,s           sh512sum0 rd,rs1         Zknh         <                                                                                                                                                         | sext.h            | Zbb     |          |                              |
| sfence.inval.ir         svinval            sfence.vm         I            sfence.vma         I            sfence.vma         I         s           sfence.vma         I         s,t           sfence.w.inval         svinval            sgt         I         d,t,s           sgtu         I         d,t,s           sgtz         I         d,t,s           shrs,symb,rs2         I         t,A,s           sh rs1,symb,rs2         I         t,A,s           sh rs1,symb,rs2         I         t,A,s           sh rs1,symb,rs2         I         t,A,s           sh rs1,symb,rs2         I         t,A,s           sh         I         t,q(s)           sh1add         Zba         d,s,t           sh1add.uw         Zba         d,s,t           sh2add         Zba         d,s,t           sh3add.uw         Zba         d,s,t           sh3add.uw         Zba         d,s,t           sh3a26sig0 rd,rs1         Zknh         d,s           sha256sig1 rd,rs1         Zknh         d,s           sha512sig0 rd,rs1,rs2         Zknh                                                                                                                                                                     | sext.w            | C       | d,CU     |                              |
| sfence.vm         I            sfence.vma         I         s           sfence.vma         I         s           sfence.vma         I         s           sfence.vma         I         s,t           sfence.v.inval         svinval            sgt         I         d,t,s           sgtu         I         d,t,s           sgtz         I         d,t           sh rs1,symb,rs2         I         t,A,s           sh         I         t,q(s)           sh1add         Zba         d,s,t           sh2add.uw         Zba         d,s,t           sh2add.uw         Zba         d,s,t           sh3add         Zba         d,s,t           sh3add.uw         Zba         d,s,t           sh3add.uw         Zba         d,s,t           sh256sig0 rd,rs1         Zknh         d,s           sh256sig1 rd,rs1         Zknh         d,s           sh256sun0 rd,rs1         Zknh         d,s           sh2512sig1 rd,rs1,rs2         Zknh         d,s           sh512sun0 rd,rs1         Zknh         d,s           sh512sum0 rd,rs1,rs2         Zknh <t< td=""><td>sext.w</td><td>I</td><td>d,s</td><td></td></t<>                                                                                            | sext.w            | I       | d,s      |                              |
| sfence.vma         I         s           sfence.vma         I            sfence.vma         I         s           sfence.w.inval         I         s,t           sfence.w.inval         svinval            sgt         I         d,t,s           sgtu         I         d,t,s           sgtz         I         d,t,s           sptz         I         t,A,s         ∇           sh         I         t,A,s         V           sh         I         t,A,s         V           sh         I         t,A,s         V           sh         I         t,A,s         V           sh         I         t,A,s         I           sh         I         t,A,s         I           sh         I         t,A,s         I           sh         I                                                                                                                                                                                                                           | sfence.inval.ir   | svinval |          |                              |
| sfence.vma         I            sfence.vma         I         s           sfence.w.inval         I         s,t           sfence.w.inval         svinval            sgt         I         d,t,s           sgtu         I         d,t,s           sgtz         I         d,t           sh rs1,symb,rs2         I         t,A,s         V           sh         I         t,q(s)         s           sh1add         Zba         d,s,t         sh1add.uw         Zba         d,s,t           sh2add         Zba         d,s,t         sh2add.uw         Zba         d,s,t           sh3add         Zba         d,s,t         sh3add.uw         Zba         d,s,t           sh256sig0 rd,rs1         Zknh         d,s         sh3a256sig0 rd,rs1         Zknh         d,s           sha256sum0 rd,rs1         Zknh         d,s         sh3512sig0 rd,rs1,rs2         Zknh         d,s           sha512sig0 rd,rs1,rs2         Zknh         d,s         sh3512sum0 rd,rs1,rs2         Zknh         d,s           sha512sum0 rd,rs1,rs2         Zknh         d,s         rd = ror64(rs1,34)         ror64(rs1,34)         ror64(rs1,34)         ror64(rs1,39)                                                       | sfence.vm         | I       |          |                              |
| Sfence.vma                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | sfence.vm         | I       | s        |                              |
| Sfence.vma                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | sfence.vma        | I       |          |                              |
| sfence.w.inval         svinval            sgt         I         d,t,s           sgtz         I         d,t           sh rs1,symb,rs2         I         t,A,s         ▼           sh rs1,symb,rs2         I         t,A,s         ▼           sh         I         t,Q(s)         tsh           sh1add         Zba         d,s,t         ssh           sh2add.uw         Zba         d,s,t         ssh2add.uw         Zba         d,s,t           sh3add.uw         Zba         d,s,t         ssh256sig0 rd,rs1         Zknh         d,s           sha256sig1 rd,rs1         Zknh         d,s         ssh256sig1 rd,rs1         Zknh         d,s           sha256sum0 rd,rs1         Zknh         d,s         ssh3512sig0 rd,rs1,rs2         Zknh         d,s           sha512sig0 rd,rs1,rs2         Zknh         d,s         ssh3512sig1 rd,rs1,rs2         Zknh         d,s           sha512sum0 rd,rs1         Zknh         d,s         rd = ror64(rs1,28)         ror64(rs1,34)         ror64(rs1,34)         ror64(rs1,34)         ror64(rs1,39);59         sinval.vma         svinval         s,t         st1 rd,rd, shamt         I         d,s,t         rd ← rd << shamt                          | sfence.vma        | I       | S        |                              |
| sgt       I       d,t,s         sgtz       I       d,t         sh rs1,symb,rs2       I       t,A,s       ∇         sh I       t,Q(s)       sh1add       Zba       d,s,t         sh1add uw       Zba       d,s,t       sh2add uw       Zba       d,s,t         sh2add uw       Zba       d,s,t       sh3add uw       Zba       d,s,t         sh3add uw       Zba       d,s,t       sh256sig0 rd,rs1       Zknh       d,s         sha256sig1 rd,rs1       Zknh       d,s       sh256sig1 rd,rs1       Zknh       d,s         sha256sum0 rd,rs1       Zknh       d,s       sh3512sig0 rd,rs1,rs2       Zknh       d,s         sha512sig0 rd,rs1,rs2       Zknh       d,s       sh3512sig1 rd,rs1,rs2       Zknh       d,s         sha512sum0 rd,rs1       Zknh       d,s       rd = ror64(rs1,34)       ror64(rs1,33)       ror64(rs1,39); 59         sinval.vma       svinval       s,t       rd ← rd << shamt         sll rd,rd, shamt       C       d,CU,C>       rd ← rd << shamt         sll rd,rs1,shamt       I       d,s,t       rd ← rd << shamt         sll rd,rs1,shamt       C       d,CU,C>       rd ← rd << shamt                                                                               | sfence.vma        | I       | s,t      |                              |
| sgtu       I       d,t,s         sgtz       I       d,t         sh rs1,symb,rs2       I       t,A,s       ▼         sh       I       t,q(s)       T         sh1add       Zba       d,s,t       S         sh1add.uw       Zba       d,s,t       S         sh2add.uw       Zba       d,s,t       S         sh3add.uw       Zba       d,s,t       S         sh3add.uw       Zba       d,s,t       S         sha256sig0 rd,rs1       Zknh       d,s       S         sha256sig1 rd,rs1       Zknh       d,s       S         sha256sum0 rd,rs1       Zknh       d,s       S         sha512sig0 rd,rs1,rs2       Zknh       d,s       S         sha512sig0 rd,rs1,rs2       Zknh       d,s       s         sha512sum0 rd,rs1       Zknh       d,s       rd = ror64(rs1, 28)         ror64(rs1, 34)       ror64(rs1, 34)       ror64(rs1, 39); 59         sinval.vma       svinval       s,t         sl1 rd,rd, shamt       C       d,SU,C>       rd ← rd << shamt         sl1 rd,rs1,shamt       I       d,s,t       rd ← rd << shamt         sl1 rd,rs1,shamt       C                                                                                                                                            | sfence.w.inval    | svinval |          |                              |
| sgtz       I       d,t         sh rs1,symb,rs2       I       t,A,s       ▼         sh s                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | sgt               | I       | d,t,s    |                              |
| sh rs1,symb,rs2       I       t,A,s       ▼         sh sh1add       Zba       d,s,t       sh1add.uw       Zba       d,s,t       sh2add.uw       Zba       d,s,t       sh2add.uw       Zba       d,s,t       sh3add.uw       Zbah       d,s       sh3add.uw       Sbah       sh3add.uw       d,s       sh3add.uw       Zbah       d,s       sh3add.uw       zbak       d,s       sh3add.uw       zbak       d,s       sh3add.uw       zbak       d,s       d,s | sgtu              | I       | d,t,s    |                              |
| sh       I       t,q(s)         sh1add       Zba       d,s,t         sh2add.uw       Zba       d,s,t         sh2add.uw       Zba       d,s,t         sh3add.uw       Zba       d,s,t         sha256sig0 rd,rs1       Zknh       d,s         sha256sig1 rd,rs1       Zknh       d,s         sha256sum0 rd,rs1       Zknh       d,s         sha512sig0 rd,rs1,rs2       Zknh       d,s         sha512sig1 rd,rs1,rs2       Zknh       d,s         sha512sum0 rd,rs1       Zknh       d,s         sha512sum0 rd,rs1       Zknh       d,s         sha512sum1 rd,rs1,rs2       Zknh       d,s,t         sha512sum1 rd,rs1,rs2       Zknh       d,s,t         sha512sum1 rd,rs1,rs2       Zknh       d,s,t         sha512sum1 rd,rs1,rs2       Zknh       d,s         sinval.vma       svinval       s,t         sll rd,rd, shamt       C       d,CU,C>       rd ← rd << shamt                                                                                                                                                                                                                                                                                                                                   | sgtz              | I       | d,t      |                              |
| sh1add         Zba         d,s,t           sh1add.uw         Zba         d,s,t           sh2add         Zba         d,s,t           sh2add.uw         Zba         d,s,t           sh3add.uw         Zba         d,s,t           sha256sig0 rd,rs1         Zknh         d,s           sha256sig1 rd,rs1         Zknh         d,s           sha256sum0 rd,rs1         Zknh         d,s           sha512sig0 rd,rs1,rs2         Zknh         d,s           sha512sig1 rd,rs1,rs2         Zknh         d,s           sha512sum0 rd,rs1         Zknh         d,s           sha512sum0 rd,rs1,rs2         Zknh         d,s,t           sha512sum1 rd,rs1,rs2         Zknh         d,s,t           sha512sum1 rd,rs1,rs2         Zknh         d,s           sha512sum1 rd,rs1,rs2         Zknh         d,s           sinval.vma         svinval         s,t           sl1 rd,rd, shamt         C         d,CU,C>         rd ← rd << shamt                                                                                                                                                                                                                                                                         | sh rs1,symb,rs2   | I       | t,A,s    | $\nabla$                     |
| sh1add         Zba         d,s,t           sh2add         Zba         d,s,t           sh2add         Zba         d,s,t           sh2add.uw         Zba         d,s,t           sh3add         Zba         d,s,t           sh3add.uw         Zba         d,s,t           sha256sig0 rd,rs1         Zknh         d,s           sha256sig1 rd,rs1         Zknh         d,s           sha256sum0 rd,rs1         Zknh         d,s           sha512sig0 rd,rs1,rs2         Zknh         d,s           sha512sig1 rd,rs1,rs2         Zknh         d,s           sha512sum0 rd,rs1         Zknh         d,s           sha512sum0 rd,rs1         Zknh         d,s,t           sha512sum1 rd,rs1,rs2         Zknh         d,s,t           sha512sum1 rd,rs1,rs2         Zknh         d,s,t           sha512sum1 rd,rs1,rs2         Zknh         d,s           sinval.vma         svinval         s,t           sl1 rd,rd, shamt         C         d,CU,C>         rd ← rd << shamt                                                                                                                                                                                                                                   | sh                | I       |          |                              |
| sh2add         Zba         d,s,t           sh2add.uw         Zba         d,s,t           sh3add         Zba         d,s,t           sh3add.uw         Zba         d,s,t           sha256sig0 rd,rs1         Zknh         d,s           sha256sig1 rd,rs1         Zknh         d,s           sha256sum0 rd,rs1         Zknh         d,s           sha512sig0 rd,rs1,rs2         Zknh         d,s           sha512sig1 rd,rs1,rs2         Zknh         d,s           sha512sum0 rd,rs1         Zknh         d,s           sha512sum0 rd,rs1,rs2         Zknh         d,s,t           sha512sum1 rd,rs1,rs2         Zknh         d,s           sha512sum1 rd,rs1,rs2         Zknh         d,s           sinval.vma         svinval         s,t           sl1 rd,rd, shamt         C         d,CU,C>         rd ← rd << shamt                                                                                                                                                                                                                                                                                                                                                                                  | sh1add            | Zba     | _        |                              |
| sh2add         Zba         d,s,t           sh2add.uw         Zba         d,s,t           sh3add         Zba         d,s,t           sh3add.uw         Zba         d,s,t           sha256sig0 rd,rs1         Zknh         d,s           sha256sig1 rd,rs1         Zknh         d,s           sha256sum0 rd,rs1         Zknh         d,s           sha512sig0 rd,rs1,rs2         Zknh         d,s           sha512sig1 rd,rs1,rs2         Zknh         d,s           sha512sum0 rd,rs1         Zknh         d,s           sha512sum0 rd,rs1,rs2         Zknh         d,s,t           sha512sum1 rd,rs1,rs2         Zknh         d,s,t           sha512sum1 rd,rs1,rs2         Zknh         d,s           sinval.vma         svinval         s,t           sl1 rd,rd, shamt         C         d,CU,C>         rd ← rd << shamt                                                                                                                                                                                                                                                                                                                                                                                | sh1add.uw         | Zba     | d,s,t    |                              |
| sh2add.uw         Zba         d,s,t           sh3add         Zba         d,s,t           sh3add.uw         Zba         d,s,t           sha256sig0 rd,rs1         Zknh         d,s           sha256sig1 rd,rs1         Zknh         d,s           sha256sum0 rd,rs1         Zknh         d,s           sha512sig0 rd,rs1,rs2         Zknh         d,s           sha512sig1 rd,rs1,rs2         Zknh         d,s           sha512sum0 rd,rs1         Zknh         d,s           sha512sum0 rd,rs1,rs2         Zknh         d,s           sha512sum1 rd,rs1,rs2         Zknh         d,s,t           sha512sum1 rd,rs1,rs2         Zknh         d,s,t           sinval.vma         svinval         s,t           sl1 rd,rd, shamt         C         d,CU,C>         rd ← rd << shamt                                                                                                                                                                                                                                                                                                                                                                                                                           | sh2add            | Zba     |          |                              |
| sh3add         Zba         d,s,t           sh3add.uw         Zba         d,s,t           sha256sig0 rd,rs1         Zknh         d,s           sha256sig1 rd,rs1         Zknh         d,s           sha256sum0 rd,rs1         Zknh         d,s           sha512sig0 rd,rs1,rs2         Zknh         d,s           sha512sig1 rd,rs1,rs2         Zknh         d,s           sha512sum0 rd,rs1         Zknh         d,s           sha512sum0 rd,rs1,rs2         Zknh         d,s,t           sha512sum1 rd,rs1,rs2         Zknh         d,s,t           sha512sum1 rd,rs1,rs2         Zknh         d,s,t           sinval.vma         svinval         s,t           sl1 rd,rd, shamt         C         d,CU,C>         rd ← rd << shamt                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | sh2add.uw         | Zba     |          |                              |
| $\begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | sh3add            | Zba     |          |                              |
| $\begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | sh3add.uw         | Zba     |          |                              |
| $\begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | sha256sig0 rd.rs1 | Zknh    |          |                              |
| $\begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |                   |         |          |                              |
| $\begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |                   | Zknh    | -        |                              |
| $\begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |                   |         |          |                              |
| $\begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | -                 |         |          |                              |
| $\begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |                   |         |          |                              |
| $\begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |                   |         |          |                              |
| $\begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |                   |         |          |                              |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |                   |         | _        | $rd = ror64(rs1, 28) \oplus$ |
| $\begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | 2                 |         | 2,2      | $ror64(rs1, 34)$ $\oplus$    |
| $\begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | sinval.vma        | svinval | s,t      |                              |
| $\begin{array}{cccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | sll rd,rd, shamt  | С       | d,CU,C>  | $rd \leftarrow rd << shamt$  |
| $\begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | sll rd,rs1,shamt  | I       |          | $rd \leftarrow rs1 << shamt$ |
| slli rd,rs1,shamt C d,CU,C> $rd \leftarrow rd << shamt$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |                   |         |          |                              |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |                   | C       |          |                              |
| SIII rd,rs,shamt   1   d,s,>   $rd \leftarrow rs << shamt$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | slli rd,rs,shamt  | I       | d,s,>    | $rd \leftarrow rs << shamt$  |
| slli.uw rd,rs1,shamt Zba d,s,> $rd \leftarrow (rs1_{031}) << shamt$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |                   |         |          |                              |
| slliw rd,rs1,shamt I d,s,< $rd \leftarrow rs1 << shamt$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |                   |         |          | 1                            |

 $<sup>^{59} \</sup>oplus = xor$ 

| Instruction name    | Extens.  | Abstract  | Very short                            |
|---------------------|----------|-----------|---------------------------------------|
| and syntax          | required |           | description                           |
|                     |          | _         | <u> </u>                              |
| sllw rd,rs1,im5     | I        | d,s,<     | $rd \leftarrow \pm rs1_{0-31} << im5$ |
| sllw                | I        | d,s,t     |                                       |
| slt                 | I        | d,s,j     |                                       |
| slt                 | I        | d,s,t     |                                       |
| slti                | I        | d,s,j     |                                       |
| sltiu               | I        | d,s,j     |                                       |
| sltu                | I        | d,s,j     |                                       |
| sltu                | I        | d,s,t     |                                       |
| sltz                | I        | d,s       |                                       |
| sm3p0 rd,rs1        | Zksh     | d,s       |                                       |
| sm3p1 rd,rs1,rs2,bs | Zksh     | d,s       |                                       |
| sm4ed               | Zksed    | d,s,t,y   |                                       |
| sm4ks rd,rs1,rs2,bs | Zksed    | d,s,t,y   |                                       |
| snez                | I        | d,t       |                                       |
| sra rd,rsrc1,rsrc2  | C        | Cs,Cw,C>  | $rd \leftarrow rsrc1 >> rsrc2$        |
| sra                 | I        | d,s,>     |                                       |
| sra                 | I        | d,s,t     |                                       |
| srai                | C        | Cs,Cw,C>  |                                       |
| srai                | I        | d,s,>     |                                       |
| sraiw               | I        | d,s,<     |                                       |
| sraw                | I        | d,s,<     |                                       |
| sraw                | I        | d,s,t     |                                       |
| sret                | I        |           |                                       |
| srl                 | C        | Cs,Cw,C>  |                                       |
| srl                 | I        | d,s,>     |                                       |
| srl                 | I        | d,s,t     |                                       |
| srli                | C        | Cs,Cw,C>  |                                       |
| srli                | I        | d,s,>     |                                       |
| srliw               | I        | d,s,<     |                                       |
| srlw                | I        | d,s,<     |                                       |
| srlw                | I        | d,s,t     |                                       |
| sub                 | C        | Cs,Cw,Ct  |                                       |
| sub                 | I        | d,s,t     |                                       |
| subw                | C        | Cs,Cw,Ct  |                                       |
| subw                | I        | d,s,t     |                                       |
| sw                  | C        | CV,CM(Cc) |                                       |
| SW                  | C        | Ct,Ck(Cs) |                                       |
| sw                  | I        | t,A,s     |                                       |
| sw                  | I        | t,q(s)    |                                       |
| tail                | I        | С         |                                       |
| th.addsl            | ba       | d,s,t,    |                                       |
|                     |          | Xu2@25    |                                       |
| th.dcache.call      | cmo      |           |                                       |
| th.dcache.ciall     | cmo      |           |                                       |
| th.dcache.cipa      | cmo      | S         |                                       |
| th.dcache.cisw      | cmo      | S         |                                       |
| th.dcache.civa      | cmo      | S         |                                       |
| th.dcache.cpa       | cmo      | S         |                                       |
| th.dcache.cpal1     | cmo      | s         |                                       |
| th.dcache.csw       | cmo      | s         |                                       |
| th.dcache.cva       | cmo      | S         |                                       |
| th.dcache.cval1     | cmo      | S         |                                       |
| th.dcache.iall      | cmo      |           |                                       |

| Instruction name         | Extens.  | Abstract              | Very short                                                   |
|--------------------------|----------|-----------------------|--------------------------------------------------------------|
| and syntax               | required | parameters            | description                                                  |
| th.dcache.ipa            | cmo      | s                     |                                                              |
| th.dcache.isw            | cmo      | s                     |                                                              |
| th.dcache.iva            | cmo      | s                     |                                                              |
| th.ext rd, rs1, im1,im2  | bb       | d,s,Xu6@26,           | $rd \leftarrow \pm rs1[im1im2]$                              |
|                          |          | Xu6@2a0               |                                                              |
| th.extu rd,rs1,im1,im2   | bb       | d,s,Xu6@26,<br>Xu6@20 | $rd \leftarrow rs1[im1im2]$                                  |
| th.ff0 rd,rs1            | bb       | d,s                   | rd←index first bit 0 in                                      |
| th.ff1 rd,rs1            | bb       | d,s                   | rd←index first bit 1 in                                      |
| th.flrd                  | fmemidx  | D,s,t,Xu2@25          | 151                                                          |
|                          | fmemidx  |                       |                                                              |
| th.flrw                  |          | D,s,t,Xu2025          |                                                              |
| th.flurd                 | fmemidx  | D,s,t,Xu2025          |                                                              |
| th.flurw                 | fmemidx  | D,s,t,Xu2025          |                                                              |
| th.fmv.hw.x              | fmv      | d,S                   |                                                              |
| th.fmv.x.hw              | fmv      | d,S                   |                                                              |
| th.fsrd fd, r1, r2, im2  | fmemidx  | D,s,t,Xu2@25          | $mem[r1+(r2 << im2)] \leftarrow fd$                          |
| th.fsrw fd, r1, r2, im2  | fmemidx  | D,s,t,Xu2@25          | $mem[r1 + (r2 << im2)]_{031}$<br>$\leftarrow fd_{031}$       |
| th.fsurd                 | fmemidx  | D,s,t,Xu2@25          |                                                              |
| th.fsurw                 | fmemidx  | D,s,t,Xu2025          |                                                              |
| th.icache.iall           | cmo      |                       |                                                              |
| th.icache.ialls          | cmo      |                       |                                                              |
| th.icache.ipa            | cmo      | s                     |                                                              |
| th.icache.iva            | cmo      | s                     |                                                              |
| th.ipop                  | int      |                       |                                                              |
| th.ipush                 | int      |                       |                                                              |
| th.12cache.call          | cmo      |                       |                                                              |
| th.12cache.ciall         | cmo      |                       |                                                              |
| th.12cache.iall          | cmo      |                       |                                                              |
| th.lbia rd, (rs1),       | memidx   | d,(s),                | $rd \leftarrow \pm (mem[rs1])$                               |
| im5,im2 Post-increment.  |          | Xs5@20,               | $rs1 \leftarrow rs1 \pm (im5 << im2)$                        |
|                          |          | Xu2@25                | $rd \neq rs1$                                                |
| th.lbib rd, (rs1),       | memidx   | d,(s),                | $rs1 \leftarrow rs1 \pm (im5 << im2)$                        |
| im5,im2 Pre-increment    |          | Xs5@20,               | $rd \leftarrow \pm (mem[rs1])$                               |
|                          |          | Xu2@25                | $rd \neq rs1$                                                |
| th.lbuia rd,(rs1)        | memidx   | d,(s),                | $rd \leftarrow (mem[rs1])$                                   |
| Post-increment           |          | Xs5@20,               | $rs1 \leftarrow rs1(im5 << im2)$                             |
|                          |          | Xu2@25                | $rd \neq rs1$                                                |
| th.lbuibrd, (rs1),       | memidx   | d,(s),                | $rs1 \leftarrow rs1 + (im5 << im2)$                          |
| im5,im2 Pre-increment    |          | Xs5@20,               | $rd \leftarrow +(mem[rs1])$                                  |
|                          |          | Xu2@25                | $rd \neq rs1$                                                |
| th.ldd d1,d2,(s3),im2    | mempair  | d,t,(s),              | $d1 \leftarrow mem[r3 + (im2 << 4)]$                         |
|                          |          | Xu2@25, X14           | $ d2 \leftarrow mem[r3+8+im2 << 4] $ $rd1 \neq rd2 \neq r3 $ |
| th.ldia rd,(rs1),im5,im2 | memidx   | d,(s),                | $rd \leftarrow mem[rs1]$                                     |
| Post-increment           |          | Xs5@20,               | $rs1 \leftarrow rs1 \pm (im5 << im2)$                        |
|                          |          | Xu2@25                | (                                                            |
| th.ldib rd,(r1),im5,im2  | memidx   | d,(s),                | $r1 \leftarrow r1 \pm (im5 << im2)$                          |
| Pre-increment            |          | Xs5@20,               | $rd \leftarrow mem[r1]$                                      |
|                          |          | Xu2@25                | ' '                                                          |
|                          |          |                       | <u>I</u>                                                     |

| Instruction name                        | Extens.  | Abstract                | Very short                                                            |
|-----------------------------------------|----------|-------------------------|-----------------------------------------------------------------------|
| and syntax                              | required | parameters              | description                                                           |
| -                                       | _        |                         | <u>-</u>                                                              |
| th.lhia rd,(rs1),im5,im2                | memidx   | d,(s),                  | $rd \leftarrow \pm mem[rs1]_{016}$                                    |
| Post-increment.                         |          | Xs5020,                 | $rs1 \leftarrow rs1 \pm (im5 << im2)$                                 |
| +h 1hihd (1) iF i0                      |          | Xu2@25                  | $rs1 \leftarrow rs1 \pm (im5 << im2)$                                 |
| th.lhib rd,(rs1),im5,im2 Pre-increment. | memidx   | a,(s),<br>Xs5@20,       | ,                                                                     |
| Pre-increment.                          |          | Xu2@25                  | $rd \leftarrow \pm mem[rs1]_{016}$                                    |
| th.lhuia                                | memidx   | d,(s),Xs5@20,           | $rd \leftarrow mem[rs1]_{016}$                                        |
| rd,(rs1),im5,im2                        | memiax   | u,(s),xs5@20,<br>Xu2@25 | $ra \leftarrow mem[rs1]_{016}$<br>$rs1 \leftarrow rs1 + (im5 << im2)$ |
| Post-increment                          |          | Au2@25                  | $731 \leftarrow 731 + (titto << titt2)$                               |
| th.lhuib rd,(r1),im5,im2                | memidx   | d,(s),Xs5@20,           | $r1 \leftarrow r1 + (im5 << im2)$                                     |
| Pre-increment                           | memiax   | Xu2@25                  | $rd \leftarrow +mem[r1]_{016}$                                        |
| th.lrb                                  | memidx   | d,s,t,Xu2@25            | / a \   mem[/ 1]016                                                   |
| th.lrbu                                 | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lrd                                  | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lrh                                  | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lrhu                                 | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lrw                                  | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lrwu                                 | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lurb                                 | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lurbu                                | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lurd                                 | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lurh                                 | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lurhu                                | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lurw                                 | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lurwu                                | memidx   | d,s,t,Xu2@25            |                                                                       |
| th.lwd rd,imm7(rs1)                     | mempair  | d,t,(s),                | $rd \leftarrow \pm mem[rs1 \pm imm7]_{031}$                           |
| on.iwa ia,imm/(ibi)                     | шешратт  | Xu2@25, X13             | $rd + 1 \leftarrow \pm mem[rs1 \pm tmmr]_{031}$                       |
|                                         |          | nuzuzo, nio             | $[imm7]_{3263}$                                                       |
| th.lwia                                 | memidx   | d,(s),                  |                                                                       |
|                                         |          | Xs5@20,                 |                                                                       |
|                                         |          | Xu2@25                  |                                                                       |
| th.lwib                                 | memidx   | d,(s),                  |                                                                       |
|                                         |          | Xs5@20                  |                                                                       |
|                                         |          | Xu2@25                  |                                                                       |
| th.lwud                                 | mempair  | d,t,(s),Xu2@25,         |                                                                       |
|                                         | _        | X13                     |                                                                       |
| th.lwuia                                | memidx   | d,(s),Xs5@20,           |                                                                       |
|                                         |          |                         |                                                                       |
|                                         |          | Xu2@25                  |                                                                       |
| th.lwuib                                | memidx   | d,(s),Xs5@20,           |                                                                       |
|                                         |          |                         |                                                                       |
|                                         |          | Xu2@25                  |                                                                       |
| th.mula                                 | mac      | d,s,t                   |                                                                       |
| th.mulah                                | mac      | d,s,t                   |                                                                       |
| th.mulaw                                | mac      | d,s,t                   |                                                                       |
| th.muls rd,r1,r2                        | mac      | d,s,t                   | $rd_{[015]} \leftarrow rd_{-}(r1 \times r2)$                          |
| th.mulsh rd,r1,r2                       | mac      | d,s,t                   | $rd \leftarrow \pm rd_{[015]} - (r1_{[015]} \times$                   |
|                                         |          |                         | $r2_{[015]})$ $rd \leftarrow \pm rd_{[031]} - (r1_{[031]} \times$     |
| th.mulsw rd,r1,r2                       | mac      | d,s,t                   |                                                                       |
|                                         |          | _                       | $r2_{[031]}$                                                          |
| th.mveqz rd,r1,r2                       | condmov  | d,s,t                   | $rd \leftarrow (r2 = 0) ? r1 : rd$                                    |
| th.mvnez rd,r1,r2                       | condmov  | d,s,t                   | $rd \leftarrow (r2 \neq 0) ? r1 : rd$                                 |
| th.rev                                  | bb       | d,s                     |                                                                       |

| Instruction name                   | Extens.  | Abstract                                | Very short                                                          |
|------------------------------------|----------|-----------------------------------------|---------------------------------------------------------------------|
| and syntax                         | required | parameters                              | description                                                         |
| th.revw                            | bb       | d,s                                     |                                                                     |
| th.sbia                            | memidx   | d,(s),Xs5@20,                           |                                                                     |
|                                    |          |                                         |                                                                     |
|                                    |          | Xu2@25                                  |                                                                     |
| th.sbib                            | memidx   | d,(s),Xs5@20,                           |                                                                     |
|                                    |          |                                         |                                                                     |
| .1 11                              |          | Xu2025                                  |                                                                     |
| th.sdd                             | mempair  | d,t,(s),Xu2@25,<br>X14                  |                                                                     |
| th.sdia                            | memidx   | d,(s),Xs5@20,                           |                                                                     |
| cii. Safa                          | memiax   | u, (5), N50@20,                         |                                                                     |
|                                    |          | Xu2@25                                  |                                                                     |
| th.sdib                            | memidx   | d,(s),Xs5@20,                           |                                                                     |
|                                    |          | ,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, |                                                                     |
|                                    |          | Xu2@25                                  |                                                                     |
| th.sfence.vmas                     | sync     | s,t                                     |                                                                     |
| th.shia                            | memidx   | d,(s),Xs5@20,                           |                                                                     |
|                                    |          |                                         |                                                                     |
|                                    |          | Xu2@25                                  |                                                                     |
| th.shib r2,(r1),im5,im2            | memidx   | d,(s),Xs5@20                            | $r1 \leftarrow \pm r1 + (im5 << im2)$                               |
| 8 bit store                        |          | Xu2@25                                  | $mem[r1][015] \leftarrow r2[015]$                                   |
| th.srb rd,r1,r2,im2                | memidx   | d,s,t,Xu2@25                            | $r1 \leftarrow r1 \pm (im5 << im2)$                                 |
| +hdd100                            |          | 1 - + V000F                             | $mem[r1][07] \leftarrow r2[07]$ $mem[rs1+rs2 << im2] \leftarrow rd$ |
| th.srd rd,r1,r2,im2                | memidx   | d,s,t,Xu2@25                            | $mem[rs1+rs2 << im2] \leftarrow ra$                                 |
| 64 bits store th.srh rd,r1,r2,imm2 | memidx   | d,s,t,Xu2@25                            | $mem[r1+r2 << imm2] \leftarrow rd$                                  |
| 16bits store                       | memitax  | u,5,0,8u2@20                            |                                                                     |
| th.srri rd,rs,im6                  | bb       | d,s,Xu6@20                              | $rd \leftarrow \curvearrowleft (rs >> im6)$                         |
| th.srriw rd,rs,im5                 | bb       | d,s,Xu5@20                              | $rd \leftarrow \sim (rs >> im5)[031]$                               |
| th.srw rd,r1,r2,im2                | memidx   | d,s,t,Xu2@25                            | $mem[r1+r2 << im2] \leftarrow$                                      |
|                                    |          |                                         | rd[031]                                                             |
| th.surb rd,r1,r2,im2               | memidx   | d,s,t,Xu2@25                            | $mem[r1 + r2 << im2] \leftarrow$                                    |
|                                    |          |                                         | rd[07]                                                              |
| th.surd rd,r1,r2,im2               | memidx   | d,s,t,Xu2@25                            |                                                                     |
| th.surh rd,r1,r2,im2               | memidx   | d,s,t,Xu2@25                            | $mem[r1 + r2 << im2] \leftarrow$                                    |
|                                    |          |                                         | rd[015]                                                             |
| th.surw                            | memidx   | d,s,t,Xu2@25                            | $mem[r1 + r2 << im2] \leftarrow$                                    |
| .1 1                               |          | ( )                                     | rd[031]                                                             |
| th.swd                             | mempair  | d,t,(s),Xu2@25,<br>X13                  |                                                                     |
| th.swia                            | memidx   | d,(s),Xs5@20,                           |                                                                     |
| cii. Swia                          | memitax  | u, (5), N50@20,                         |                                                                     |
|                                    |          | Xu2@25                                  |                                                                     |
| th.swib                            | memidx   | d,(s),Xs5@20,                           |                                                                     |
|                                    |          |                                         |                                                                     |
|                                    |          | Xu2@25                                  |                                                                     |
| th.sync                            | sync     |                                         |                                                                     |
| th.sync.i                          | sync     |                                         |                                                                     |
| th.sync.is                         | sync     |                                         |                                                                     |
| th.sync.s                          | sync     |                                         |                                                                     |
| th.tst                             | bs       | d,s,Xu6@20                              |                                                                     |
| th.tstnbz                          | bb       | d,s                                     |                                                                     |
| unimp                              | C        |                                         |                                                                     |
| unimp                              | I        |                                         |                                                                     |

| Instruction name                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |  |
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
| Uret                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |  |
| vaadd.vv                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |  |
| vaadd.vx         V         Vd,Vt,SVm           vaaddu.vv         V         Vd,Vt,SVm           vadc.vim         V         Vd,Vt,Vi,VO           vadc.vvm         V         Vd,Vt,Vs,VO           vadc.vxm         V         Vd,Vt,S,VO           vadc.vxm         V         Vd,Vt,S,VO           vadc.vxm         V         Vd,Vt,SVWm         vde ← VvS2e + imm           vadd.vx         Vd, Vs,ViVm         vde ← vs2e + vs1e         vde ← vs2e + vs1e           vadd.vx         Vd,Vt,ViVm         vde ← vs2e + vs1e         vde ← vs2e + vs1e           vadd.vx         Vd,Vt,ViVm         vde ← vs2e + vs1e         vde ← vs2e + vs1e           vadd.vx         Vd,Vt,ViVm         vde ← vs2e + vs1e         vsee + vs2e + vs1e           vadd.vx         Vd,Vt,ViVm         vde ← vs2e + vs1e         vsee + vs2e + vs1e           vadd.vx         Vd,Vt,ViVm         vde ← vs2e + vs1e         vs1e           vadd.vx         Vd,Vt,VsVm         vde ← vs2e + vs1e         vs2e      < |  |
| vaaddu.vv         V         Vd,Vt,vsVm           vaadc.vim         V         Vd,Vt,vi,Vo           vadc.vxm         V         Vd,Vt,vs,VO           vadc.vxm         V         Vd,Vt,s,VO           vadd.vxm         V         Vd,Vt,s,VO           vadd.vi vd, vs2, imm, vm         V         Vd,Vt,viVm         vde ← ∀vs2e + imm           vadd.vi vd, vs2, vs1, vm         V         Vd,Vt,vsVm         vde ← vs2e + vs1e           vadd.vx vd,vs2,rs1,vm         V         Vd,Vt,vsVm         vde ← vs2e + rs1           vand.vi         V         Vd,Vt,vsVm         vde ← vs2e + rs1           vand.vi         V         Vd,Vt,vsVm         vde ← vs2e + vs1e           vand.vi         V         Vd,Vt,vsVm         vd,Vt,vsVm           vasub.vi         V         Vd,Vt,VsVm           vasub.vv         V                                    |  |
| vaaddu.vx         V         Vd,Vt,vi,Vo           vadc.vim         V         Vd,Vt,Vi,Vo           vadc.vvm         V         Vd,Vt,vs,Vo           vadd.vx         V         Vd,Vt,viVm         vde ← ∀vs2e + imm           vadd.vi         vd, vs2, imm, vm         V         Vd,Vt,ViVm         vde ← ∀vs2e + imm           vadd.vx         vd,vs2, rs1, vm         V         Vd,Vt,vsVm         vde ← vs2e + vs1e           vadd.vx         vd,vs2,rs1,vm         V         Vd,Vt,vsVm         vde ← vs2e + rs1           vand.vi         V         Vd,Vt,vsVm         vd,Vt,vsVm           vand.vi         V         Vd,Vt,vsVm         vd,Vt,vsVm           vasub.vi         V         Vd,Vt,VsVm         vd,Vt,vsVm           vdivs.vi         V                             |  |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |  |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |  |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |  |
| $\begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |  |
| $\begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |  |
| $\begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |  |
| vand.vi         V         Vd,Vt,ViVm           vand.vx         V         Vd,Vt,vsVm           vand.vx         V         Vd,Vt,sVm           vasub.vv         V         Vd,Vt,vsVm           vasubu.vx         V         Vd,Vt,vsVm           vasubu.vx         V         Vd,Vt,vsVm           vasubu.vx         V         Vd,Vt,vsVm           vcompress.vm         V         Vd,VtVm           vdiv.vx         V         Vd,Vt,vsVm           vdiv.vx         V         Vd,Vt,vsVm           vdiv.vx         V         Vd,Vt,vsVm           vdiv.vx         V         Vd,Vt,vsVm           vfadd.vr         Zvef         Vd,Vt,VsVm           vfcadd.vr         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.x.v.v         Zvef         Vd,VtVm           vfcvt.f.x.v.v         Zvef         Vd,VtVm <td></td>                                                                                                    |  |
| vand.vx         V         Vd,Vt,VsVm           vand.vx         V         Vd,Vt,sVm           vasub.vv         V         Vd,Vt,VsVm           vasub.vx         V         Vd,Vt,vsVm           vasubu.vx         V         Vd,Vt,vsVm           vasubu.vx         V         Vd,Vt,vsVm           vcompress.vm         V         Vd,Vt,vs           vd,vt,vsvm         Vd,Vt,vsvm           vdiv.vx         V         Vd,Vt,vsvm           vdiv.vx         V         Vd,Vt,vsvm                                                                                                                                |  |
| vand.vx         V         Vd,Vt,sVm           vasub.vv         V         Vd,Vt,sVm           vasub.vx         V         Vd,Vt,sVm           vasubu.vx         V         Vd,Vt,vsVm           vasubu.vx         V         Vd,Vt,vsVm           vcompress.vm         V         Vd,Vt,vsVm           vcpop.m         V         Vd,Vt,vsVm           vdiv.vx         V         Vd,Vt,sVm           vdiv.vx         V         Vd,Vt,sVm           vdivu.vx         V         Vd,Vt,sVm           vdivu.vx         V         Vd,Vt,sVm           vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vf         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.x.v.         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,VtVm           vfdiv.vv         Zvef         Vd,VtVm                                                                                                                          |  |
| vasub.vv         V         Vd,Vt,vsVm           vasub.vx         V         Vd,Vt,sVm           vasubu.vv         V         Vd,Vt,vsVm           vasubu.vx         V         Vd,Vt,sVm           vcompress.vm         V         Vd,Vt,vs           vcpop.m         V         Vd,Vt,vsVm           vdiv.vv         V         Vd,Vt,sVm           vdiv.vx         V         Vd,Vt,sVm           vdivu.vx         V         Vd,Vt,sVm           vdad.v         Zvef         Vd,Vt,sVm           vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,Vt,VsVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,VtVm                                                                                                                                                                                                                      |  |
| vasub.vx         V         Vd,Vt,sVm           vasubu.vv         V         Vd,Vt,vsVm           vasubu.vx         V         Vd,Vt,sVm           vcompress.vm         V         Vd,Vt,vs           vcpop.m         V         Vd,Vt,VsVm           vdiv.vv         V         Vd,Vt,sVm           vdiv.vx         V         Vd,Vt,sVm           vdivu.vv         V         Vd,Vt,vsVm           vdivu.vx         V         Vd,Vt,sVm           vfabs.v         Zvef         Vd,Vt,SVm           vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,VtVm           vfdiv.vv         Zvef         Vd,VtVm                                                                                                                                                                                                                         |  |
| vasubu.vv         V         Vd,Vt,VsVm           vasubu.vx         V         Vd,Vt,sVm           vcompress.vm         V         Vd,Vt,Vs           vcpop.m         V         Vd,Vt,VsVm           vdiv.vv         V         Vd,Vt,vsVm           vdiv.vx         V         Vd,Vt,sVm           vdivu.vv         V         Vd,Vt,vsVm           vdivu.vx         V         Vd,Vt,sVm           vfabs.v         Zvef         Vd,Vt,SVm           vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,VtVm           vfdiv.vv         Zvef         Vd,VtVm                                                                                                                                                                                                                                                                     |  |
| vasubu.vv         V         Vd,Vt,VsVm           vasubu.vx         V         Vd,Vt,sVm           vcompress.vm         V         Vd,Vt,Vs           vcpop.m         V         Vd,Vt,VsVm           vdiv.vv         V         Vd,Vt,vsVm           vdiv.vx         V         Vd,Vt,sVm           vdivu.vv         V         Vd,Vt,vsVm           vdivu.vx         V         Vd,Vt,sVm           vfabs.v         Zvef         Vd,Vt,SVm           vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,VtVm           vfdiv.vv         Zvef         Vd,VtVm                                                                                                                                                                                                                                                                     |  |
| vasubu.vx         V         Vd,Vt,Vs           vcompress.vm         V         Vd,Vt,Vs           vcpop.m         V         d,VtVm           vdiv.vv         V         Vd,Vt,VsVm           vdiv.vx         V         Vd,Vt,SVm           vdivu.vx         V         Vd,Vt,sVm           vdadv.vx         V         Vd,Vt,SVm           vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.xif.v         Zvef         Vd,VtVm           vfcvt.xxi.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm                                                                                                                                                                                                                                                                                                                  |  |
| vcompress.vm         V         Vd,Vt,Vs           vcpop.m         V         d,VtVm           vdiv.vv         V         Vd,Vt,VsVm           vdiv.vx         V         Vd,Vt,sVm           vdivu.vv         V         Vd,Vt,VsVm           vdivu.vx         V         Vd,Vt,sVm           vfabs.v         Zvef         Vd,Vt,SVm           vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.xi.f.v         Zvef         Vd,VtVm           vfcvt.xxi.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,VtVm           vfair.m         V         Vd,VtVm                                                                                                                                                                                                                                                                                                                                                                               |  |
| vcpop.m         V         d,VtVm           vdiv.vv         V         Vd,Vt,VsVm           vdiv.vx         V         Vd,Vt,sVm           vdivu.vv         V         Vd,Vt,VsVm           vdivu.vx         V         Vd,Vt,sVm           vfabs.v         Zvef         Vd,VuVm           vfadd.vf         Zvef         Vd,Vt,VsVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.xs.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,VtVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                             |  |
| vdiv.vv         V         Vd,Vt,VsVm           vdiv.vx         V         Vd,Vt,sVm           vdivu.vv         V         Vd,Vt,VsVm           vdivu.vx         V         Vd,Vt,SVm           vfabs.v         Zvef         Vd,VuVm           vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.rtz.xu.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                |  |
| vdiv.vx         V         Vd,Vt,sVm           vdivu.vx         V         Vd,Vt,VsVm           vdivu.vx         V         Vd,Vt,SVm           vfabs.v         Zvef         Vd,VuVm           vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.rtz.xu.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |  |
| vdivu.vv         V         Vd,Vt,VsVm           vdivu.vx         V         Vd,Vt,sVm           vfabs.v         Zvef         Vd,VuVm           vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.rtz.xu.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                             |  |
| vdivu.vx         V         Vd,Vt,sVm           vfabs.v         Zvef         Vd,Vt,SVm           vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.rtz.xu.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |  |
| vfabs.v         Zvef         Vd,VuVm           vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.rtz.xu.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |  |
| vfadd.vf         Zvef         Vd,Vt,SVm           vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.rtz.xu.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |  |
| vfadd.vv         Zvef         Vd,Vt,VsVm           vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.rtz.xu.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |  |
| vfclass.v         Zvef         Vd,VtVm           vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.rtz.xu.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |  |
| vfcvt.f.x.v         Zvef         Vd,VtVm           vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.rtz.xu.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |  |
| vfcvt.f.xu.v         Zvef         Vd,VtVm           vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.rtz.xu.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |  |
| vfcvt.rtz.x.f.v         Zvef         Vd,VtVm           vfcvt.rtz.xu.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |  |
| vfcvt.rtz.xu.f.v         Zvef         Vd,VtVm           vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |  |
| vfcvt.x.f.v         Zvef         Vd,VtVm           vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |  |
| vfcvt.xu.f.v         Zvef         Vd,VtVm           vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |  |
| vfdiv.vf         Zvef         Vd,Vt,SVm           vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |  |
| vfdiv.vv         Zvef         Vd,Vt,VsVm           vfirst.m         V         d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |  |
| vfirst.m V d,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |  |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |  |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |  |
| vfmacc.vf Zvef Vd,S,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |  |
| vfmacc.vv Zvef Vd,Vs,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |  |
| vfmadd.vf Zvef Vd,S,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |  |
| vfmadd.vv Zvef Vd,Vs,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |  |
| vfmax.vf Zvef Vd,Vt,SVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |  |
| vfmax.vv Zvef Vd,Vt,VsVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |  |
| vfmerge.vfm Zvef Vd,Vt,S,VO                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |  |
| vfmin.vf Zvef Vd,Vt,SVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |  |
| vfmin.vv Zvef Vd,Vt,VsVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |  |
| vfmsac.vf Zvef Vd,S,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |  |
| vfmsac.vv Zvef Vd,Vs,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |  |
| vfmsub.vf Zvef Vd,S,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |  |
| vfmsub.vv Zvef Vd,Vs,VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |  |
| vfmul.vf Zvef Vd,Vt,SVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |  |
| vfmul.vv Zvef Vd,Vt,VsVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |  |
| vfmv.f.s Zvef D,Vt                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |  |

| Instruction name  | Evtona   | Abatract   | Vory short  |
|-------------------|----------|------------|-------------|
| Instruction name  | Extens.  | Abstract   | Very short  |
| and syntax        | required | parameters | description |
| vfmv.s.f          | Zvef     | Vd,S       |             |
| vfmv.v.f          | Zvef     | Vd,S       |             |
| vfncvt.f.f.w      | Zvef     | Vd,VtVm    |             |
| vfncvt.f.x.w      | Zvef     | Vd,VtVm    |             |
| vfncvt.f.xu.w     | Zvef     | Vd,VtVm    |             |
| vfncvt.rod.f.f.w  | Zvef     | Vd,VtVm    |             |
| vfncvt.rtz.x.f.w  | Zvef     | Vd,VtVm    |             |
| vfncvt.rtz.xu.f.w | Zvef     | Vd,VtVm    |             |
| vfncvt.x.f.w      | Zvef     | Vd,VtVm    |             |
| vfncvt.xu.f.w     | Zvef     | Vd,VtVm    |             |
| vfneg.v           | Zvef     | Vd,VuVm    |             |
| vfnmacc.vf        | Zvef     | Vd,S,VtVm  |             |
| vfnmacc.vv        | Zvef     | Vd,Vs,VtVm |             |
| vfnmadd.vf        | Zvef     | Vd,S,VtVm  |             |
| vfnmadd.vv        | Zvef     | Vd,Vs,VtVm |             |
| vfnmsac.vf        | Zvef     | Vd,S,VtVm  |             |
| vfnmsac.vv        | Zvef     | Vd,Vs,VtVm |             |
| vfnmsub.vf        | Zvef     | Vd,S,VtVm  |             |
| vfnmsub.vv        | Zvef     | Vd,Vs,VtVm |             |
| vfrdiv.vf         | Zvef     | Vd,Vt,SVm  |             |
| vfrec7.v          | Zvef     | Vd,VtVm    |             |
| vfrece7.v         | Zvef     | Vd,VtVm    |             |
| vfredmax.vs       | Zvef     | Vd,Vt,VsVm |             |
| vfredmin.vs       | Zvef     | Vd,Vt,VsVm |             |
| vfredosum.vs      | Zvef     | Vd,Vt,VsVm |             |
| vfredsum.vs       | Zvef     | Vd,Vt,VsVm |             |
| vfredusum.vs      | Zvef     | Vd,Vt,VsVm |             |
| vfrsqrt7.v        | Zvef     | Vd,VtVm    |             |
| vfrsqrte7.v       | Zvef     | Vd,VtVm    |             |
| vfrsub.vf         | Zvef     | Vd,Vt,SVm  |             |
| vfsgnj.vf         | Zvef     | Vd,Vt,SVm  |             |
| vfsgnj.vv         | Zvef     | Vd,Vt,VsVm |             |
| vfsgnjn.vf        | Zvef     | Vd,Vt,SVm  |             |
| vfsgnjn.vv        | Zvef     | Vd,Vt,VsVm |             |
| vfsgnjx.vf        | Zvef     | Vd,Vt,SVm  |             |
| vfsgnjx.vv        | Zvef     | Vd,Vt,VsVm |             |
| vfslide1down.vf   | Zvef     | Vd,Vt,SVm  |             |
| vfslide1up.vf     | Zvef     | Vd,Vt,SVm  |             |
| vfsqrt.v          | Zvef     | Vd,VtVm    |             |
| vfsub.vf          | Zvef     | Vd,Vt,SVm  |             |
| vfsub.vv          | Zvef     | Vd,Vt,VsVm |             |
| vfwadd.vf         | Zvef     | Vd,Vt,SVm  |             |
| vfwadd.vv         | Zvef     | Vd,Vt,VsVm |             |
| vfwadd.wf         | Zvef     | Vd,Vt,SVm  |             |
| vfwadd.wv         | Zvef     | Vd,Vt,VsVm |             |
| vfwcvt.f.f.v      | Zvef     | Vd,VtVm    |             |
| vfwcvt.f.x.v      | Zvef     | Vd,VtVm    |             |
| vfwcvt.f.xu.v     | Zvef     | Vd,VtVm    |             |
| vfwcvt.rtz.x.f.v  | Zvef     | Vd,VtVm    |             |
| vfwcvt.rtz.xu.f.v | Zvef     | Vd,VtVm    |             |
| vfwcvt.x.f.v      | Zvef     | Vd,VtVm    |             |
| vfwcvt.xu.f.v     | Zvef     | Vd,VtVm    |             |
| vfwmacc.vf        | Zvef     | Vd,S,VtVm  |             |
| <u> </u>          |          |            | l .         |

| Instruction name       | Extens.  | Abstract                    | Very short  |
|------------------------|----------|-----------------------------|-------------|
| and syntax             | required |                             | description |
| vfwmacc.vv             | Zvef     | Vd, Vs, VtVm                | dobotipoton |
| vfwmacc.vf             | Zvei     | Vd,Vs,VtVm<br>Vd,S,VtVm     |             |
| vfwmsac.vv             | Zvei     | Vd, Vs, VtVm                |             |
| vfwmul.vf              | Zvei     | Vd, Vs, VtVm<br>Vd, Vt, SVm |             |
| vfwmul.vv              | Zvei     |                             |             |
| vfwnmacc.vf            | Zvei     | Vd,Vt,VsVm<br>Vd,S,VtVm     |             |
| vfwnmacc.vv            | Zvei     | Vd, Vs, VtVm                |             |
| vfwnmsac.vf            | Zvef     | Vd, VS, VtVm<br>Vd, S, VtVm |             |
| vfwnmsac.vv            | Zvei     | Vd,Vs,VtVm                  |             |
| vfwredosum.vs          | Zvei     | Vd, Vt, VsVm                |             |
| vfwredsum.vs           | Zvef     | Vd, Vt, VsVm                |             |
| vfwredusum.vs          | Zvei     | Vd, Vt, VsVm                |             |
| vfwsub.vf              | Zvei     | Vd,Vt,VSVm<br>Vd,Vt,SVm     |             |
| vfwsub.vv              | Zvei     |                             |             |
| vfwsub.wf              |          | Vd, Vt, VsVm                |             |
| vfwsub.wv              | Zvef     | Vd,Vt,SVm                   |             |
| vid.v                  | Zvef     | Vd,Vt,VsVm<br>VdVm          |             |
| viota.m                | V        | Vd,VtVm                     |             |
|                        |          |                             |             |
| vl1r.v                 | V        | Vd,0(s)                     |             |
| vl1re16.v              | V        | Vd,0(s)                     |             |
| vl1re32.v<br>vl1re64.v |          | Vd,0(s)<br>Vd,0(s)          |             |
|                        | V        |                             |             |
| vl1re8.v<br>vl2r.v     | V        | Vd,0(s)                     |             |
|                        |          | Vd,0(s)                     |             |
| vl2re16.v              | V        | Vd,0(s)                     |             |
| vl2re32.v<br>vl2re64.v | V        | Vd,0(s)<br>Vd,0(s)          |             |
| vl2re8.v               | V        | Vd,0(s)                     |             |
|                        | V        |                             |             |
| vl4r.v<br>vl4re16.v    | V        | Vd,0(s)<br>Vd,0(s)          |             |
| v14re32.v              | V        | Vd,0(s)                     |             |
| v14re64.v              | V        | Vd,0(s)                     |             |
| v14re8.v               | V        |                             |             |
| v14reo.v               | V        | Vd,0(s)<br>Vd,0(s)          |             |
| vl8re16.v              | V        | Vd,0(s)                     |             |
|                        | V        | Vd,0(s)                     |             |
| vl8re32.v<br>vl8re64.v | V        | Vd,0(s)                     |             |
|                        | V        | Vd,0(s)                     |             |
| vl8re8.v<br>vle1.v     | V        |                             |             |
|                        | V        | Vd,0(s)                     |             |
| vle16.v                | V        | Vd,0(s)Vm                   |             |
| vle16ff.v              |          | Vd,0(s)Vm                   |             |
| vle32.v                | V        | Vd,0(s)Vm                   |             |
| vle32ff.v              | V        | Vd,0(s)Vm                   |             |
| vle64.v                | V        | Vd,0(s)Vm                   |             |
| vle64ff.v              | V        | Vd,0(s)Vm                   |             |
| vle8.v                 | V        | Vd,0(s)Vm                   |             |
| vle8ff.v               |          | Vd,0(s)Vm                   |             |
| vlm.v                  | V        | Vd,0(s)                     |             |
| vloxei16.v             | V        | Vd,0(s),VtVm                |             |
| vloxei32.v             | V        | Vd,0(s),VtVm                |             |
| vloxei64.v             | V        | Vd,0(s),VtVm                |             |
| vloxei8.v              | V        | Vd,0(s),VtVm                |             |
| vloxseg2ei16.v         | V        | Vd,0(s),VtVm                |             |

| Instruction name                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |  |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
| vloxseg2ei32.v         V         Vd,0(s),VtVm           vloxseg2ei64.v         V         Vd,0(s),VtVm           vloxseg2ei8.v         V         Vd,0(s),VtVm           vloxseg3ei16.v         V         Vd,0(s),VtVm           vloxseg3ei32.v         V         Vd,0(s),VtVm           vloxseg3ei64.v         V         Vd,0(s),VtVm           vloxseg3ei8.v         V         Vd,0(s),VtVm           vloxseg4ei16.v         V         Vd,0(s),VtVm           vloxseg4ei32.v         V         Vd,0(s),VtVm           vloxseg4ei8.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg5ei32.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei4.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v |  |
| vloxseg2ei64.v         V         Vd,0(s),VtVm           vloxseg2ei8.v         V         Vd,0(s),VtVm           vloxseg3ei16.v         V         Vd,0(s),VtVm           vloxseg3ei32.v         V         Vd,0(s),VtVm           vloxseg3ei64.v         V         Vd,0(s),VtVm           vloxseg3ei8.v         V         Vd,0(s),VtVm           vloxseg4ei16.v         V         Vd,0(s),VtVm           vloxseg4ei32.v         V         Vd,0(s),VtVm           vloxseg4ei8.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg5ei32.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg6ei8.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                          |  |
| vloxseg2ei8.v         V         Vd,0(s),VtVm           vloxseg3ei16.v         V         Vd,0(s),VtVm           vloxseg3ei32.v         V         Vd,0(s),VtVm           vloxseg3ei64.v         V         Vd,0(s),VtVm           vloxseg3ei8.v         V         Vd,0(s),VtVm           vloxseg4ei16.v         V         Vd,0(s),VtVm           vloxseg4ei32.v         V         Vd,0(s),VtVm           vloxseg4ei8.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg5ei32.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg5ei8.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei44.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                 |  |
| vloxseg3ei16.v         V         Vd,0(s),VtVm           vloxseg3ei32.v         V         Vd,0(s),VtVm           vloxseg3ei64.v         V         Vd,0(s),VtVm           vloxseg3ei8.v         V         Vd,0(s),VtVm           vloxseg4ei16.v         V         Vd,0(s),VtVm           vloxseg4ei32.v         V         Vd,0(s),VtVm           vloxseg4ei8.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg5ei32.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei8.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                 |  |
| vloxseg3ei32.v         V         Vd,0(s),VtVm           vloxseg3ei64.v         V         Vd,0(s),VtVm           vloxseg3ei8.v         V         Vd,0(s),VtVm           vloxseg4ei16.v         V         Vd,0(s),VtVm           vloxseg4ei32.v         V         Vd,0(s),VtVm           vloxseg4ei64.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg5ei32.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                              |  |
| vloxseg3ei64.v         V         Vd,0(s),VtVm           vloxseg3ei8.v         V         Vd,0(s),VtVm           vloxseg4ei16.v         V         Vd,0(s),VtVm           vloxseg4ei32.v         V         Vd,0(s),VtVm           vloxseg4ei64.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg5ei32.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                              |  |
| vloxseg3ei8.v         V         Vd,0(s),VtVm           vloxseg4ei16.v         V         Vd,0(s),VtVm           vloxseg4ei32.v         V         Vd,0(s),VtVm           vloxseg4ei64.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg5ei32.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg5ei8.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                        |  |
| vloxseg4ei16.v         V         Vd,0(s),VtVm           vloxseg4ei32.v         V         Vd,0(s),VtVm           vloxseg4ei64.v         V         Vd,0(s),VtVm           vloxseg4ei8.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg5ei32.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                       |  |
| vloxseg4ei32.v         V         Vd,O(s),VtVm           vloxseg4ei64.v         V         Vd,O(s),VtVm           vloxseg4ei8.v         V         Vd,O(s),VtVm           vloxseg5ei16.v         V         Vd,O(s),VtVm           vloxseg5ei32.v         V         Vd,O(s),VtVm           vloxseg5ei64.v         V         Vd,O(s),VtVm           vloxseg5ei8.v         V         Vd,O(s),VtVm           vloxseg6ei16.v         V         Vd,O(s),VtVm           vloxseg6ei32.v         V         Vd,O(s),VtVm           vloxseg6ei64.v         V         Vd,O(s),VtVm           vloxseg7ei16.v         V         Vd,O(s),VtVm           vloxseg7ei32.v         V         Vd,O(s),VtVm           vloxseg7ei64.v         V         Vd,O(s),VtVm           vloxseg7ei8.v         V         Vd,O(s),VtVm           vloxseg8ei16.v         V         Vd,O(s),VtVm                                                                                                                                                                                                                                                                                                                |  |
| vloxseg4ei64.v         V         Vd,0(s),VtVm           vloxseg4ei8.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg5ei32.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg5ei8.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                        |  |
| vloxseg4ei8.v         V         Vd,0(s),VtVm           vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg5ei32.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg5ei8.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                |  |
| vloxseg5ei16.v         V         Vd,0(s),VtVm           vloxseg5ei32.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg5ei8.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg6ei8.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |  |
| vloxseg5ei32.v         V         Vd,0(s),VtVm           vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg5ei8.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg6ei8.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |  |
| vloxseg5ei64.v         V         Vd,0(s),VtVm           vloxseg5ei8.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg6ei8.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |  |
| vloxseg5ei8.v         V         Vd,0(s),VtVm           vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg6ei8.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |  |
| vloxseg6ei16.v         V         Vd,0(s),VtVm           vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg6ei8.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |  |
| vloxseg6ei32.v         V         Vd,0(s),VtVm           vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg6ei8.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |  |
| vloxseg6ei64.v         V         Vd,0(s),VtVm           vloxseg6ei8.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |  |
| vloxseg6ei8.v         V         Vd,0(s),VtVm           vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |  |
| vloxseg7ei16.v         V         Vd,0(s),VtVm           vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |  |
| vloxseg7ei32.v         V         Vd,0(s),VtVm           vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |  |
| vloxseg7ei64.v         V         Vd,0(s),VtVm           vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |  |
| vloxseg7ei8.v         V         Vd,0(s),VtVm           vloxseg8ei16.v         V         Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |  |
| vloxseg8ei16.v V Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |  |
| 9                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |  |
| v]oveog8oi32 v                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |  |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |  |
| vloxseg8ei64.v V Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |  |
| vloxseg8ei8.v V Vd,0(s),VtVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |  |
| vlse16.v V Vd,0(s),tVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |  |
| vlse32.v V Vd,0(s),tVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |  |
| Vlse64.v Vd,0(s),tVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |  |
| Vlse8.v V Vd,0(s),tVm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |  |
| vlseg2e16.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |  |
| vlseg2e16ff.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |  |
| vlseg2e32.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |  |
| vlseg2e32ff.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |  |
| vlseg2e64.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |  |
| vlseg2e64ff.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |  |
| vlseg2e8.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |  |
| vlseg2e8ff.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |  |
| vlseg3e16.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |  |
| vlseg3e16ff.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |  |
| vlseg3e32.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |  |
| vlseg3e32ff.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |  |
| vlseg3e64.v Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |  |
| vlseg3e64ff.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |  |
| vlseg3e8.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |  |
| vlseg3e8ff.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |  |
| vlseg4e16.v Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |  |
| vlseg4e16ff.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |  |
| vlseg4e32.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |  |
| vlseg4e32ff.v V Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |  |
| vlseg4e64.v Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |  |
| vlseg4e64ff.v Vd,0(s)Vm                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |  |

| Instruction name             | Extens.  | Abstract                   | Very short  |
|------------------------------|----------|----------------------------|-------------|
| and syntax                   | required |                            | description |
| vlseg4e8.v                   | V        | Vd,0(s)Vm                  |             |
| vlseg4e8ff.v                 | V        | Vd,0(s)Vm                  |             |
| vlseg5e16.v                  | V        | Vd,0(s)Vm                  |             |
| vlseg5e16ff.v                | V        | Vd,0(s)Vm                  |             |
| vlseg5e32.v                  | V        | Vd,0(s)Vm                  |             |
| vlseg5e32ff.v                | V        | Vd,0(s)Vm                  |             |
| vlseg5e64.v                  | V        | Vd,0(s)Vm                  |             |
| vlseg5e64ff.v                | V        | Vd,0(s)Vm                  |             |
| vlseg5e8.v                   | V        | Vd,0(s)Vm                  |             |
| vlseg5e8ff.v                 | V        | Vd,0(s)Vm                  |             |
| vlseg6e16.v                  | V        | Vd,0(s)Vm                  |             |
| vlseg6e16ff.v                | V        | Vd,0(s)Vm                  |             |
| vlseg6e32.v                  | V        | Vd,0(s)Vm                  |             |
| vlseg6e32ff.v                | V        | Vd,0(s)Vm                  |             |
| vlseg6e64.v                  | V        | Vd,0(s)Vm                  |             |
|                              | V        | Vd,0(s)Vm<br>Vd,0(s)Vm     |             |
| vlseg6e64ff.v<br>vlseg6e8.v  | V        | Vd,0(s)Vm<br>Vd,0(s)Vm     |             |
|                              | V        | Vd,0(s)Vm                  |             |
| vlseg6e8ff.v                 | V        | Vd,0(s)Vm                  |             |
| vlseg7e16.v<br>vlseg7e16ff.v | V        | Vd,0(s)Vm                  |             |
| vlseg7e32.v                  | V        | Vd,0(s)Vm                  |             |
| vlseg7e32.v<br>vlseg7e32ff.v | V        | Vd,0(s)Vm                  |             |
|                              | V        | Vd,0(s)Vm                  |             |
| vlseg7e64.v                  | V        | Vd,0(s)Vm                  |             |
| vlseg7e64ff.v                | V        | Vd,0(s)Vm                  |             |
| vlseg7e8.v                   | V        | Vd,0(s)Vm                  |             |
| vlseg7e8ff.v                 | V        | Vd,0(s)Vm                  |             |
| vlseg8e16.v                  | V        | Vd,0(s)Vm                  |             |
| vlseg8e16ff.v                | V        | Vd,0(s)Vm                  |             |
| vlseg8e32.v                  | V        | •                          |             |
| vlseg8e32ff.v                | V        | Vd,0(s)Vm<br>Vd,0(s)Vm     |             |
| vlseg8e64.v                  | V        | Vd,0(s)Vm                  |             |
| vlseg8e64ff.v                | V        |                            |             |
| vlseg8e8.v                   | V        | Vd,0(s)Vm<br>Vd,0(s)Vm     |             |
| vlseg8e8ff.v                 | V        | •                          |             |
| vlsseg2e16.v                 | V        | Vd,0(s),tVm                |             |
| vlsseg2e32.v                 | V        | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vlsseg2e64.v                 | V        |                            |             |
| vlsseg2e8.v                  | V        | Vd,0(s),tVm                |             |
| vlsseg3e16.v                 | V        | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vlsseg3e32.v                 | V        |                            |             |
| vlsseg3e64.v                 |          | Vd,0(s),tVm                |             |
| vlsseg3e8.v                  | V        | Vd,0(s),tVm                |             |
| vlsseg4e16.v                 | V        | Vd,0(s),tVm                |             |
| vlsseg4e32.v                 | V        | Vd,0(s),tVm                |             |
| vlsseg4e64.v                 | V        | Vd,0(s),tVm                |             |
| vlsseg4e8.v                  | V        | Vd,0(s),tVm                |             |
| vlsseg5e16.v                 | V        | Vd,0(s),tVm                |             |
| vlsseg5e32.v                 | V        | Vd,0(s),tVm                |             |
| vlsseg5e64.v                 | V        | Vd,0(s),tVm                |             |
| vlsseg5e8.v                  | V        | Vd,0(s),tVm                |             |
| vlsseg6e16.v                 | V        | Vd,0(s),tVm                |             |
| vlsseg6e32.v                 | V        | Vd,0(s),tVm                |             |
| vlsseg6e64.v                 | V        | Vd,0(s),tVm                |             |

| Instruction name | Extens.  | Abstract     | Very short  |
|------------------|----------|--------------|-------------|
| and syntax       | required | parameters   | description |
| vlsseg6e8.v      | V        | Vd,0(s),tVm  |             |
| vlsseg7e16.v     | V        | Vd,0(s),tVm  |             |
| vlsseg7e32.v     | V        | Vd,O(s),tVm  |             |
| vlsseg7e64.v     | V        | Vd,O(s),tVm  |             |
| vlsseg7e8.v      | V        | Vd,0(s),tVm  |             |
| vlsseg8e16.v     | V        | Vd,O(s),tVm  |             |
| vlsseg8e32.v     | V        | Vd,0(s),tVm  |             |
| vlsseg8e64.v     | V        | Vd,0(s),tVm  |             |
| vlsseg8e8.v      | V        | Vd,0(s),tVm  |             |
| vluxei16.v       | V        | Vd,0(s),VtVm |             |
| vluxei32.v       | V        | Vd,0(s),VtVm |             |
| vluxei64.v       | V        | Vd,0(s),VtVm |             |
| vluxei8.v        | V        | Vd,0(s),VtVm |             |
| vluxseg2ei16.v   | V        | Vd,0(s),VtVm |             |
| vluxseg2ei32.v   | V        | Vd,0(s),VtVm |             |
| vluxseg2ei64.v   | V        | Vd,0(s),VtVm |             |
| vluxseg2ei8.v    | V        | Vd,O(s),VtVm |             |
| vluxseg3ei16.v   | V        | Vd,0(s),VtVm |             |
| vluxseg3ei32.v   | V        | Vd,0(s),VtVm |             |
| vluxseg3ei64.v   | V        | Vd,0(s),VtVm |             |
| vluxseg3ei8.v    | V        | Vd,0(s),VtVm |             |
| vluxseg4ei16.v   | V        | Vd,0(s),VtVm |             |
| vluxseg4ei32.v   | V        | Vd,0(s),VtVm |             |
| vluxseg4ei64.v   | V        | Vd,O(s),VtVm |             |
| vluxseg4ei8.v    | V        | Vd,0(s),VtVm |             |
| vluxseg5ei16.v   | V        | Vd,0(s),VtVm |             |
| vluxseg5ei32.v   | V        | Vd,0(s),VtVm |             |
| vluxseg5ei64.v   | V        | Vd,0(s),VtVm |             |
| vluxseg5ei8.v    | V        | Vd,O(s),VtVm |             |
| vluxseg6ei16.v   | V        | Vd,0(s),VtVm |             |
| vluxseg6ei32.v   | V        | Vd,0(s),VtVm |             |
| vluxseg6ei64.v   | V        | Vd,0(s),VtVm |             |
| vluxseg6ei8.v    | V        | Vd,0(s),VtVm |             |
| vluxseg7ei16.v   | V        | Vd,0(s),VtVm |             |
| vluxseg7ei32.v   | V        | Vd,0(s),VtVm |             |
| vluxseg7ei64.v   | V        | Vd,O(s),VtVm |             |
| vluxseg7ei8.v    | V        | Vd,0(s),VtVm |             |
| vluxseg8ei16.v   | V        | Vd,0(s),VtVm |             |
| vluxseg8ei32.v   | V        | Vd,0(s),VtVm |             |
| vluxseg8ei64.v   | V        | Vd,O(s),VtVm |             |
| vluxseg8ei8.v    | V        | Vd,O(s),VtVm |             |
| vmacc.vv         | V        | Vd,Vs,VtVm   |             |
| vmacc.vx         | V        | Vd,s,VtVm    |             |
| vmadc.vi         | V        | Vd,Vt,Vi     |             |
| vmadc.vim        | V        | Vd,Vt,Vi,VO  |             |
| vmadc.vv         | V        | Vd,Vt,Vs     |             |
| vmadc.vvm        | V        | Vd,Vt,Vs,V0  |             |
| vmadc.vx         | V        | Vd,Vt,s      |             |
| vmadc.vxm        | V        | Vd,Vt,s,VO   |             |
| vmadd.vv         | V        | Vd,Vs,VtVm   |             |
| vmadd.vx         | V        | Vd,s,VtVm    |             |
| vmand.mm         | V        | Vd,Vt,Vs     |             |
| vmandn.mm        | V        | Vd,Vt,Vs     |             |

| Instruction name | Entona   | Abatmost                | Vone about             |
|------------------|----------|-------------------------|------------------------|
| Instruction name | Extens.  | Abstract                | Very short description |
| and syntax       | required | parameters              | description            |
| vmandnot.mm      | V        | Vd,Vt,Vs                |                        |
| vmax.vv          | V        | Vd,Vt,VsVm              |                        |
| vmax.vx          | V        | Vd,Vt,sVm               |                        |
| vmaxu.vv         | V        | Vd,Vt,VsVm              |                        |
| vmaxu.vx         | V        | Vd,Vt,sVm               |                        |
| vmclr.m          | V        | Vv                      |                        |
| vmcpy.m          | V        | Vd, Vu                  |                        |
| vmerge.vim       | V        | Vd,Vt,Vi,VO             |                        |
| vmerge.vvm       | V        | Vd,Vt,Vs,V0             |                        |
| vmerge.vxm       | V        | Vd,Vt,s,V0              |                        |
| vmfeq.vf         | Zvef     | Vd,Vt,SVm               |                        |
| vmfeq.vv         | Zvef     | Vd,Vt,VsVm              |                        |
| vmfge.vf         | Zvef     | Vd,Vt,SVm               |                        |
| vmfge.vv         | Zvef     | Vd,Vs,VtVm              |                        |
| vmfgt.vf         | Zvef     | Vd,Vt,SVm               |                        |
| vmfgt.vv         | Zvef     | Vd,Vs,VtVm              |                        |
| vmfle.vf         | Zvef     | Vd,Vt,SVm               |                        |
| vmfle.vv         | Zvef     | Vd,Vt,VsVm              |                        |
| vmflt.vf         | Zvef     | Vd,Vt,SVm               |                        |
| vmflt.vv         | Zvef     | Vd,Vt,VsVm              |                        |
| vmfne.vf         | Zvef     | Vd,Vt,SVm               |                        |
| vmfne.vv         | Zvef     | Vd,Vt,VsVm              |                        |
| vmin.vv          | V        | Vd,Vt,VsVm              |                        |
| vmin.vx          | V        | Vd,Vt,sVm               |                        |
| vminu.vv         | V        | Vd,Vt,VsVm              |                        |
| vminu.vx         | V        | Vd,Vt,sVm               |                        |
| vmmv.m           | V        | Vd, Vu                  |                        |
| vmnand.mm        | V        | Vd,Vt,Vs                |                        |
| vmnor.mm         | V        | Vd,Vt,Vs                |                        |
| vmnot.m          | V        | Vd, Vu                  |                        |
| vmor.mm          | V        | Vd,Vt,Vs                |                        |
| vmorn.mm         | V        | Vd,Vt,Vs                |                        |
| vmornot.mm       | V        | Vd,Vt,Vs                |                        |
| vmsbc.vv         | V        | Vd,Vt,Vs                |                        |
| vmsbc.vvm        | V        | Vd,Vt,Vs,V0             |                        |
| vmsbc.vx         | V        | Vd,Vt,s                 |                        |
| vmsbc.vxm        | V        | Vd,Vt,s,V0              |                        |
| vmsbf.m          | V        | Vd,VtVm<br>Vd,Vt,ViVm   |                        |
| vmseq.vi         | V        |                         |                        |
| vmseq.vv         |          | Vd,Vt,VsVm<br>Vd,Vt,sVm |                        |
| vmseq.vx         | V        |                         |                        |
| vmset.m          | V        | Vv Vd V+ VlrVm          |                        |
| vmsge.vi         | V        | Vd,Vt,VkVm              |                        |
| vmsge.vv         | V        | Vd,Vs,VtVm              |                        |
| vmsge.vx         | V        | Vd,Vt,s,VM,VT Vd,Vt,sVm |                        |
| vmsge.vx         | V        |                         |                        |
| vmsgeu.vi        | V        | Vd, Vt, VkVm            |                        |
| vmsgeu.vi        | V        | Vd, Vu, OVm             |                        |
| vmsgeu.vv        |          | Vd, Vs, VtVm            |                        |
| vmsgeu.vx        | V        | Vd,Vt,s,VM,VT           |                        |
| vmsgeu.vx        |          | Vd,Vt,sVm               |                        |
| vmsgt.vi         | V        | Vd,Vt,ViVm              |                        |
| vmsgt.vv         | v        | Vd,Vs,VtVm              |                        |

| Instruction name | Extens.  | Abstract                | Very short  |
|------------------|----------|-------------------------|-------------|
| and syntax       | required | parameters              | description |
| vmsgt.vx         | V        | Vd,Vt,sVm               |             |
| vmsgtu.vi        | V        | Vd,Vt,ViVm              |             |
| vmsgtu.vv        | V        | Vd,Vs,VtVm              |             |
| vmsgtu.vx        | V        | Vd,Vt,sVm               |             |
| vmsif.m          | V        | Vd,VtVm                 |             |
| vmsle.vi         | V        | Vd,Vt,ViVm              |             |
| vmsle.vv         | V        | Vd,Vt,VsVm              |             |
| vmsle.vx         | V        | Vd,Vt,sVm               |             |
| vmsleu.vi        | V        | Vd,Vt,ViVm              |             |
| vmsleu.vv        | V        | Vd,Vt,VsVm              |             |
| vmsleu.vx        | V        | Vd,Vt,sVm               |             |
| vmslt.vi         | V        | Vd,Vt,VkVm              |             |
| vmslt.vv         | V        | Vd,Vt,VsVm              |             |
| vmslt.vx         | V        | Vd,Vt,sVm               |             |
| vmsltu.vi        | V        | Vd,Vt,VkVm              |             |
| vmsltu.vi        | V        | Vd,Vu,OVm               |             |
| vmsltu.vv        | V        | Vd,Vt,VsVm              |             |
| vmsltu.vx        | V        | Vd,Vt,sVm               |             |
| vmsne.vi         | V        | Vd,Vt,ViVm              |             |
| vmsne.vv         | V        | Vd,Vt,VsVm              |             |
| vmsne.vx         | V        | Vd,Vt,sVm               |             |
| vmsof.m          | V        | Vd,VtVm                 |             |
| vmul.vv          | V        | Vd,Vt,VsVm              |             |
| vmul.vx          | V        | Vd,Vt,sVm               |             |
| vmulh.vv         | V        | Vd,Vt,VsVm              |             |
| vmulh.vx         | V        | Vd,Vt,sVm               |             |
| vmulhsu.vv       | V        | Vd,Vt,VsVm              |             |
| vmulhsu.vx       | V        | Vd,Vt,sVm               |             |
| vmulhu.vv        | V        | Vd,Vt,VsVm              |             |
| vmulhu.vx        | V        | Vd,Vt,sVm               |             |
| vmv.s.x          | V        | Vd,s                    |             |
| vmv.v.i          | V        | Vd,Vi                   |             |
| vmv.v.v          | V        | Vd,Vs                   |             |
| vmv.v.x          | V        | Vd,s                    |             |
| VMV.X.S          | V        | d,Vt                    |             |
| vmv1r.v          | V        | Vd,Vt                   |             |
| vmv2r.v          | V        | Vd,Vt                   |             |
| vmv4r.v          | V        | Vd,Vt                   |             |
| vmv8r.v          | V        | Vd,Vt                   |             |
| vmxnor.mm        | V        | Vd,Vt,Vs                |             |
| vmxor.mm         | V        | Vd,Vt,Vs                |             |
| vnclip.wi        | V        | Vd,Vt,VjVm              |             |
| vnclip.wv        | V        | Vd,Vt,VsVm              |             |
| vnclip.wx        | V        | Vd,Vt,sVm               |             |
| vnclipu.wi       | V        | Vd,Vt,VjVm              |             |
| vnclipu.wx       | V        | Vd,Vt,VsVm              |             |
| •                | V        | Vd,Vt,sVm<br>Vd,VtVm    |             |
| vncvt.x.x.w      | V        |                         |             |
| vneg.v           |          | Vd,VtVm                 |             |
| Vnmsac.vv        | V        | Vd,Vs,VtVm<br>Vd,s,VtVm |             |
| vnmsac.vx        |          |                         |             |
| vnmsub.vv        | V        | Vd,Vs,VtVm              |             |
| vnmsub.vx        | V        | Vd,s,VtVm               |             |

| Instruction name   | Extens.  | Abstract          | Very short  |
|--------------------|----------|-------------------|-------------|
| and syntax         | required | parameters        | description |
| vnot.v             | V        | Vd,VtVm           | 1           |
| vnsra.wi           | V        | Vd,Vt,VjVm        |             |
| vnsra.wv           | V        | Vd,Vt,VsVm        |             |
| vnsra.wx           | V        | Vd,Vt,sVm         |             |
| vnsrl.wi           | V        | Vd,Vt,VjVm        |             |
| vnsrl.wv           | V        | Vd,Vt,VsVm        |             |
| vnsrl.wx           | V        | Vd,Vt,sVm         |             |
| vor.vi             | V        | Vd,Vt,ViVm        |             |
| vor.vv             | V        | Vd,Vt,VsVm        |             |
| vor.vx             | V        | Vd,Vt,sVm         |             |
| vpopc.m            | V        | d,VtVm            |             |
| vredand.vs         | V        | Vd,Vt,VsVm        |             |
| vredmax.vs         | V        | Vd,Vt,VsVm        |             |
| vredmaxu.vs        | V        | Vd,Vt,VsVm        |             |
| vredmin.vs         | V        | Vd,Vt,VsVm        |             |
| vredminu.vs        | V        | Vd,Vt,VsVm        |             |
| vredor.vs          | V        | Vd,Vt,VsVm        |             |
| vredsum.vs         | V        | Vd,Vt,VsVm        |             |
| vredxor.vs         | V        | Vd,Vt,VsVm        |             |
| vrem.vv            | V        | Vd,Vt,VsVm        |             |
| vrem.vx            | V        | Vd,Vt,sVm         |             |
| vremu.vv           | V        | Vd,Vt,VsVm        |             |
| vremu.vx           | V        | Vd,Vt,sVm         |             |
| vrgather.vi        | V        | Vd,Vt,VjVm        |             |
| vrgather.vv        | V        | Vd,Vt,VsVm        |             |
| vrgather.vx        | V        | Vd,Vt,sVm         |             |
| vrgatherei16.vv    | V        | Vd,Vt,VsVm        |             |
| vrsub.vi           | V        | Vd,Vt,ViVm        |             |
| vrsub.vx           | V        | Vd,Vt,sVm         |             |
| vs1r.v             | V        | Vd,0(s)           |             |
| vs2r.v             | V        | Vd,0(s)           |             |
| vs4r.v             | V        | Vd,0(s)           |             |
| vs8r.v             | V        | Vd,0(s)           |             |
| vsadd.vi           | V        | Vd,Vt,ViVm        |             |
| vsadd.vv           | V        | Vd,Vt,VsVm        |             |
| vsadd.vx           | V        | Vd,Vt,sVm         |             |
| vsaddu.vi          | V        | Vd,Vt,ViVm        |             |
| vsaddu.vv          | V        | Vd,Vt,VsVm        |             |
| vsaddu.vx          | V        | Vd,Vt,sVm         |             |
| vsbc.vvm           | V        | Vd,Vt,Vs,V0       |             |
| vsbc.vxm           | V        | Vd,Vt,vs,VO       |             |
| vse1.v             | V        | Vd, Vt, S, VO     |             |
| vse16.v            | V        | Vd,0(s)Vm         |             |
| vse32.v            | V        | Vd,0(s)Vm         |             |
| vse64.v            | V        | Vd,0(s)Vm         |             |
| vse8.v             | V        | Vd,O(s)Vm         |             |
| vsetivli           | V        | d,Z,Vb            |             |
| vsetvl             | V        | d,s,t             |             |
| vsetvli            | V        | d,s,Vc            |             |
| vsext.vf2          | V        | Vd,VtVm           |             |
| vsext.vf4          | V        | Vd,VtVm           |             |
| vsext.vf8          | V        | Vd,VtVm           |             |
| vslide1down.vx     | V        | Vd,Vt,sVm         |             |
| A PITICETOOMII. AY | V V      | vu, v t , 5 V III | 1           |

| Instruction name | Extens.  | Abstract     | Very short     |
|------------------|----------|--------------|----------------|
| and syntax       | required | parameters   | description    |
| •                | -        | -            | accert the rom |
| vslide1up.vx     | V        | Vd,Vt,sVm    |                |
| vslidedown.vi    | V        | Vd,Vt,VjVm   |                |
| vslidedown.vx    | V        | Vd,Vt,sVm    |                |
| vslideup.vi      | V        | Vd,Vt,VjVm   |                |
| vslideup.vx      | V        | Vd,Vt,sVm    |                |
| vsll.vi          | V        | Vd,Vt,VjVm   |                |
| vsll.vv          | V        | Vd,Vt,VsVm   |                |
| vsll.vx          | V        | Vd,Vt,sVm    |                |
| vsm.v            | V        | Vd,0(s)      |                |
| vsmul.vv         | V        | Vd,Vt,VsVm   |                |
| vsmul.vx         | V        | Vd,Vt,sVm    |                |
| vsoxei16.v       | V        | Vd,0(s),VtVm |                |
| vsoxei32.v       | V        | Vd,0(s),VtVm |                |
| vsoxei64.v       | V        | Vd,0(s),VtVm |                |
| vsoxei8.v        | V        | Vd,0(s),VtVm |                |
| vsoxseg2ei16.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg2ei32.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg2ei64.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg2ei8.v    | V        | Vd,0(s),VtVm |                |
| vsoxseg3ei16.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg3ei32.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg3ei64.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg3ei8.v    | V        | Vd,0(s),VtVm |                |
| vsoxseg4ei16.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg4ei32.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg4ei64.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg4ei8.v    | V        | Vd,0(s),VtVm |                |
| vsoxseg5ei16.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg5ei32.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg5ei64.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg5ei8.v    | V        | Vd,0(s),VtVm |                |
| vsoxseg6ei16.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg6ei32.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg6ei64.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg6ei8.v    | V        | Vd,0(s),VtVm |                |
| vsoxseg7ei16.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg7ei32.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg7ei64.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg7ei8.v    | V        | Vd,0(s),VtVm |                |
| vsoxseg8ei16.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg8ei32.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg8ei64.v   | V        | Vd,0(s),VtVm |                |
| vsoxseg8ei8.v    | V        | Vd,0(s),VtVm |                |
| vsra.vi          | V        | Vd,Vt,VjVm   |                |
| vsra.vv          | V        | Vd,Vt,VsVm   |                |
| vsra.vx          | V        | Vd,Vt,sVm    |                |
| vsrl.vi          | V        | Vd,Vt,VjVm   |                |
| vsrl.vv          | V        | Vd,Vt,VsVm   |                |
| vsrl.vx          | V        | Vd,Vt,sVm    |                |
| vsse16.v         | V        | Vd,0(s),tVm  |                |
| vsse32.v         | V        | Vd,0(s),tVm  |                |
| vsse64.v         | V        | Vd,0(s),tVm  |                |
| vsse8.v          | V        | Vd,0(s),tVm  |                |

|                              | п.       | 41                         | T           |
|------------------------------|----------|----------------------------|-------------|
| Instruction name             | Extens.  | Abstract                   | Very short  |
| and syntax                   | required | parameters                 | description |
| vsseg2e16.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg2e32.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg2e64.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg2e8.v                   | V        | Vd,0(s)Vm                  |             |
| vsseg3e16.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg3e32.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg3e64.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg3e8.v                   | V        | Vd,0(s)Vm                  |             |
| vsseg4e16.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg4e32.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg4e64.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg4e8.v                   | V        | Vd,0(s)Vm                  |             |
| vsseg5e16.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg5e32.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg5e64.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg5e8.v                   | V        | Vd,0(s)Vm                  |             |
| vsseg6e16.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg6e32.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg6e64.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg6e8.v                   | V        | Vd,0(s)Vm                  |             |
| vsseg7e16.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg7e32.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg7e64.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg7e8.v                   | V        | Vd,0(s)Vm                  |             |
| vsseg8e16.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg8e32.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg8e64.v                  | V        | Vd,0(s)Vm                  |             |
| vsseg8e8.v                   | V        | Vd,0(s)Vm                  |             |
| vssra.vi                     | V        | Vd,Vt,VjVm                 |             |
| vssra.vv                     | V        | Vd,Vt,VsVm                 |             |
| vssra.vx                     | V        | Vd,Vt,sVm                  |             |
| vssrl.vi                     | V        | Vd,Vt,VjVm                 |             |
| vssrl.vv                     | V        | Vd,Vt,VsVm                 |             |
| vssrl.vx                     | V        | Vd,Vt,sVm                  |             |
| vssseg2e16.v                 | V        | Vd,0(s),tVm                |             |
| vssseg2e32.v                 | V        | Vd,0(s),tVm                |             |
| vssseg2e64.v                 | V        | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vssseg2e8.v<br>vssseg3e16.v  | V        | Vd,0(s),tVm                |             |
| vssseg3e10.v<br>vssseg3e32.v | V        | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vssseg3e32.v<br>vssseg3e64.v | V        | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vssseg3e04.v                 | V        | Vd,0(s),tVm                |             |
| vssseg3eo.v<br>vssseg4e16.v  | V        | Vd,0(s),tVm                |             |
| vssseg4e16.v<br>vssseg4e32.v | V        | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vssseg4e32.v<br>vssseg4e64.v | V        | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
|                              | V        | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vssseg4e8.v                  | V        | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vssseg5e16.v<br>vssseg5e32.v | V        | Vd,0(s),tVm                |             |
| vssseg5e52.v<br>vssseg5e64.v | V        | Vd,0(s),tVm                |             |
|                              | V        | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vssseg5e8.v<br>vssseg6e16.v  | V        | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vssseg6e32.v                 | V        | Vd,0(s),tVm                |             |
| vssseg6e64.v                 | V        | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| VDDBGGGGT. V                 | ٧        | να,υ(ο),υν <b>ι</b> ι      | 1           |

| Instruction name             | Extens.      | Abstract                   | Very short  |
|------------------------------|--------------|----------------------------|-------------|
| and syntax                   | required     | parameters                 | description |
| •                            | -            | •                          | description |
| vssseg6e8.v                  | V            | Vd,0(s),tVm                |             |
| vssseg7e16.v                 | V            | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vssseg7e32.v<br>vssseg7e64.v | V            | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
|                              |              | Vd,0(s),tVm                |             |
| vssseg7e8.v                  | V            | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vssseg8e16.v                 | V            |                            |             |
| vssseg8e32.v<br>vssseg8e64.v | V            | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| 3                            | V            | Vd,0(s),tVm<br>Vd,0(s),tVm |             |
| vssseg8e8.v<br>vssub.vv      | V            | Vd,Vt,VsVm                 |             |
| vssub.vx                     | V            | Vd,Vt,sVm                  |             |
| vssubu.vv                    | V            |                            |             |
|                              | V            | Vd,Vt,VsVm                 |             |
| vsub.vv                      | V            | Vd,Vt,sVm<br>Vd,Vt,VsVm    |             |
|                              | V            | Vd,Vt,sVm                  |             |
| vsub.vx                      | V            | Vd,Vt,SVM<br>Vd,O(s),VtVm  |             |
| vsuxei16.v                   | V            | Vd,O(s),VtVm               |             |
| vsuxei32.v                   | V            |                            |             |
| vsuxei64.v                   |              | Vd,0(s),VtVm               |             |
| vsuxei8.v                    | V            | Vd,0(s),VtVm               |             |
| vsuxseg2ei16.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg2ei32.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg2ei64.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg2ei8.v                | V            | Vd,0(s),VtVm               |             |
| vsuxseg3ei16.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg3ei32.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg3ei64.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg3ei8.v                | V            | Vd,0(s),VtVm               |             |
| vsuxseg4ei16.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg4ei32.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg4ei64.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg4ei8.v                | V            | Vd,0(s),VtVm               |             |
| vsuxseg5ei16.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg5ei32.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg5ei64.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg5ei8.v                | V            | Vd,0(s),VtVm               |             |
| vsuxseg6ei16.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg6ei32.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg6ei64.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg6ei8.v                | V            | Vd,0(s),VtVm               |             |
| vsuxseg7ei16.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg7ei32.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg7ei64.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg7ei8.v                | V            | Vd,0(s),VtVm               |             |
| vsuxseg8ei16.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg8ei32.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg8ei64.v               | V            | Vd,0(s),VtVm               |             |
| vsuxseg8ei8.v                | V            | Vd,0(s),VtVm               |             |
| vt.maskc                     | ventana      | d,s,t                      |             |
|                              | condops      | 3 - +                      |             |
| vt.maskcn                    | ventana      | d,s,t                      |             |
| world w                      | condops<br>V | Vd V+ VaVm                 |             |
| vwadd.vv                     | V            | Vd,Vt,VsVm                 |             |
| vwadd.vx                     | V            | Vd,Vt,sVm                  |             |

| Instruction name         | Extens.     | Abstract                    | Very short  |
|--------------------------|-------------|-----------------------------|-------------|
| and syntax               | required    |                             | description |
| vwadd.wv                 | V           | Vd,Vt,VsVm                  | dobotipoton |
| vwadd.wx                 | V           | Vd,Vt,vSVm<br>Vd,Vt,sVm     |             |
| vwaddu.vv                | V           |                             |             |
| vwaddu.vx                | V           | Vd,Vt,VsVm<br>Vd,Vt,sVm     |             |
| vwaddu.vx                | V           |                             |             |
| vwaddu.wx                | V           | Vd,Vt,VsVm<br>Vd,Vt,sVm     |             |
| vwcvt.x.x.v              | V           | Vd,VtVm                     |             |
|                          | V           | Vd,VtVm<br>Vd,VtVm          |             |
| vwcvtu.x.x.v             | V           | Vd, VtVm<br>Vd, Vs, VtVm    |             |
| vwmacc.vv                | V           | Vd, vs, VtVm<br>Vd, s, VtVm |             |
| vwmacc.vx<br>vwmaccsu.vv | V           | Vd, Vs, VtVm                |             |
|                          | V           | Vd,vs,vtVm<br>Vd,s,VtVm     |             |
| vwmaccsu.vx              | V           | Vd,Vs,VtVm                  |             |
| vwmaccu.vx               | V           | Vd, vs, VtVm                |             |
| vwmaccus.vx              | V           | Vd,s,VtVm                   |             |
| vwmul.vv                 | V           | Vd, Vt, VsVm                |             |
| vwmul.vx                 | V           | Vd,Vt,vSVm<br>Vd,Vt,sVm     |             |
| vwmulsu.vv               | V           | Vd, Vt, VsVm                |             |
| vwmulsu.vx               | V           | Vd,Vt,vSVm<br>Vd,Vt,sVm     |             |
| vwmulu.vv                | V           | Vd, Vt, VsVm                |             |
| vwmulu.vx                | V           | Vd, Vt, vsvm<br>Vd, Vt, sVm |             |
| vwredsum.vs              | V           | Vd, Vt, VsVm                |             |
| vwredsumu.vs             | V           | Vd, Vt, VsVm                |             |
| vwsub.vv                 | V           | Vd, Vt, VsVm                |             |
| vwsub.vx                 | V           | Vd,Vt,vSVm<br>Vd,Vt,sVm     |             |
| vwsub.wv                 | V           | Vd, Vt, VsVm                |             |
| vwsub.wx                 | V           | Vd,Vt,sVm                   |             |
| vwsubu.vv                | V           | Vd, Vt, VsVm                |             |
|                          | V           |                             |             |
| vwsubu.vx                | V           | Vd,Vt,sVm                   |             |
| vwsubu.wv<br>vwsubu.wx   | V           | Vd,Vt,VsVm<br>Vd,Vt,sVm     |             |
|                          | V           |                             |             |
| vxor.vi                  | V           | Vd,Vt,ViVm<br>Vd,Vt,VsVm    |             |
| vxor.vv                  | V           |                             |             |
| vxor.vx<br>vzext.vf2     | V           | Vd,Vt,sVm<br>Vd,VtVm        |             |
|                          | V           | •                           |             |
| vzext.vf4                | V           | Vd,VtVm                     |             |
| vzext.vf8                | I           | Vd,VtVm                     |             |
| wfi                      |             |                             |             |
| wrs.nto                  | Zawrs       |                             |             |
| wrs.sto                  | Zawrs       |                             |             |
| xnor                     | Zbb   Zbkb  | d,s,t                       |             |
| xor                      | C           | Cs,Ct,Cw                    |             |
| xor                      | C           | Cs,Cw,Ct                    |             |
| xor                      | I           | d,s,j                       |             |
| xor                      | I           | d,s,t                       |             |
| xori                     | I<br>7hlere | d,s,j                       |             |
| xperm4                   | Zbkx        | d,s,t                       |             |
| xperm8                   | Zbkx        | d,s,t                       |             |
| zext.b                   | I           | d,s                         |             |
| zext.h                   | I<br>7hh    | d,s                         |             |
| zext.h                   | Zbb         | d,s                         |             |
| zext.h                   | Zbb         | d,s                         |             |
| zext.w                   | I           | d,s                         |             |

145

| Instruction name | Extens.  | Abstract   | Very short  |
|------------------|----------|------------|-------------|
| and syntax       | required | parameters | description |
| zext.w           | Zba      | d,s        |             |

### 1.20 Further reading

- 1. The latest release of the Instruction Set Architecture (ISA): github/riscv
- 2. Linux Standard Base Core Specification, Generic Part. This is the official documentation for the ELF file format, for the debug frame machinery, etc. Download it from linuxfoundation.org.
- 3. The MaskRay blog. This is a very interesting blog full of references to low level stuff both for ARM, RISCV and other stuff, even windows. See: maskray.me
- 4. Improving DWARF. This is a very good and very readable critique of DWARF tables, presenting a DWARF table verifier, and, in general, a new perspective in debug tables and stack unwinding. (Slides) inria/france
- 5. This is the research article for the above slides. acm.org
- 6. A complete description of riscv relocations: sifive-blog
- 7. The specifications of the "Zbb" (bit manipulation) extension. github/riscv/bitmanip
- 8. The official specifications for the assembler: github/riscv/asm
- 9. Specifications for the Thead processor. www.t-head.cn
- 10. Want to write an hypervisor? Here you will find all about it: H extension

### 1.21 Answers to all exercises

Exercise 1: The instruction bgt is an alias. How would you build it from the other instructions?

### Answer:

Just invert the arguments. bgt rs1,rs2,label  $\rightarrow$  blt rs2,rs1,label

Exercise 2: Write a small program that uses a conditional branch.

Answer:

```
.globl main
1
2
      main:
3
      addi sp,sp,-16
                         // Build a stack frame
      sd ra, 8(sp)
      sd s0,0(sp)
                         // t1 \leftarrow 1
      li t1,1
                         // t2 \leftarrow 2
      li t2,2
                         // is t1 bigger than t2 ?
      bgt t1,t2,.L1
8
                         // We did not branch. Load string address of LC1
      la aO,.LC1
9
      j .L2
                         // Branch to call instruction
10
      .L1:
11
      la a0,.LC2
                         // We did branch. Load LC2 string
12
      .L2:
13
      call printf
                         // Do the call
14
15
      ld ra,8(sp)
                         // Restore stack frame
16
      ld s0,(sp)
                         // Return
17
      jr ra
18
      .LC1:
      .string "Branch not taken\n"
19
      .p2align 2
20
      .LC2:
21
      .string "Branch taken\n"
  Executing:
      star64:~/tiny-asm$ ./asm -o bgt.o bgt.s
1
      star64:~/tiny-asm$ gcc bgt.o
2
      star64: ~/tiny-asm$ ./a.out
      Branch not taken
```

Exercise 3: Disassemble the program. What you see instead of bgt?

Answer:

```
14: 0063c863 blt t2,t1,24 <.L1>
```

The assembler changed source and destination, using blt.

Exercise 4: Change bgt into blt line 8. Does the output change?

Answer:

```
star64:~/tiny-asm$ ./a.out
Branch taken
start64:~/tiny-asm$
```

Exercise 5: How is the change achieved? Look at the source asm.c.

### Answer:

Looking at the opcode table we have:

```
{"bgt",0,INSN_CLASS_I,"t,s,p",MATCH_BLT,MASK_BLT,match_opcode,INSN_ALIAS|
INSN_CONDBRANCH}
{"blt",0,INSN_CLASS_I,"s,t,p",MATCH_BLT,MASK_BLT,match_opcode,INSN_CONDBRANCH}
```

We can see that in the case of bgt, the instruction is marked as an *alias*. We see also that the match and the mask are identical of bgt and blt. The essential difference is in the argument string: bgt has "t,s,p", and in the mask of blt we have "s,t,p".

## Exercise 6: Use the XOR instruction to invert all bits in an integer register Answer:

Since  $1 \oplus 1$  is zero, and  $0 \oplus 1$  is 1, it suffices to have a right hand side of all ones (the number -1) and we are all set.

The instruction not (invert all bits) is xor rs1,-1. You can see this in the opcode table:

It has the INSN\_ALIAS bit set, and the match is MATCH\_XORI.

Exercise 7: Write a program in assembler to print these 3 counters.

Answer:

```
.globl main
1
                             Use of this name allows us to use C runtime
      main:
2
      addi sp,sp,-16
                         Make room to establish a stack frame
3
      sd ra,8(sp)
                         Save return address
4
      sd s0,0(sp)
                         Save old stack frame
      addi s0,sp,16
                         Establish a new frame. This is not actually needed.
                         Read time into a1, that is the second argument
      rdtime a1
      lla a0,.LCO
                        to printf. The first is the LCO string
      call printf
                         Let printf do the job
      rdcycle a1
                         The same thing for the cycles. Use LC1.
10
      lla a0,.LC1
                         String into a0
11
12
      call printf
                         And the same for instructions returned. Use LC2
      rdinstret a1
13
      lla a0,.LC2
                         String into a0
14
      call printf
15
      ld ra,8(sp)
                         Restore return address
16
      ld s0,0(sp)
17
                         Restore old frame pointer
      jr ra
                         Return to the startup code
18
      .LCO:
19
      .string "Time=0x%x\n"
20
21
      .LC1:
      .string "Cycles=0x%x\n"
22
      .LC2:
23
      .string "Instructions executed=%d\n"
```

Executing this yields:

Time=133663617138 Cycles=164249353990 Instructions executed=85266785454

Exercise 8: Try to verify that time corresponds to a time measure

#### Answer:

One way to do that is to call our program, then do something, then call it again. This should be a measure of how much time this "do something" takes. If we repeat that, we should arrive at similar results.

We will use "uptime" a utility that prints the time since startup.

```
star64: ~/tiny-asm$ ./a.out;uptime;./a.out
Time=139156277825
Cycles=81886690065
Instructions executed=52342547651
15:22:18 up 9:39, 2 users, load average: 0.00, 0.00, 0.00
Time=139156334252
Cycles=81891385713
Instructions executed=52345512910
star64: ~/tiny-asm$ ./a.out;uptime;./a.out
Time=139235738528
Cycles=81923228282
Instructions executed=52376705932
15:22:38 up 9:40, 2 users, load average: 0.00, 0.00, 0.00
Time=139235795755
Cycles=81927968778
Instructions executed=52379666534
star64: ~/tiny-asm$
```

Exercise 9: Use the "max" instruction to calculate the absolute value of a signed integer.

Answer:

Use:

```
neg rd,rs1
max rd,rs1,rd
```

\$ gcc rcsr.o

Exercise 10: Write an assembler program to show the CSR flags Answer:

```
.globl main
1
      main:
      addi sp,sp,-16
                         Establish stack frame
      sd ra,8(sp)
                         Save return address
      sd s0,0(sp)
                         Save frame pointer
                         Read the control register into the fisrt arg (a1)
6
     frcsr a1
     lla a0,.LCO
                         Read the string into the first argument (a0)
     call printf
                         Call printf
8
     ld ra,8(sp)
                         Restore return address
9
     ld s0,0(sp)
                         Restore frame pointer
10
11
     add sp,sp,16
                         Destroy stack frame
     ret
                         Bye bye
12
13
      .LCO:
      .string "CSR= 0x%lx\n"
     This whole things makes just printf("CSR=0x%x\n",csr); But... wait, there is a bug.
  $ asm -o rcsr.o rcsr.s
```

```
$ ./a.out
CSR= 0x0
$ ???
```

Well, of course. There wasn't any motives to set any of those flags above, and the rounding mode is zero (RNE). To see that we are really reading the csr let's provoke a division by zero, so at least we have something in there. We add following lines:

```
sd s0,0(sp)
                          Save frame pointer
5
      li t1,12
                          Put 12 in register t1
6
      fcvt.d.l f20,t1
                          Convert it to 12.0 in register f20
      fcvt.d.l f21,x0
                          Put zero into register f21
      fdiv.d f10,f20,f21 Divide 12.0/0.0
10
                          Read the control register into the fisrt arg (a1)
11
      etc
                          Rest is the same
```

Now, it should show the Division by zero bit as ON.

```
$ asm -o rcsr.o rcsr.s
$ gcc rcsr.o
$ ./a.out
CSR= 0x8
$
```

That was it!

### Exercise 11: Write a subroutine that returns the flags of the CSR

Here it is not necessary to build a stack frame since we do not make any calls, and we do not use any local variables. We build our result in the established register for returning results (a0).

Exercise 12: Change the amound by amoswap. What are the values of t2, t3 and t4 after the operation?

### Answer:

t2 stays unchanged at 47, t3 contains 123, the value we stored at the bottom of the stack, and the bottom of the stack (t4) contains now 47.

### Exercise 13: $Calculate \ ceil(log2(x))$

### Answer:

The key here is to use the instruction th.ff1: it finds the first one bit starting at the most significant bit.

The instructions in lines 4 and 5 calculate 64 - ff1. An alternative would be to put 64 in some register and then subtract the result of the ff1 operation from it, but that would destroy one register and would have the same number of instructions.

Another alternative would be:

```
1 .globl ff1
2 ff1:
3 th.ff1 a0,a0
4 xori a0,a0,63
```

This is almost right, but calculates  $\lfloor log_2(x) \rfloor$ , so we have to add one to the result, what makes for the same number of instructions than the first one presented. This version is much less clear than the first one.

A more serious objection to our assembler versions is that our function returns 1 for an input of one, what is wrong. A serious version would test for that condition and branch accordingly.

# Exercise 14: Mismatch between source and disassembly. Explain Answer:

The explanation: actually, the instruction at address 4 is addi a0,0, what is actually a mov instruction. But the zero is not zero, since it is just a placeholder for a relocation. Looking at the relocations we see:

We see that a relocation points to address 4, indicating to put the lower 12 bits of the main address into an immediate and add them to a0.

| 11: 00                        | 11 110                           |
|-------------------------------|----------------------------------|
| addi, 80                      | blez, 110                        |
| adjust_reloc_syms, 49         | blt, 92                          |
| aes64ds, 105                  | bltu, 92                         |
| aes64dsm, 105                 | bltz, 110                        |
| aes64es, 105                  | bne, 92                          |
| aes64esm, 105                 | bnez, 110                        |
| aes64im, 105                  | bset, 94                         |
| aes64ks1i, 43, 104            |                                  |
| aes64ks2, 105                 | c.fsdsp, <b>3</b> 8              |
| alloc_cfi_insn_data, 76       | c.fswsp, 38                      |
| alloc_fde_entry, 72           | c.swsp, 38                       |
| amoadd, 103                   | call, 110                        |
| amoand, 103                   | CFA, 69                          |
| amomax, 103                   | CFI, 69                          |
| amomin, 103                   | cfi add CFA def cfa offset, 75   |
| amoor, 103                    | cfi add CFA remember state, 79   |
| amoswap, 102                  | cfi end fde, 78                  |
| amoxor, 103                   | cfi finish, 72                   |
| and, 92                       | cfi new fde, 72                  |
| andi, 92                      | chain frchains together, 48      |
| append fixed insn, 47         | cie entry, 72                    |
| append insn, 47               | clmul, 94                        |
| auipc, 87                     | clmulh, 94                       |
|                               | clmulr, 94                       |
| bclr, 93                      | clz, 94                          |
| bclri, 93                     | clzw, 94                         |
| beq, 92                       | $\cos$ , $53$                    |
| beqz, 110                     | cpop, 94                         |
| bext, 93                      | create obj attrs section, 49     |
| bexti, 93                     | ctz, 94                          |
| bfd bwrite, 50                | ctzw, 94                         |
| bfd_elf64_swap_reloca_out, 24 | cvt_frag_to_fill, 48             |
| bfd elf generic reloc, 26     |                                  |
| bge, 92                       | debug type, 63                   |
| bgeu, 92                      | directives                       |
| bgez, 110                     | align, 51                        |
| bgt, 110                      | .ascii, 52                       |
| bgtu, 110                     | asciiz, 52                       |
| bgtz, 110                     | $attach\_to\_group, \frac{56}{}$ |
| binv, 93                      | .bss, $53$                       |
| binvi, 93                     | .byte, <b>53</b>                 |
| bleu, 110                     | .comm, 56                        |
| ,                             | ,                                |

| •                               | 16 6 1 1 1 10                          |
|---------------------------------|----------------------------------------|
| .common, 56                     | elf_frob_symbol, 49                    |
| $\det_{0}, \frac{54}{2}$        | elf_make_empty_symbol, 19              |
| m .dc,~53                       | $elf\_obj\_sy, 19, 63$                 |
| $ m .dc.a,\ 53$                 | ${ m elf\_set\_section\_contents}, 50$ |
| .dc.b, 53                       | ENCODE macros, 31, 34, 38, 39          |
| $\operatorname{dc.l}$ , 53      | $\exp$ r, $54$                         |
| m .dc.w, 53                     | EXTRACT macros, 31, 33, 34, 38, 39     |
| $. { m dtpreldword}, { m 55}$   |                                        |
| . dtprelword, 55                | fabs.d, 110                            |
| . dword, 55                     | fabs.s, 110                            |
| equ, 55                         | fadd, 98                               |
| .equiv, 55                      | $fcvt.\{hsd\}.\{hsd\}, 100$            |
| .globl, 56, 80                  | fcvt.l.s, 100                          |
| .half, 54                       | fcvt.lu.s, 100                         |
| .hidden, 57                     | fcvt.s.l, 100                          |
| .ident, 57                      | fcvt.s.lu, 100                         |
| insn, 58                        | fcvt.s.w, 100                          |
| internal, 59                    | fcvt.s.wu, 100                         |
| .lcomm, 56                      | fcvt.w.s, 100                          |
| .loc, 59                        | fcvt.wu.s, 100                         |
| .local, 63                      | fdiv, 98                               |
| option, 64                      | fence, 110                             |
| org, 64                         | feq, 89, 101                           |
| 9.                              | fix new, 23                            |
| .p2align, 51                    | fix_new_exp, 23                        |
| .p2align1, 51                   | fix new internal, 23, 48               |
| .p2alignw, 51                   | fix segment, 49                        |
| protected, 65                   |                                        |
| reloc, 65                       | fixS, 23                               |
| .set, 55                        | fixup, 23                              |
| sleb128, 55, 68                 | fld, 98                                |
| .string, $52$                   | fle, 101                               |
| .string16, 52                   | flh, 98                                |
| $.string 32, \frac{52}{}$       | flt, 89, 101                           |
| .string64, 52                   | flw, 98                                |
| .string8, 52                    | fmadd, 99                              |
| . stringer, 52                  | fmax, 99                               |
| text, 66, 80                    | fmin, 99                               |
| .uleb $128, 55, 68$             | fmsub, 99                              |
| . word, $55$                    | fmul, 98                               |
| div, 90                         | fmv.d, 111                             |
| divu, 90                        | fmv.s, 111                             |
| $dot\_cfi, 75$                  | fmv.w.x, 100                           |
| dot_cfi_endproc, 78             | fmv.x.w, 100                           |
| dot cfi startproc, 75           | fneg, 111                              |
| dot symbol init, 14             | fneqz, 111                             |
| DW CFA def cfa offset, 75       | fnmadd, 99                             |
| dwarf2 directive filename, 63   | fnmsub, 99                             |
| /                               | frag                                   |
| eh begin, 14                    | definition, 22                         |
| elf begin, 14, 15               | finishing, 47                          |
| elf frob file, 49               | frags chained, 48                      |
| elf frob file after relocs, 50  | frchain now, 67                        |
| elf frob file before adjust, 49 | frflags, 96                            |
|                                 | <b>υ</b> ,                             |

| frrm, 96<br>fsd, 98               | maxu, 94<br>md_apply_fix, 23          |
|-----------------------------------|---------------------------------------|
| fsflags, 96                       | $md\_assemble, 45, 86$                |
| fsgnj, 99                         | $\mathrm{md\_begin},\ 14$             |
| fsh, 98                           | $merge\_data\_into\_text, 48$         |
| fsqrt, 99                         | min, 94                               |
| fsrm, 96                          | minu, 94                              |
| fsub, 88, 98                      | mul, 89                               |
| fsw, 98                           | mulh, 89                              |
| ,                                 | mulhsu, 89                            |
| gas init, 14, 22                  | mulhu, 89                             |
| generic set section contents, 50  | mulw, 89                              |
| get_absolute_expression, 63       | mv, 111                               |
| get symbol name, 63               | my getSmallExpression, 85             |
| 3 _ 1                             | my_geromantixpression, 80             |
| howto_table, 24                   | neg, 111                              |
| input line pointer, 52            | negw, 111                             |
| INRIA, 146                        | nop, 111                              |
| INSERT BITS, 37                   | not, 111                              |
| INSERT_OPERAND, 36                |                                       |
| INSN ALIAS, 41                    | $obj\_attach\_to\_group, \frac{56}{}$ |
| INSN_REIAS, 41<br>INSN_BRANCH, 41 | $\mathrm{obj\_elf\_bss}, 53$          |
|                                   | ${ m obj\_elf\_common,57}$            |
| INSN_CONDBRANCH, 41               | ${ m obj\_elf\_ident}, { m f 57}$     |
| INSN_DEREF, 41                    | obj elf local, 63                     |
| INSN_JSR, 41                      | obj elf section change hook, 53       |
| INSN_MACRO, 41                    | obj elf visibility, 57, 59, 65        |
| INSN_V_EEW764, 41                 | $\frac{5}{92}$                        |
| INSN_XX_BYTE, 41                  | orc.b, 94                             |
|                                   | ori, 92                               |
| j, 111                            | orn, 94                               |
| jal, 111                          | output sleb128, 68                    |
| jalr, 111                         | output uleb128, 68                    |
| jr, 111                           |                                       |
| la, 111                           | pack, 95                              |
| lb, 84                            | packh, 95                             |
| ld, 84                            | packw, 95                             |
| lh, 84                            | parse relocation, 85                  |
| lhu, 80                           | pause, 111                            |
| li, 80, 111                       | pcrel access, 87                      |
| lla, 111                          | pcrel load, 87                        |
| loc.options, 60                   | pcrel store, 87                       |
| local symbol, 21                  | perform an assembly pass, 14          |
| <del></del>                       | po hash, 51                           |
| local_symbol_make, 21             | pobegin, 51                           |
| LSB, 146                          | potable, 50, 56                       |
| lw, 84                            | potable, 50, 50                       |
| lwd, 107                          | rdcycle, 93                           |
| macro build, 87                   | rdinstret, 93                         |
| map over sections, 48             | rdtime, 93                            |
| MaskRay, 146                      | read a source file, 16, 50            |
| match never, 86                   | reg lookup, 36                        |
| <del>-</del>                      |                                       |
| $\max$ , 94                       | $relax\_seg, 48$                      |

| relax_segment, 48                | $ m sha256sum1,\ 105$              |
|----------------------------------|------------------------------------|
| rem, 90                          | sha512sig0, 105                    |
| remu, 90                         | sha512sig1, 105                    |
| remuw, 90                        | sha512sum1, 105                    |
| remw, 90                         | size seg, 48                       |
| resolve local symbol value, 49   | $sll, \overline{91}$               |
| resolve reloc expr symbols, 49   | slli, 80, 91, 95                   |
| resolve symbol value, 49         | sllw, 91                           |
| ret, 111                         | slti, 88                           |
|                                  |                                    |
| rev8, 94                         | sltiu, 89                          |
| riscv_check_mapping_symbols, 46  | sltz, 112                          |
| riscv_frag_align_code, 52        | sm3p0, 105                         |
| riscv_insn_types, 59             | sm3p1, 105                         |
| riscv_ip, 17, 36, 42, 44, 85, 98 | sm4ed, 106                         |
| riscv_mapping_state, 46          | sm4ks, 106                         |
| riscv_opcode, 40                 | sm4s, 43                           |
| riscv_pre_output_hook, 47        | $\operatorname{snez}$ , $112$      |
| riscv_rm, 43, 98                 | srai, 91                           |
| riscv segment info type, 46      | sraw, 91                           |
| riscv_set_abi_by_arch, 46        | srl, 91                            |
| riscv set default priv spec, 46  | srli, 80, 91                       |
| $rol, 9\overline{4}$             | srlw, 91                           |
| rolw, 94                         | start assemble, 45                 |
| ror, 95                          | stdoutput, 17                      |
| rori, 95                         | stringer, 52                       |
| rvb, 95                          | sub, 88                            |
| 110, 00                          | subsections, 27                    |
| s align, 52                      | subsettions, 27 subsefg change, 67 |
| s comm internal, 57              | - <del>-</del> -                   |
| s data, 54                       | subseg_set, 66                     |
| s_riscv_insn, 59                 | subsegs_finish, 47                 |
| s riscv options, 64              | subw, 88                           |
| s set, 55                        | symbol_append, 20                  |
| <del>_</del>                     | symbol_begin, 22                   |
| sd, 80                           | symbol_create, 20                  |
| section, 27                      | symbol_find, 22                    |
| seg_info, 67                     | symbol_find_or_make, 20            |
| segment_info, 46                 | symbol_flags, 20                   |
| seqz, 111                        | symbol_get_obj, 63                 |
| set_section_contents, 50         | symbol_make, 19                    |
| set_symtab, 49                   | symbol_mark_used_in_reloc, 21      |
| sext.b, 95, 111                  | symbol_new, 20                     |
| sext.h, 95, 111                  | $symbol\_table\_insert, 22$        |
| sext.w, 111                      | $symbol\_used\_in\_reloc, 21$      |
| sgtz, 111                        | symbolS, 19                        |
| sh, 80                           |                                    |
| sh1add, 95                       | tail, 112                          |
| $sh1add_uw, 95$                  | ${ m th}$                          |
| $\operatorname{sh2add}, 95$      | addsl, 106                         |
| sh2add uw, 95                    | ext, 106, 129                      |
| sh3add uw, 95                    | ff0, 107                           |
| sha256sig0, 105                  | ff1, 107                           |
| sha256sig1, 105                  | lbia, 107, 108                     |
| sha256sum0, 105                  | lbib, 107, 108                     |
|                                  | 1515, 151, 150                     |

| lbuia, 107, 108     | tiny-asm, 10                         |
|---------------------|--------------------------------------|
| lbuib, 108          | $\operatorname{used\_in\_reloc}, 21$ |
| 1dd, 108            |                                      |
| ldia, 107, 108      | $\mathrm{write\_contents},50$        |
| ldib, 107, 108      | $\mathrm{write\_object\_file},47$    |
| lhia, 107, 108      | $\mathrm{write\_relocs},50$          |
| lhib, 107, 108      | ~~                                   |
| lhuia, 107, 108     | XCNEW, 76                            |
| lhuib, 107, 108     | $\mathrm{xnor},95$                   |
| lrb, 108, 109       | xor, 92                              |
| lrd, 109            | xori, 92                             |
| lrh, 109            | xperm.b, 95                          |
| lrhu, 109           | xperm.n, 96                          |
| lrw, 109            | xperm4, 106                          |
| lrwu, 109           | xperm8, 106                          |
| lurb, 109           | xsymbol, 19                          |
| lurbu, 109          |                                      |
| lurd, 109           | $Zbb, \frac{93}{}$                   |
| lurh, 109           | zero_address_frag, 19, 62            |
|                     | zext.b, 112                          |
| lurhu, 109          | zext.h, 96, 112                      |
| lurw, 109           | zext.w, 112                          |
| lurwu, 109          |                                      |
| lwd, 107            |                                      |
| lwia, 107           |                                      |
| lwib, 107           |                                      |
| lwuia, 107          |                                      |
| lwuib, 107          |                                      |
| mula, 109           |                                      |
| mulah, 109          |                                      |
| mulaw, 109          |                                      |
| mulsw, 109          |                                      |
| mveqz, 109          |                                      |
| mvneqz, 109         |                                      |
| rev, 107            |                                      |
| revw, 108           |                                      |
| srri, 110           |                                      |
| srriw, 110          |                                      |
| tst, 108            |                                      |
| tstnbz, 108         |                                      |
| Thead, $89, 106$    |                                      |
| m thu               |                                      |
| extu, 106           |                                      |
|                     |                                      |
| $\mathrm{URL}$      |                                      |
| Alex. Oliva, 48     |                                      |
| as-docs, 10         |                                      |
| binutils-docs, $64$ |                                      |
| DWARF, $10$         |                                      |
| ELF, 10             |                                      |
| pine64, 10          |                                      |
| riscv-specs, $10$   |                                      |
| Sifive, 11          |                                      |
| sipeed, 11          |                                      |
|                     |                                      |