## Legend Of Bluespec

kr469

October 2021

## Contents

| 1 | Intr | Introduction 3                                                    |  |  |  |  |  |
|---|------|-------------------------------------------------------------------|--|--|--|--|--|
|   | 1.1  | Motivation                                                        |  |  |  |  |  |
|   | 1.2  | Already existing tools                                            |  |  |  |  |  |
|   | 1.3  | Proposed solution                                                 |  |  |  |  |  |
| 2 | Pre  | paration                                                          |  |  |  |  |  |
| _ | 2.1  | Understanding Bluespec                                            |  |  |  |  |  |
|   | 2.1  | 2.1.1 Rules                                                       |  |  |  |  |  |
|   |      | 2.1.2 Modules and interfaces                                      |  |  |  |  |  |
|   |      | 2.1.3 Typeclasses                                                 |  |  |  |  |  |
|   |      | 2.1.4 Provisos                                                    |  |  |  |  |  |
|   | 2.2  | Choosing tools                                                    |  |  |  |  |  |
|   | 2.2  | 2.2.1 Haskell                                                     |  |  |  |  |  |
|   |      | 2.2.2 Tcl (pronounced "tickle")                                   |  |  |  |  |  |
|   |      |                                                                   |  |  |  |  |  |
|   |      | V                                                                 |  |  |  |  |  |
|   | 2.3  |                                                                   |  |  |  |  |  |
|   | 2.3  | Creating a grammar                                                |  |  |  |  |  |
|   |      | 2.3.2 Where to find this grammar?                                 |  |  |  |  |  |
|   |      | 2.3.3 Technical aspects                                           |  |  |  |  |  |
|   |      | 2.5.5 Technical aspects                                           |  |  |  |  |  |
| 3 | Imp  | plementation 10                                                   |  |  |  |  |  |
|   | 3.1  | Crawling the packages                                             |  |  |  |  |  |
|   | 3.2  | Loading Bluespec packages' information (TODO check grammar of s') |  |  |  |  |  |
|   |      | 3.2.1 Lexical and syntax analysis                                 |  |  |  |  |  |
|   |      | 3.2.2 Understanding abstract syntax tree (AST)                    |  |  |  |  |  |
|   |      | 3.2.3 Organizing data and functionality                           |  |  |  |  |  |
|   | 3.3  | Building the top level module                                     |  |  |  |  |  |
|   |      | 3.3.1 Resolving types                                             |  |  |  |  |  |
|   | 3.4  | Synthesizing                                                      |  |  |  |  |  |
|   |      | 3.4.1 GUI                                                         |  |  |  |  |  |
|   | _    |                                                                   |  |  |  |  |  |
| 4 |      | lluation 19                                                       |  |  |  |  |  |
|   | 4.1  | Overview                                                          |  |  |  |  |  |
|   |      | 4.1.1 Polymorphism disclaimer                                     |  |  |  |  |  |
|   | 4.2  | Example 1 - Connecting FIFOs                                      |  |  |  |  |  |
|   |      | 4.2.1 Platform Designer                                           |  |  |  |  |  |
|   |      | 4.2.2 My tool                                                     |  |  |  |  |  |
|   |      | 4.2.3 Using text format to do same things and Qualitative metrics |  |  |  |  |  |
|   | 4.3  | Example 2 - Simple System-on-Chip                                 |  |  |  |  |  |
|   |      | 4.3.1 Platform Designer                                           |  |  |  |  |  |
|   |      | 4.3.2 My tool                                                     |  |  |  |  |  |
|   |      | 4.3.3 Other differences 24                                        |  |  |  |  |  |

| 4.4 | Example 3 - | Working AXI4 masters and slaves |  | 25 |
|-----|-------------|---------------------------------|--|----|
|-----|-------------|---------------------------------|--|----|

## Chapter 1

## Introduction

Making hardware is hard, and similarly to other problems in computer science, we make it easier by adding layers of abstraction. There are many layers involved in making hardware from that abstract transistors to gates to logical operations to functions to modules to chips to multi-chip modules to whole devices. The layer this project focuses on is going from modules to chips (TODO: this probably need renaming, add graph showing layers). At this layer we already have code for modules like memory, CPU-cores, interconnects and so on. Now what we need is an ability to compose them in such a way that they create a whole device.

### 1.1 Motivation

#### Vocabulary

- Verilog Hardware description language
- Bluespec Hardware description language
- Intel Quartus Prime (IQP) programmable logic device design software produced by Intel. (source Wikipedia)
- Platform Designer Tool inside IQP used to connect high level components.

Verilog is an old language with minimal type system. This forces people to encode more complex interfaces using finicky naming conventions. To simplify process of connecting components with many wires, people tend to use high level tools like Platform Designer, that have built in logic allowing for easy connection of components. This approach is not perfect and as we will see during evaluation, a better option might be to move away from Verilog and to use more modern language like Bluespec. The problem with that is that tools like Intel Quartus Prime(IQP) don't have native support for Bluespec, and it forces Bluespec users to work with transpiled Verilog (from Bluespec) code. Unfortunately, because typing richness of Bluespec(comparable to Haskell), arbitrary code cannot be transpiled, and before code can be transpiled it needs to be stripped of any arguments or other kinds of polymorphism. Process of transpiration also destroys high level interfaces, and they must be recreated manually after importing Verilog code into IQP.

## 1.2 Already existing tools

Creating custom hardware has got expensive in last few decades both in therms of money but also in terms of man-hours needed to create it. This creates barriers to entry for new players, and also reinforces strong positions of already adopted standards. This also means that there isn't as large open source community as around hardware and therefore one cannot simply use plugin for Bluespec of Bluespec native high level integration tool because there are none.

My project will try to tackle a subset of functionality provided by a tool called Platform Designer that is a part of Intel Quartus Prime package. Platform Designer is a GUI tool for high-level integration, designs generated by it are kept in verbose text file format. Unfortunately, those files have a tendency to be megabytes long, making them hard to read and edit, by humans. There are other problems with using Platform Designer, like cumbersomeness of importing new components.

Market share and justification for focusing entirely on comparisons with Intel Quartus Prime

It's a bit difficult to accurately judge market share of IQP, but It is produced by one of the biggest designer and manufacturer in the world, and it's also used by Qualcomm according to HG Insights. Using Google trends we can see that while in recent years(since 2016) competitor Xilinx Vivado has overtaken, IQP in interest at the global scale, in most developed countries like US, Europe, and parts of Asia. There is roughly 50/50 split between IQP and Vivado. My personal observations suggest that Xilinx Vivado became highly popular in India which is a large country with history of chip design, and this might be strongly skewing data in favor of Xilinx's product. Another argument to compare against IQP is that it was used last year as a part of the course, and it can't be irrelevant if it was taught at Cambridge. Therefore, it's fair to not investigate Xilinx's software during evaluation(TODO: check if this has changed).

### 1.3 Proposed solution

During my project I created an alternative JSON based file format that is much simpler and human-readable, that can be interpreted using my tool to produce Bluespec code of a top level module. This file format is rather unimpressive on its own, and one might argue that it would be better to use Bluespec code instead of it. Interesting part of this project lie in the backend that is able to verify correctness of this file and produce set of typing information about produced Bluespec code. But to truly appreciate work I have done during this project I also implemented GUI, which uses same backend as one used to interpret JSON files, and it is able to supply user with typing information and possible choices when connecting components. One can think of it as a Bluespec native high level integration tool that uses simple language server in the backend. This way Bluespec user can do high level integration in the realm of not transpiled Bluespec. Then after satisfactory top level module is created, it can be transpiled to Verilog and then imported into IQP.

## Chapter 2

## Preparation

### 2.1 Understanding Bluespec

Before I started this project my only knowledge about Bluespec was from Cambridge Bluespec Tutor which covers basics, and It was completed by me few weeks prior to the start of this project. For the purposes of understanding this project I've learned about most of the typing system of Bluespec, but for You the reader most of it is not relevant. Therefore, I prepared a short tour of things that will be important to understand in this project.

#### 2.1.1 Rules

In Bluespec all computation is done in form of rules. Each cycle we will take a subset of all rules that we are going to execute in this cycle, rule is fired (executed) in cycle only if it's ready (or will be ready) and it's not conflicting with other rules (If this happens compiler must issue a warning, and picks arbitrary rule to fire from subset of conflicting rules). Each rule can fire at most one time per cycle. For rule to be ready to fire it needs it's implicit and explicit conditions to be true. Rule can be fired in a cycle even if it's not ready at the start of cycle, for example if you add item on an empty queue and then pop can happen in same cycle.



Figure 1: A general scheme for mapping an N-rule system into clocked synchronous hardware.

(TODO: add reference to the bsv reference document from which this image was taken)

TODO: piece of Bluespec with module using fifo and other rule explaining types of conditions.

#### 2.1.2 Modules and interfaces

Module is unit of organization of logic. Modules don't have types, instead they implement some interface. This means that you can have multiple modules with different behaviour that implement the same interface. It might be helpful to think about similarly to different physical connectors(like USB-C, PCI-e). Interfaces are made up out of two types of things:

- Methods that allow for interaction with the module, they come with few variants, but this is not important for now.
- Subinterfaces That allow for more generalization, for example if you want to implement module that contains two copies of the same interface, you can just create a new interface that contains two instances of that subinterface and assign them accordingly.

#### 2.1.3 Typeclasses

Typesclasses in Bluespec are used to group types for which specific functions or modules are implemented. Typeclass instance is (TODO describe what is it) and it can have provisos attached to it.

#### 2.1.4 Provisos

On big feature of Bluespec typing is support for provisos. They create constraints on types that can be used with given function and give information about expected type of output of the function. For example, we can have a polymorphic function extend of a type  $Bit\#(a) \to Bit\#(b)$  where Bit#(x) is a vector of x bits, such function accepts any length and always return vector of length b = a + 1. To make sure this relations stays like this we would attach a proviso Add#(a,1,b) which means that a+1 must be equal to b.

Mentioned above Add is one of Size Relationship provisos, other such provisos are Mul Div Max Min Log. Those provisos can't be nested, but we still can express complex arithmetic constraints, thanks to size relationship type functions. They essentially represent result of the operation on values on that were supplied. There are only 8 of them, and they are TAdd TSub TDiv TMul TMax TMin TLog TExp. To better explain this let's look at some simple example:

Add#(TExp#(a),TDiv#(a,8),c) is a proviso that says that following equation on parameters must be satisfied  $2^a + \lceil \frac{a}{8} \rceil = c$  otherwise it will mean that supplied parameters are not valid. Those equations can be also solved to deduce unknown parameters.

There is also other type of provisos that are typeclass based. Typeclass based provisos, allow us to either to ensure certain functions are defined for given type or to deuce something. For example Bits#(a,b) means that type a must be convertible to vector of b bits. This can be used to get the length in bits of given objects, on which we can later put constraints using Size Relationship provisos.

In my project I will often check if there exists instance of the Connectable#(a,b) typeclass to check if function mkConnection(used to connect two interfaces together) is defined for given types a and b.

This can be a bit confusing, but if one look closely this is very similar to Prolog. We have variables that are effectively the same as in Prolog, we have values that are integers, and we have parametric types /functions that are like Prolog's compound terms. Typeclass are like groups of rules with same name and provisos attached to Typeclass instances are like predicates. One big difference is that provisos can be solved in arbitrary order, which means that unlike in Prolog we need ability to solve sets of equations on integers.

## 2.2 Choosing tools

I had effectively 3 choices for a language to this project in. Here is some justification why I have chosen Python.

#### 2.2.1 Haskell

Compiler of Bluespec is written in Haskell. Therefore there are seweral reason why Haskell would be right choice. Here are some of them:

- I can reuse already written logic.
- I won't diverge in terms of logic from Bluespec compiler.
- I won't need to write grammar to load packages.

But there are also few very good reasons why I didn't used Haskell.

- I never used Haskell before, and my experience with functional languages equal to what was shown in the foundations of the computer science course.
- I never used Bluespec before, and my experience is limited to completing Bluespec tutor provided by computer science department.
- Even if I knew Haskell and Bluespec, industrial compilers are rather large and complicated. I also received advice to avoid modifying / extending them as a part of the project.

While each of those reasons on its own is something that can be managed. All of them combined would create unacceptable amount of risk of not delivering the project. There was also a risk that use of Haskell wouldn't create a good story for the project. (This is quite subtle).

#### 2.2.2 Tcl (pronounced "tickle")

This language is used as a scripting language in both Intel and Xilinx tools, While I will be reading packages using Tcl scripts provided by the creators of the Bluespec compiler (BSC), my understanding is that those scripts are just handy wrappers for some Haskell functions. This is also foreign language to me with minimal presence online, and negligible learning resources, making it difficult to learn. It's also quite obsolete and there is not much tooling for it making it undesirable as a time investment (As an example of desirable to learn technology. I learned for this project JavaScript, just to create a GUI).

#### **2.2.3** Python

Firstly this is a language I have experience working with, secondly it's widely supported, and there is extensive tooling for it. It's flexible typing system allows for rapid experimenting, which was especially handy when I wasn't sure how structure of the data would look like. It's also worth mentioning that it has support for typing and at some point I decided to started adding typing hints around. My rule of thumb that I used was that if Pylance (python language server) can't deduce type of something I need more hints. Other reasons for using python are:

- I will have many interesting problems to solve. Like creating and interpreting grammar.
- Code written in it can be used by large community of python developers.
- I can use many tools like, Lark (for grammar), Django (as a backend server), SymPy (to solve size relationship provisos) that will speed up the process.

The two arguments against using python are:

- Haskell would be better suited for this task. I explained above why it doesn't make sense for me to use Haskell.
- Python is usually slow. Fortunately this isn't a big issue thanks to cashing and use of PyPy (implementation of Python using JIT compiler).

#### 2.2.4 Postmortem

Looking at those choices now(after finishing project and knowing all encountered difficulties) I think a could also consider C#, due to its speed and similar typing flexibility to Python. I also think Haskell would be a better choice if this was more like my pet project on which I would work for few years with the intention of making it commercially viable product.

### 2.3 Creating a grammar

This section could be moved into implementation.

#### Bluetcl

Bluetcl is a tool written in Tcl language that allows for inspection of Bluespec packages. It's a first party tool that included with Bluespec compiler. There isn't much documentation about it, but from my understanding it's a wrapper around Haskell code. From the user perspective it's just a bash script that starts Tcl terminal compiled with some libraries. From my perspective I use it as an arbitrary command line program, as this method of using is intended by the authors and contains some documentation.

#### 2.3.1 What grammar is needed?

Because I'm using Bluetcl as a command line tool, communication with it is done via text. While synthesizing commands is easy, parsing outputs is a bit more difficult, and I decided to do it in systematic way. To do this I need grammar of all possible outputs that I'm interested in. Bluetcl produces range of outputs but two of them that are particularly useful are descriptions of functions and descriptions of types.

### 2.3.2 Where to find this grammar?

Unfortunately this grammar is not documented anywhere, so I had to reverse engineer it. This approach might not be perfect and might not cover every input, but It's good enough to parse every standard package and every package used to compile Flute(RISC-V CPU with 5 stage pipeline). Here are other reasons to justify this approach:

- Heaps' law suggests that number of unique words in given body of text is proportional to roughly square root of number of words in the text. I think it's fair to assume that something similar will be true if we consider number of unique grammar rules.
- This grammar while different from grammar of Bluespec language maps subset of Bluespec grammar, so we can supplement our deductions with reference guide for Bluespec language. Therefore, accounting for cases we haven't seen, but are possible.
- There is also some limit on how exotic things can get in top modules. (TODO rephrase this)

#### 2.3.3 Technical aspects

This is EBNF grammar, I parse it using Lark library for Python, and I'm using Earley parser, as it is capable of arbitrary length lookahead. Grammar I created contains roughly 90 rules, and I won't include all of them here, but I will show few examples to give a feel of what is happening. (TODO: Think about dumping them into an apendix)

A useful feature supported by Lark is ability to have regular expressions in the grammar, I'm mentioning this as it is effectively having parser inside a parser. It's also worth mentioning that there exists a handy tool for debugging and creating grammar. It's located at this website: https://www.lark-parser.org/ide/, it can run parser online, and show outputed AST tree.

#### Parsing position TODO maybe find a better example with shorter line

Here is an example of parsing small section of output that describes location of piece of code that is being described. I decided to show it as it's simple and shows different features of that I needed to use.

```
----- Rules used -----
identifier_u: /[A-Z][\w$_']*/
tcl_position: "{" "position" "{" tcl_path NUMBER NUMBER
 ["{" "Library" identifier_u "}" ]"}""}"
tcl_path: ["\%/"] /((\.{1,2})|(\w|-|\s)+)/
["/" /((\.\.)|(\\\\\|-|\\\\s)+))/]* "." /\\\\\\\\\
----- Text to parse -----
{position {%/Libraries/Connectable.bs 25 1 {Library Connectable}}}
  ▼ tcl_position
    ▼ tcl_path
          Libraries
          Connectable
          bs
       25
       1
    ▼ identifier_u
          Connectable
```

## Chapter 3

## **Implementation**

### 3.1 Crawling the packages

To interact with Bluetcl I have written a script using Pexpect library. This script works by creating subprocess of Bluetcl, and exposes functions that allow for performing of quarries. Core of this script is a function called call that takes as input a string that is a command and returns output of stripped of warnings or raises an exception if error occurred (for example in case where package was not found). To remove warnings I make some assumptions.

- I only use fixed set of commands(like list packages, describe a type).
- For those commands important output about is always on the last line. (this was checked empirically)
- Output of a command is always followed by % a character that never occurs in the rest of the output and marks the end of the output.

Thanks to them, I don't need sophisticated system for parsing error, and or warnings. I can just make a quick search for error, and if needed forward this to user.

In total this script allows me to:

- Initialize subprocess
- Add folder to search path of Bluetcl
- Load package (Bluetcl takes care of finding and loading dependencies)
- Get list of loaded packages
- List functions in package
- List types in package
- Get information about types and function in the package

# 3.2 Loading Bluespec packages' information (TODO check grammar of s')

As mentioned earlier output of bluetcl is text, and I need it in some better organized form. In this section I describe process of doing this.

#### 3.2.1 Lexical and syntax analysis

I decided to use Lark library for lexical and syntax analysis. This library takes grammar and string of text as input and returns an abstract syntax tree. The only difficult part here is creating grammar, but we have taken care of it during preparation. (TODO improve this last sentence)

### 3.2.2 Understanding abstract syntax tree (AST)

Given AST I needed to turn it into a more useful form. To do this I used Transformer class provided by Lark. The way it worked is that I created subclass of Transformer that implemented a function for each non-terminal / rule(Each non-terminal can have multiple rules and Lark allows for renaming individual of rules, this for example allows parsing different non-terminals using same function or parsing single non-terminal using different function depending on rule that was used). Transformer will then perform bottom up transformation of AST, and for each rule it will call respective member function, with parsed children(non-terminals used by rule) as arguments. At the end of the transforming process I have a list of objects that represented data from Blutecl.

One of the bigger inconveniences I encounter is the fact that Lark passes parsed children, as a simple list. This doesn't provide information about what exactly each child is, this isn't a problem if rule contains fixed number of non-terminals, but otherwise I employ mix of length based inference, and wrapping of data from children into tuples, to provide remove ambiguity.

What was quite useful at this stage is Python's type system flexibility, I was able to gradually build up Transformer class and add typing information only after I had firm understand what type something needs to be. For some things it wasn't immediately obvious, what is the best way to represent them, and what often happen is I would first leave them as lists of unparsed things. Then after I got to the point where I needed to use them I would decide what is an optimal way to represent them.

#### 3.2.3 Organizing data and functionality

I created a class called TypeDatabase that provides simple functions that manage loading and parsing packages information. It is also capable of loading and storing it's state to pickle file. This saves a lot of time as all this parsing and communication with bluetcl isn't particularly fast. This class is also a home to set of functionality used to resolve types. From my testing it takes about 370s to load roughly 250 packages that I found included with Bluespec compiler and created to compile Flute CPU. Loading those packages from cached files takes only few seconds.

## 3.3 Building the top level module

Building the top level module can be broken down into few operations.

- Instantiating new modules.
- Connecting modules using mkConnection function.
- Connecting modules using busses.
- Synthesizing Bluespec.
- Automated compilation and run of simulation.

Those operations if implemented naively would be almost as simple as formatting a string. But If this was the only thing that this project provided I wouldn't need to go through all the trouble of parsing Bluespec grammar, and generally users would be better off just writing Bluespec code. My unique value proposition is running something similar to a language server, that will be capable of dynamically resolving types, and aiding user in the design process by providing possible connections, informing whether values passed to initialize module are correct, and informing about subinterfaces that can be found in the instantiated modules. At the basis of all those features will be type resolution, and I will first explain how it works.

#### 3.3.1 Resolving types

This section assumes that one have read about typeclasses and provisos from previous chapter. In my code there are 4 main functions that combined allow for type resolution:

- merge function takes two type identifiers and returns map (dictionary) from names of variables to theirs values. It's similar to Prolog's unification operator.
- applyVariables function takes a dictionary and a type and returns a type with all variables replaced by their values. This is more of a utility function.
- resolveTypeclass function that takes a type identifier that is an instance of a typeclass and returns dictionary of resolved variables. In Prolog this is like looking which rule can be applied and returning result of applying that rule.
- solveProvisos function that takes a dictionary and a list of provisos and returns a dictionary with variables. This is like checking predicates in Prolog.

Those functions can also fail with exception if something is impossible / wrong data is provided.

Type identifier is a tree where every non leaf node is another type identifier, and leaf nodes are either variables or values(strings, ints). Example type identifier Foo#(a,42,"file.hex") is a type with name Foo and three arguments a (variable) and 42 (int value) and "file.hex" (string value). Type identifiers can also describe functions, and those just need to run unification on both result and arguments of a function.

#### Merge function

This function works like a two synchronized dfs's, that go through the trees (type identifiers) and apply logic based on nodes that are beginning currently visited. This function needs to account for quite a lot of special cases here are some of them:

- If one of the nodes is a variable, and other is a value or another type identifier, then we can add such pair to the dictionary of resolved variables.
- If both nodes, are type identifiers, we run merge recursively on pairs of theirs children. We also check if type identifiers share the same name.
- If one of the nodes is a value, then other node must be either a value with same value, or a variable.

The most complicated case is when both nodes are variables. There is a breakdown what happens then.

- Case 1 Both variables have a value, so I need to check if value is the same.
- Case 2 One of the variables has a value, and other variable doesn't value assigned. In this case I can just copy reference to a value to the other variable.
- Case 3 Neither of the variables has a value. In this case I create a new dummy object and give reference to it to both of the variables. Using references makes it so resolving value of one of the variables will resolve all of them.

If done carefully, this is quite efficient as we don't need to manage propagation of values of variables, everything is done by just writing to shared object.

#### Apply variables function

This function is quite simple, it just traverses a type identifier and replaces all variables with their values.

#### Resolve typeclass function

Earlier, I compared typeclass to groups of Prolog rules with the same name, one thing I didn't mention that typeclasses can also contain something called dependencies, they are something like marking inputs and outputs in Prolog. Those dependencies are used in typeclasses like Bits#(a,b) which are used to determine something, and they make sure that resolution of a typeclass is always unambiguous.

For example, Bits#(a,b) has a dependency a determines b which means that you can use it to determine b from a, but if you know only b then you can't use it to determine a because there may be multiple types that can be packed to the bit vector of the same length b. (TODO check use of 'the' in last sentence)

To check if proposed instance of a typeclass is a valid instance we perform following algorithm:

- 1. Check if a proposed instance of a typeclass is an instance of a known typeclass, otherwise fail.
- 2. If dependencies for a typeclass are defined check if information provided within proposed instance is enough to satisfy one of them.
- 3. Iterate over all templates of an instance of a typeclass, then attempt to merge function between a template and the proposed instance.
- 4. If merging failed, jump back to step 3 and continue iteration.
- 5. If merging succeeded, attempt to solve provisos.
- 6. If provisos were successfully solved, return the dictionary of resolved variables, otherwise continue iteration.
- 7. If all iterations failed, fail.

This procedure is a bit slow, so I speed it up by creating look up dictionaries that take as a key a dictionary and as a value a typeclass instance. This method can cut down search time significantly, but there are still rules that use polymorphic functions and those can't be preprocessed away so easily, therefore I call them universal instances and search through them every time. While I could invent many heuristics to speed up this process in general this is still as difficult as halting problem.

#### Solve provisos function

As established earlier, provisos can be of two types, one are based on typeclasses and those can be resolved using above described function, and the other ones are size based and require solving sets of equations on integers. To do this I use library called SymPy. SymPy is a symbolic math library that can be used to solve equations on integers. All I need to do is convert set of provisos into set of equations, and then extract data and put it back into dictionary used to store variables.

One big problem is that order in which provisos must be solved is not defined, so I handle this by creating set of provisos to be solved. Then I repeatedly iterate over this set and remove solved provisos. If set didn't change after iteration, and it's not empty this means not further progress can be made and raise an exception.

#### How those functions can are used

Thanks to those functions, I can tell user what values can be put into each argument in a function. I can check if arguments passed to a function satisfy provisos when combined. I can give user a list of all possible connection that can be made.

## 3.4 Synthesizing

The main goal of the project is to be able to take a JSON with some data and synthesize a top module in the Bluespec language from it. This goal is more like a justification to create a language server like API. With this approach I have some simple core goal, but in the later sections we will see how thanks to all the work I have done, this API can be used to support GUI that aids the user in creating a top module. Therefore, I will focus in this section on how the API works, and only later how is it used when reading JSON or as a backend to the GUI based app.

#### Top level module object

Let's start with quick overview of the main functions found in API.

- addModule as the name suggest this function adds an instance of a module, or (TODO how result of a function is called)
- addConnection This creates a connection between two interfaces or members of interfaces between previously instantiated things. Essentially a wrapper on a addModule function.

- addBus given list of slaves(with ranges on which they are accessible) and masters it adds a bus with those interfaces connected.
- to\_string that returns a string representing .bsv file with top module as defined by previous calls.

#### Adding modules (introduction)

Adding an instance of a module in Bluespec is done by calling function used to instantiatie chosen module, and potentially assigning it to a variable of a type of interface produced by a function. It's worth keeping in mind that not all such functions can create new modules (those who do are called modules and one that don't are called functions in Bluespec), for example fromAXI4LiteToAXI4\_Master is a function that will take already existing instance with AXI4\_Master interface and return AXI4Lite\_Master interface, and all of this will be done by just assigning functions and subinterfaces found in one to the other one. Fortunately, there is no need for me to treat modules different from the functions. The only thing that will be treated differently are busses and connections, as they are functions like described above but theirs result interface is Empty(similar to void in C++), so there is no need to assign them to a variable.

One more decision I had to make was whether I want to support let as a type. let keyword is used to omit writing type of variable explicitly. This makes it so compiler has to deduce it from the output of a function defining such variable. While Bluespec is strongly typed language it's not always possible to use let. For example, in cases where you create a module with polymorphic interface, there might not be enough information to deduce the full type of the interface. What's even more confusing is that sometimes you only need to provide some subset of variables in the type of interface, and the rest can be deduced using provisos. (TODO check this) Bluespec doesn't support something like this, but I do. On the other hand I don't support let keyword and I will always write out full type explicitly, this makes it harder to modify the code, as some changes might require more changes down the line, but this produces more readable code, also making such changes is still quick if done before code generation.

#### Adding modules (implementation)

Here is a procedure of instantiating a module:

- 1. Check if name string is a valid name and starts with lowercase.
- 2. Create an instance object.
- 3. Call update function of that object, that will verify correctness of the arguments and it will update global state. In more detail update function does following:
  - (a) Parse user given strings that represent arguments. This includes
  - (b) Run a unification (merge function described earlier) to check if types of arguments are correct.
  - (c) Run a unification on between user defined resulting type of interface and the type of interface given by the function. This produces set of context variables.
  - (d) Attempt to solve provisos attached to the function, with the context variables, and populate the context with newly learned variables.
  - (e) Apply variables to the type of interface, filling out unknows.
  - (f) Populate type of the instance with member subinterfaces.
  - (g) Add interface, and it's member subinterfaces to a global dictionary of accessible names. During this addition we also propagate updates, if for example

#### Adding connections

With creating instances explained above, adding connections become trivial. We just create and instance of mkConnection and ad it into list of known connections if we ever need to remove it.

#### Adding busses

Similarly to adding connections a lot of complexity is handled by logic to create instances, as bus is just a function that takes vectors of masters and slaves and routing function that specifies how addresses map to slaves. Two differences from standard instance is automatically instantiating of a vector that hold masters and slaves and generating routing function.

Instantiating of vectors was a bit tricky due to two conflicting requirements:

- 1. I wanted to give user an option to create a vector on their own, and if elements of such vector are updated, I wanted to propagate the update to all instances using that vector.
- 2. I wanted to create a vector automatically when creating a bus, so users don't have to create it manually.

This created a feedback loop, where update of a bus caused update of vectors used by this bus, and so on. To fix this I moved the updates of vectors to initialization of a bus, and allow for updates of vectors that are "silent" and don't cause updates.

Routing function is a function that takes a route and produces one hot vector, that specifies which slave is used for a given address. To create a routing function two things must be created. First I need to create a new function to be added to the database of known functions. To do this I just reuse my parsing tooling. As for creating code for it, I generate it by simply adding if statement for each address region. I also generate comments giving information about which slave occupy which address region.

#### JSON interface

I decided that my "human-readable format" will be JSON. This is because wide support for JSON is already available in many languages. The only problem of JSON is that it's more verbose than Bluespec, and in theory it would make more sense to use Bluespec. However, this could introduce user confusion, as I wouldn't be able to support arbitrary Bluespec features. My understanding is that supporting something only partially could lead users to think that my tool is broken, because it doesn't match theirs assumptions about what it can do. While it can be argued that if I had written this project as extension of Bluespec compiler I could have supported all features of Bluespec (thanks to already written logic). As explained at the beginning of this document, doing this was infeasible for me as a part II project. At the same time using JSON allows me to support simplified semantics for bus creation, and features not found in Bluespec like specification of folders with libraries imported.

With this out of the way let's talk about how I'm using JSON files.

#### Structure of JSON file

At the top level of the file user can define following things:(TODO change those names to something more descriptive)

- "additional folders" This should be a list of folders that will be added to Bluespec's search path used to find modules.
- "packages" List of packages that will be loaded. Theirs dependencies will be loaded automatically.
- "name" Name of the top module.
- "packageName" Name of the package, and therefore the file that contains the top module.
- "typedefs" List of typedefs that will be added to the global scope.
  - "name" Name of the typedef.
  - "value" Value of the typedef.
- "modules" List of modules that will be created.

- "name" Name of the module.
- "function" Name of the function that will be used to instantiate the module.
- "function\_params" List of parameters that will be passed to the function. (optional if function dose not take any parameters)
- "interface\_params" List of the parameters of created interface. (optional if it's possible to deduce them from functions type)
- "context" This adds ability to pass interface parameters in unordered maner. This is useful if actual type is quite complex, but most of the parameters can be deduced.
- "connections" List of connections that will be added to the global scope.
  - "from" Name used to access interface on the left side of the connection.
  - "to" Name used to access interface on the right side of the connection.
- "busses" List of busses that will be added to the global scope.
  - "name" Name of the bus. (optional)
  - "function" Name of the function that will be used to instantiate the bus.
  - "masters" List of names used to access interfaces that will be used as masters.
  - "slaves" List of slaves.
    - \* "name" Name used to access the interface.
    - \* "routes" List of lists length two that specify starts and ends of address regions that will be used by this slave.

#### Using JSON interface

If all I did was simply synthesizing a Bluespec file from JSON, it would be possible to do that just by doing simple string manipulation. I know this because my initial iterations were doing this. However, my tool can do more than that. One important feature is this tools ability to display all possible connections' user can make between interfaces and subinterfaces. It is also able to show what possible valid master and slaves are for a given bus(Assuming at least one master and slave are present as otherwise it doesn't make sense). Finally, it can also display types of all interfaces and subinterfaces. This can be useful when using complex modules, like a CPU core that doesn't have any direct connections, but It has many subinterfaces that can be connected to things like memory, external devices, etc.

#### 3.4.1 GUI

Adding GUI was an optional goal mentioned in the project proposal. I originally intended to use a game engine, but Bluespec compiler is a Linux tool and game engines like Unreal Engine 4 or Unity3d are optimized to be used on Windows, so I decided against that idea. Instead, I found a library called React Flow, that provided simple API for creating graph based user interfaces. Therefore I decided to create a GUI as a website. This was quite an adventure as It was my first contact with web development. (I have done backend in .NET and C# for group project in last term, but it was mostly focused on writing queries to database)

### Backend

For backend, I had a choice between Django and Flask (as those are main python libraries). After reading about them in theory Flask was supposed to be better for small projects, like this one. However, I decided to use Django because I wanted to learn technology that will be more useful in the future. I followed official tutorial on Django and after initial pain with security adding functionality was easy.

#### Frontend (React)

Frontend was written in React.js as you might have guessed from the name of the library I wanted to use. According to Stack Overflow 2021 survey React.js is the most popular frontend technology in 2021, so again I considered time spent on learning it to be good investment. One thing that took me quite a bit to get used to is everything being a function(React.js supports classes, but it's an old paradigm and I wanted to learn doing thing the "correct" way). This makes working with variables a bit tricky, as changes to state variable cause re-rendering of the whole component, and if variable is not a state variable, then it's value is going to be lost after re-rendering. If not careful is easy to cause feedback loop causing infinite re-rendering and subsequent crash of the application.

#### Frontend (React Flow)

#### Vocabulary

• Handle - Is a component of node that is used to start or end a connection. TODO include a image of a handle.

React Flow from developer perspective requires defining nodes (they can be basically and arbitrary React components), and some metadata about displaying a graph that is things like how edges look, how user can add new nodes, etc. The exposed API is quite simplistic. For example position of a handle is constrained to the middle of side of a node, and this library will generate CSS to put it there. Therefore, I needed to write some finicky CSS to align handle with text displaying its name. [TODO improve this sentence.]

Another thing that is problematic about React Flow is that it will cause node component to update every time it is moved. What I found is that if you wrap larger subcomponents of a node in memo from React, then performance stays quite good even if multiple nodes are moving as React will cache rendered subcomponents. On the bright side after I figured out workarounds for those issues, I was left with many other things being handled by library. This included automatic graph formatting, and pretty Bézier curves for edges.

#### Frontend (MUI)

My default HTML buttons, and text boxes are ugly. So I decided to use React UI component library called MUI (previously Material UI). I don't have much to say about it except that use of it was frictionless, at least in my humble opinion it made my application look modern. I'm mentioning this as one of my complaints about Intel's QSys was that it's UI is decades old.

#### **Functionality**

Using this GUI user can do following things.

- Instantiate new module.
- Connect two interfaces.
- Create a bus.
- Look at created Bluespec file.

Almost all the fields that user needs to fill have context based autocompletion.

- There are autocompletion for functions that can be used to instantiate a module, and for possible valid
  arguments of that function that include previously created interfaces and typedefs found in loaded
  packages.
- After user selects one side of the connection the other side will have completions only for interfaces that are compatible with the selected interface.
- When creating a bus it will only show functions that have a characteristic of a bus creating function.

• After initialization of a bus, autocompletion will show valid masters and slaves.

Also, user can just drag and drop connections between interfaces and fields, and if user types in name of interface connection will be automatically created.

#### Futher work

It's worth keeping in mind that for GUI like this there is multitude of things that can be added. Like an option to filter functions based on input that we want to use. Running and showing output of the simulation or an ability to collapse and expand nodes to make the graph more readable(currently some parts of nodes allow for this already). Adding comments that partition the graph (like in UE4).

## Chapter 4

## **Evaluation**

Throughout working on this project I have learned a lot about QSys and Bluespec, and how they interact. This means that I will need to adjust slightly it from the project proposal to better represent problems that are solved. Firstly my understanding is that Platform Designer tool is shared between QSys and QSys Pro.

#### 4.1 Overview

My evaluation will be divided into two sections each with three subsections. First section will compare text file formats of QSys and mine and second section will compare GUI of QSys and mine. Each of the subsection will go through three examples:

- First one will show simple example of two connected fifos, and difficulty of connecting a thrid one.
- Second will show an example of simple system-on-chip (SOC). With one Flute core, memory module and fake interface with outside world, all connected by a bus.
- Third example will two AXI4 masters and two AXI4 slaves connected by a bus.

First two examples will produce valid devices, but they won't do anything interesting. Last example will produce a working top level module that will demonstrate practical use of the tool.

#### 4.1.1 Polymorphism disclaimer

Before we even create text files that we are going to compare we talk about an issue when working with QSys and Bluespec. That is the fact that Bluespec can't generate Verilog code of functions that are polymorphic. Nor there is support for Bluespec in QSys nor Xilinx Vivado. My understanding is Verilog type system is not rich enough to express things that would be needed to support complex polymorphism found in Bluespec. Therefore, any module used with QSys will need to be converted into it's synthesizable version stripped of any arguments and transpiled into Verilog. One might argue that this is an issue with Bluespec and not QSys, but I would argue that it is still a problem that needs fixing I have solution for It, that is working with Bluespec natively rather than Verilog files.

## 4.2 Example 1 - Connecting FIFOs

In this example our goal is to create two connected FIFOs and then modify the file to connect a third one. FIFOs that I will be using have a copy of FIFO interface found in default Bluespec libraries.

- Enq If queue is not full then enqueue the data.
- Deq If queue is not empty then remove first element.
- first If queue is not empty then return first element.

In Bluespec there are provisos attached to those functions, and if in given cycle user will want to perform illegal operation (like dequeue when queue is empty) then rule in which this operation would be performed simply won't fire in that cycle. In verilog there is no notion of such provisos nor method so everything needs to be synthesized to wires to aid this process Bluespec uses Ready-Enable micro protocol. Every method has Ready and Enable wires. When Ready is high, then method is allowed to be executed. When Enable is high, it signifies that method must be executed.

#### 4.2.1 Platform Designer

To create a system with two connected FIFOs, we first need to import them into platform designer. Here we encounter first problems. As mentioned before, Bluespec can't generate Verilog code of functions that are polymorphic. So we must decide on the width of the data stored in such FIFO before transpiling to Verilog. After analysis of Verilog file by platform designer it decides that all wires found are parts of Avalon slave interface. This is an obvious error, and we need to correct it.



This is unfortunately easier said than done. There are few options of assigning those wires, but neither of them perfect. Firstly we will need to choose a type of interface those will represent. We have over 30 options to chose from, but most of them are variants of larger protocols like AXI4. The only option that will make sense for us is Conduit, which basically meaningless bank canvas. We could create a conduit for each method but after consultation with my supervisor he recommended creating a conduit for each wire. So I did as he recommended, and this produces following set of interfaces. This requires quite a bit of manual work, listed bellow.

- Create 8 new interfaces (QSys figured out what to do with clock wire on its own, so we have one for free) this includes giving them a name and assigning a reset signal.
- More each wire to its respective interface and change signal type of each wire to something generic like wire from type given to it as part of Avalon slave interface.

With all of those wires we can connect obvious things together like first to enq\_value and clock signals, but then we run into a problem that with all wires that represent Ready-Enable micro protocol because we would like to enable enqueue in second queue only if both wires ready\_deq and ready\_first are high in the first queue, otherwise because we don't know how our FIFO is implemented we might run into a problem where we might get bugs(for example we lose data because we ready\_deq became high cycle faster than ready\_first, or in opposite scenario we would read stale/corrupted data). Unfortunately we can't create such logic using platform designer and one way of solving this would be to create special connector module that would do this logic(I'm going to come back later to explain why this approach might be infeasible in general case).



Figure 4.1: After assigning wires and Interfaces

Drag with me Drag with me Creator: CONFIRM UPDATE Simplefifo::mkSimpleFIF0 Simplefifo::FIFOIfc fifo1 Bit#(32) Start CONFIRM UPDATE fifo2 End fifo1 > Simplefifo::FIFOIfc#(Bit#(32)) Creator: Simplefifo::mkSimpleFIF0 Simplefifo::FIFOIfc Bit#(32) CONFIRM UPDATE > Simplefifo::FIFOIfc#(Bit#(32)) Drag with me Drag with me CONFIRM UPDATE Creator: Simplefifo::mkSimpleFIF0 "Instanciation successful" Simplefifo::FIFOIfc Bit#(32) Start CONFIRM UPDATE fifo3 ✓ "Instanciation successful" End fifo3 > Simplefifo::FIFOIfc#(Bit#(32))

Figure 4.2: Three connected FIFOs using my tool

### 4.2.2 My tool

Using my tool whole phase of importing and setting up wires can be skipped. Connecting is as simple as creating a new node connection node and selecting chosen FIFO's interface. Extending two connected FIFOs to three is as simple as adding one more FIFO and one more connection node.

### 4.2.3 Using text format to do same things and Qualitative metrics

To count number of tokens I'm using following regexp (\w|\.)+. This regexp will match sequences of alphanumerical characters with dots and underscores. (TODO draw graphs) Raw numbers: 431 tokens to import FIFO, 347 for two connected FIFOs, and 415 for three connected FIFOs. My solution: 31 for two connected FIFOs and 43 for three connected FIFOs.



Figure 4.3: Connected Sys-on-Chip using Platform designer

## 4.3 Example 2 - Simple System-on-Chip

In this example we will create a simple system-on-chip (SOC) with one Flute core, memory module and fake interface with outside world, all connected by a bus. This example is designed to even out playing field by making all connections use AXI4 interface which is undestood by platform designer.

#### 4.3.1 Platform Designer

Again we start with importing our modules. This time we need to import two of them, one for fake 16550 interface and one for flute core. As for memory module we will use one from QSys library to save time and effort.

Unfortunately platform designer can't figure out how to organize wires into interfaces like AXI4 master or AXI4 slave. So we need to do it manually. This wouldn't be so painful if not for the fact that we need to set type of each wire according to it's name(for example cpu\_imem\_master\_awready has type awready). Each of those AXI4 interfaces has 37 wires, and if we account for all other interfaces of Flute core we get 150 wires that individually need to be assigned to interfaces. Similarly, our fake 16550 interfaces also uses 40 wires in total. When assigning this through GUI it took me around 16 minutes to click through all the assignments. (For context tools like Xilinx Vivado have heuristics that are able to do a lot of this automatically, but my understanding is that still this works only for certain known interfaces)

After initial pain of importing modules we arrive at the place where Platform designer shines. It is able to show possible connections between AXI4 masters and slaves, on a two 2 grid-like layout. Then connecting two together is as simple as clicking cross-section of two wires on the screen.

Drag with me

Tong with me

To

Figure 4.4: Connected Sys-on-Chip using my tool

#### 4.3.2 My tool

Again using my tool we don't need to do anything to import modules, as all the typing information is provided by the package. To connect those together we create a special bus node, select function for creating a bus, select number of masters and slaves, then assign each accordingly. While I'm unable to show nice 2d grid of possible connections, I have two-stage autocompletion. Stage 1 autocompletion becomes available after user selects function to create a bus. Such functions can be polymorphic, so I show all interfaces that might be valid individually, but not necessarily together. Stage 2 autocompletion becomes available after user selects at least one master and at least one slave, and clicks confirm update. After that bus is locked to specific type of masters and slaves and autocompletion narrows to show only masters and slaves that will work together.

#### 4.3.3 Other differences

There is a number of subtle differences that are worth talking about, but were not pronounced by this example.

Platform designer has ability to automatically convert between AXI4 interfaces and Avalon ones, and also within those families it allows for some degree of flexibility when parameters don't match exactly. When I create busses I'm limited by typing system, and way functions for creating buses were defined. Therefore, if user wants to do similar conversions they would need to use some functions and perform those conversions by themselves.

Another slight difference is the fact that in Platform designer, each master has its own bus, and while slaves can be connected to any number of masters they need to be connected individually.

| 4.4 Example 3 - V | Vorking AXI4 | $\cdot$ masters $i$ | ${ m and}\ { m s}$ | ${f slaves}$ |
|-------------------|--------------|---------------------|--------------------|--------------|
|-------------------|--------------|---------------------|--------------------|--------------|

This example