# **RAPIDSMITH 2**

# A Library for Low-level Manipulation of Vivado Designs at the Cell/BEL Level

## Technical Report and Documentation

Brent Nelson, Travis Haroldsen, Thomas Townsend

NSF Center for High Performance Reconfigurable Computing (CHREC) \*
Department of Electrical and Computer Engineering
Brigham Young University
Provo, UT, 84602

Last Modified: May 6, 2017



<sup>\*</sup>This work was supported in part by the I/UCRC program of the National Science Foundation, grant numbers 0801876 and 1265957.

CONTENTS 2

# **Contents**

| I | Introduction                               | 4        |
|---|--------------------------------------------|----------|
|   | 1.1 What is RapidSmith 2?                  |          |
|   | 1.2 Who Should Use RS2?                    | . 4      |
|   | 1.3 Why RS2?                               | . 4      |
|   | 1.4 Which Xilinx Parts does RS2 Support?   | . 5      |
|   | 1.5 How is RS2 Different than VPR and VTR? | . 5      |
|   | 1.6 Why Java?                              |          |
|   |                                            | ·        |
| 2 | Vivado, RS2, and Tincr                     | 6        |
|   | 2.1 RapidSmith vs. RS2                     |          |
|   | 2.2 RS2 Usage Model and Structure          |          |
|   | 2.2 NO2 Osage Model and Ottactate          | ,        |
| 3 | Getting Started                            | 8        |
|   | 3.1 Installation                           |          |
|   | 3.2 Device Files For Use With RS2          |          |
|   | 5.2 Device riles for use with K52          | 10       |
| 4 | Devices                                    | 12       |
| 4 |                                            |          |
|   | 4.1 Xilinx FPGA Architecture Overview      |          |
|   | 4.2 Device Data Structures                 |          |
|   | 4.3 Supported Device Files                 | . 16     |
| _ |                                            |          |
| 5 | Designs                                    | 17       |
|   | 5.1 Xilinx Netlists                        |          |
|   | 5.2 RS2 Netlist Data Structures            | . 18     |
|   | 5.3 The Cell Library                       | . 26     |
|   | ·                                          |          |
| 6 | Placement                                  | 30       |
|   |                                            |          |
| 7 | Routing                                    | 32       |
|   | 7.1 Wires and WireConnections              | . 32     |
|   | 7.2 Traversing Wire Objects                | . 32     |
|   | 7.3 Other Types of Connections             |          |
|   | 7.4 RouteTrees                             |          |
|   | 7.5 Three Part Routing                     |          |
|   | 7.6 Routing in Vivado                      |          |
|   | <u> </u>                                   |          |
|   | 7.7 IntrasiteRouting                       | 38       |
| 8 | Design Import/Export                       | 39       |
| 0 |                                            |          |
|   | RapidSmith Checkpoints                     |          |
|   | 8.2 Tincr Checkpoints                      |          |
|   | 8.3 Additional Information                 |          |
|   | 8.4 Importing and Exporting TCPs           | . 44     |
|   |                                            |          |
| 9 | Example Programs                           | 45       |
|   | 9.1 Sample Vivado Designs                  |          |
|   | 9.2 DeviceBrowser                          | 45       |
|   | 9.3 DeviceAnalyzer                         | 46       |
|   |                                            |          |
|   | ·                                          | 46       |
|   | 9.4 ImportExportExample                    |          |
|   | 9.4 ImportExportExample                    | 46       |
|   | 9.4 ImportExportExample                    | 46<br>47 |

CONTENTS 3

| 10 |                                    |    |
|----|------------------------------------|----|
| 11 | Bitstreams in RS2                  | 52 |
| 12 | License                            | 53 |
| 13 | Included Dependency Projects       | 54 |
| A  | Appendix A: XDLRC Files and Syntax | 55 |
| В  | Appendix                           | 58 |

## 1 Introduction

## 1.1 What is RapidSmith 2?

The original BYU RapidSmith project began in 2010. Its goal was to develop a set of tools and APIs which would provide academics with an easy-to-use platform to implement experimental CAD ideas and algorithms on modern Xilinx FPGAs. It integrated with Xilinx's old design suite, ISE. RapidSmith 2 (abbreviated RS2 hereafter) represents a major addition to RapidSmith. Specifically, Vivado designs are now supported. Using RS2 you can write custom CAD tools which will:

- Export designs from Vivado
- Perform analyses on those designs
- Make modifications to those designs
- Import those designs back into Vivado for further processing or bitstream generation

Futhermore, you need not start with a Vivado design — you can create a new design from scratch in RS2 and then import it into Vivado if desired.

The other new major capability of RS2 is that it changes RapidSmith's design representation. Instead of using XDL's view of a design with Instances and Sites, RS2 uses Vivado's representation of design with Cells and BELs. This is a significant change as it exposes the actual design and device in a way that RapidSmith never did, opening up new CAD research opportunities which were difficult to perform using Rapidsmith.

#### 1.2 Who Should Use RS2?

RS2 is aimed at anyone desiring to do FPGA CAD research on real Xilinx devices available in Vivado. As such, users of RS2 should have some understanding of Xilinx FPGA architecture, the Vivado design suite, and the Tcl programming language. However, one goal of this documentation is to provide sufficient background and detail to help bring developers up to speed on the needed topics. RS2 is by no means a Xilinx Vivado replacement. It cannot be used without a valid and current license to a Vivado installation (RS2 cannot generate bitstreams for example).

#### 1.3 Why RS2?

The Xilinx-provided Tcl interface into Vivado is a great addition to the tool suite. It can be used to do a variety of useful things including scripting design flows, querying device and design data structures, and modifying placed and routed designs. In theory, the Tcl interface provides all of the functionality needed in order to create any type of CAD tool as a plugin to the normal Vivado tool flow. However, there are a few issues in TCL that motivate the use of external CAD tool frameworks such as RS2. These include:

- Tcl, being an interpreted language, is slow. It is far too slow to implement complex algorithms such as PathFinder. Compiled and managed runtime languages are a better option in terms of performance.
- Tcl is hard to program in. TCL is not an object oriented language, and so writing complex algorithms are difficult since Object-Oriented language constructs do not exist. That being said, TCL is great for writing automation scripts.
- There are some memory issues in Vivado's Tcl interface. In our experience, long-running scripts eventually cause the system to run out of memory even if they are not doing anything interesting.
- Vivado's TCL interface does not offer a complete device representation (determined by Brad White's MS work).
   Most notably, a user cannot gain access to sub-site wire objects through the Tcl interface. This limits the CAD tools that can be created in Tcl, but this additional information can be added to external tools with some manual work.

In short, the ability to export designs out of Vivado, manipulate them with more powerful languages such as Java, and then import the design back into Vivado is a very useful capability.

RS2 (in conjunction with Tincr which is described in subsubsection 2.1.3) abstracts this process into a few easy-to-use function calls. Generating FPGA part information, importing and exporting **all aspects** of a design, and dealing with other fairly arcane details is made mostly transparent to the user. RS2 and Tincr provide a nice API into equivalent Vivado device and design data structures. All of this enables researchers to have more time to focus on what matters most: the research of new ideas and algorithms.

## 1.4 Which Xilinx Parts does RS2 Support?

As of the writing of this document, Artix 7 has been tested the most and is currently supported in all forms and applications. In addition, an Ultrascale device file was created and demonstrated as a part of Brad White's MS work to show that it is possible. At some point, Ultrascale should be fully supported. <sup>1</sup>

As will be seen later, to generate additional device files for additional parts within a supported family is relatively straightforward and can be done by any user. New families can also be supported but this requires a bit more work. As time goes on the process will become simpler — that is one of the goals for RS2 moving forward.

#### 1.5 How is RS2 Different than VPR and VTR?

VPR (Versatile Place and Route) has been an FPGA research tool for several years and has led to many publications on new FPGA CAD research. It has been a significant contribution to the FPGA research community and has grown to be a complete FPGA CAD flow for research-based FPGAs. The main difference between RapidSmith and VPR is that the RapidSmith tools can target commercial Xilinx FPGAs, providing the ability to exit and re-enter the standard Xilinx flow at any point. All features of commercial FPGAs which are accessible via XDL and Vivado's Tcl interface are available in RapidSmith and RS2. VPR is currently limited to FPGA features which can be described using VPR's architectural description facilities.

#### 1.6 Why Java?

RS2 is written in Java. We have found Java to be an excellent rapid prototyping platform for FPGA CAD tools. Java libraries are rich with useful data structures, and garbage collection eliminates the need to clean up objects in memory. This helps reduce the time spent debugging, leaving more time for researchers to focus on the real research at hand. Our experience over the past decade is that for student research projects, Java has greatly improved student productivity and led to far more stable CAD tools.

<sup>&</sup>lt;sup>1</sup>An XDL-based import/export capability has also been created and used with Virtex 6 devices as a part of Travis Haroldsen's PhD work but that path is not being released, documented, or supported.

## 2 Vivado, RS2, and Tincr

## 2.1 RapidSmith vs. RS2

## 2.1.1 What Was The Original RapidSmith?

The original RapidSmith was written by Christopher Lavin as a part of his PhD work at BYU. It was based on the Xilinx Design Language (XDL) which provides a human-readable file format equivalent to the Xilinx proprietary Netlist Circuit Description (NCD) of ISE. With RapidSmith, researchers were able to import XDL/NCD, manipulate, place, route and export designs among a variety of design transformations. The RapidSmith project made an excellent test bed to try out new ideas and algorithms for FPGA CAD research because code could quickly be written to take advantage of the APIs available.

RapidSmith also contained packages which could parse/export bitstreams (at the packet level) and represent the frames and configuration blocks in the provided data structures. In this regard, RapidSmith did not include any proprietary information about Xilinx FPGAs that is not publicly available.

RapidSmith continues to be functional and is still available at the SourceForge.net website. There, you will find documentation, installation instructions, the RapidSmith code base, and a collection of demo programs based on it.

#### 2.1.2 What is RS2?

With the announced end of ISE (with the Virtex7 family of parts being the last family to be supported by ISE), there was no path forward to newer parts using RapidSmith. This is because XDL is not available with Vivado. With Vivado, however, Xilinx has provided an extensive Tcl scripting capability which initially looked as if it could provide a similar capability to that provided by XDL in terms of accessing both Vivado's design and device data and in terms of creating and modifying Vivado designs. However, as described above, Vivado's Tcl is limited by speed and memory challenges. The development of RS2 consisted of three parts.

## 2.1.3 Tincr: Integrating Custom CAD Tool Frameworks with the Xilinx Vivado Design Suite

In the first part, the Vivado Tcl capability was investigated to ensure that, indeed, it did provide the needed ability to access design and device data and export that to external tools such as RapidSmith. This resulted in the Tincr project, led by Brad White as a part of his MS work at BYU, with Thomas Townsend making additions as a part of his research.

Tincr is a Tcl-based library of routines which (a) provide a variety of functions to simply make working with Vivado via Tcl easier, (b) provide a way to export all the data associated with a Vivado design into what is called a Tincr Checkpoint (TCP), (c) provide a way to reimport Tincr Checkpoints back into Vivado, and (d) access device data from Vivado and output that data in the form of XDLRC files (these are the files which XDL used to describe devices and are necessary for RapidSmith and RS2 to understand the structure of and the resources available for use in a given Xilinx part). Tincr is available at <a href="https://github.com/byuccl/tincr">https://github.com/byuccl/tincr</a>. Tincr is described in two publications:

B. White and B. Nelson, "Tincr A custom CAD tool framework for Vivado," 2014 International Conference on ReConFigurable Computing and FPGAs (ReConFig14), Cancun, 2014, pp. 1-6, DOI: 10.1109/ReConFig.2014.7032560

White, Brad S., "Tincr: Integrating Custom CAD Tool Frameworks with the Xilinx Vivado Design Suite" (2014), BYU Scholars Archive, Paper 4338. URL:http://scholarsarchive.byu.edu/etd/4338

## 2.1.4 RS2: A Framework for BEL-Level CAD Exploration on Xilinx FPGAs

The second part of the development of RS2 was to add a new layer of design representation to RapidSmith which more closely matches that of Vivado. This was done as a part of his PhD work by Travis Haroldsen at BYU. As of this writing, one paper on RS2 has appeared:

Travis Haroldsen, Brent Nelson, and Brad Hutchings, RapidSmith 2: A Framework for BEL-Level CAD Exploration on Xilinx FPGAs, Proceedings of the 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, February 2015, Monterey CA, pp. 66-69, DOI: 10.1145/2684746.2689085.

#### 2.1.5 Vivado and RS2 Integration

The third part of the development of RS2 was to create the ability to export designs from Vivado and into RS2 and, correspondingly, to import RS2 data back into Vivado. This was completed during 2016, largely by Thomas Townsend as an MS student at Brigham Young University. The initial public release of RS2 was made in January 2017 once that piece was in place.

#### 2.1.6 What is All This About XDL and XDLRC and How Does RS2 Fit Into That?

The Xilinx ISE tools had the capability to export XDL and XDLRC files which RapidSmith used:

- An XDLRC file was a complete description of a given Xilinx FPGA, describing every tile, every switchbox, every wire segment, and every PIP in the part. RapidSmith was able to process this information and create a device representation for use in support of CAD tools such as placers and routers.
- An XDL file was a textual representation of an NCD file (a user design). It described the user design as a collection of Instances and Nets. Instances correspond to things like SLICEs, BRAMs, DSP48s, and IOBs. Instances could be placed onto Sites. Additionally, Nets in XDL consisted of a list of Pins (their logical connections) and an optional list of PIPs (their physical routing connections).

In Vivado, however, designs are described as a collection of Cells where a Cell corresponds to things like LUTs, flip flops, etc. A Cell may be placed onto a BEL object such as an ALUT or a BFF. RS2 contains a new layer of hierarchy in its design and device descriptions where Cells and BELs are first-class objects and design manipulation is all done at the Cell/BEL level.

Also, Vivado Nets are described using directed routing strings rather than lists of PIPs. RS2 also contains a set of new classes to enable the representation and manipulation of Nets in a format compatible with these routing strings.

Thus, using RS2, design manipulation is now done at the level of Cells and BELs and importing/exporting designs to/from Vivado is now fully supported.

## 2.2 RS2 Usage Model and Structure

The usage model for RS2 is shown in Figure 1. As can be seen, a design can be exported from Vivado at multiple different points in the Vivado design flow. In each case, Tincr is used to export a Tincr Checkpoint which can then be imported into RS2. At those same points in the design flow, RS2 can export a Tincr Checkpoint which can then be imported back into Vivado. Thus, a complete solution involves Vivado, Tincr, and RS2.



Figure 1: Vivado and RS2 Usage Model

## 3 Getting Started

#### 3.1 Installation

RS2 is available on Github at: https://github.com/byuccl/RapidSmith2. You can either build RS2 into .class and .jar files for use in any Java environment, or configure RS2 to work in an IDE (recommended).

#### 3.1.1 Requirements for Installation and Use

- Windows, Linux or Mac OS X all will work (see additional notes below for Mac OS X)
- Vivado 2016.2. Later versions of Vivado may work, have not been tested yet. Earlier versions will not work.
- JDK 1.8 or later
- Tincr

Tincr is a companion project (https://github.com/byuccl/tincr) which is used for importing/exporting designs between Vivado and RS2. For getting started (running the example programs on the provided sample designs) you will not need it installed. Later, as you actually start processing your own Vivado designs you will need to obtain and install it. There are additional dependencies beyond these required for installation, but they are either provided in the distribution itself or are automatically retrieved for you as a part of the installation process. Examples of these additional dependencies include QT Jambi and the BYU Edif Tools.

#### 3.1.2 Steps for Installation

1. Clone the RS2 repository at https://github.com/byuccl/RapidSmith2. If you are not familiar with GitHub, you will need to install Git on your computer, and run the following command in an open terminal:

```
git clone https://github.com/byuccl/RapidSmith2
```

This will copy the RS2 repository into a local directory.

- 2. Create a new environment variable called RAPIDSMITH\_PATH, and point it to your local repository of RS2 that you setup in step (1). This is needed so RS2 can find required device files and other items at runtime.
- 3. Build the RS2 project. RS2 is managed using a gradle build system. To build the project, navigate to your local repository of RS2 and execute one of the following scripts in a terminal:

```
gradlew build (unix)
gradlew.bat build (windows)
```

The build process could take a few minutes.

4. At this point, you have two choices: set up RS2 for use in an IDE, or run RS2 from the command line. Both choices are detailed below.

**Running from an IDE** The gradle scripts in RS2 currently support setup for both Eclipse and IDEA Java IDEs. This section will detail how to setup the Eclipse environment, but similar steps can be taken for IDEA. If using Eclipse, it is best to use version Eclipse Neon or later. To create a new eclipse project, execute one of the following in a terminal:

```
gradlew antlr eclipse (unix)
gradlew.bat antlr eclipse (windows)
```

3.1 Installation 9

Executing these will create an Eclipse .project file. After the project file has been created, you can import the project into Eclipse by opening Eclipse and selecting:

```
File->Open Projects From File System
```

and pointing it to your RapidSmith2 local repository. All Java source files will be found under *src/main/java*. **NOTE:** Your RS2 git repository should not be put inside your eclipse workspace. It is better to put it elsewhere, and then import it into your workspace.

**Building on the Command Line** After step (3) in the installation process, gradlew produces everything that you will need to run RS2 from the command line. The following directories are created:

- build/classes/main: This folder contains the RS2 class file directory tree.
- build/libs: This folder contains a Jar file of the RS2 class files.
- build/distributions: This folder has both .zip and .tar files with contains all Jars needed to run RS2 from the command line. This includes a full jar of the RS2 build alond with copies of dependency Jars (such as QT-Jambi).

After adding the appropriate .class files or Jars to your CLASSPATH, you should be able to run RS2 tools from the command line. If you make any changes to the RS2 code, you will have to rebuild before running the program again (Step 3). **CAUTION:** An obvious thing to try is to mix and match developing in Eclipse but then running the resulting apps from the command line. Just be aware that Eclipse puts its compiled .class files in very different places than where the gradle build process puts its .class and .jar files. Make sure you understand that before you try to combine these two build/execution methods. Our suggested approach is to choose one or the other, but not both.

#### 3.1.3 Alternative Installation and Use

RapidSmith2 is also available as a Docker container. To painlessly set up a working RapidSmith2 environment, type:

```
docker run -it byuccl/rapidsmith2
```

For more information about Docker, see the guide.

#### 3.1.4 Additional Notes for Mac OS X Installation

The instructions above require you to set the RAPIDSMITH\_PATH environment variable. If running from the command line, the environment variables can be added to your .bash\_profile file as in any other UNIX-like system. However, if using an IDE such as Eclipse you either need to define the environment variable for every Run Configuration you create, or you need to add the RAPIDSMITH\_PATH definition system-wide in OS X. This can be done, but how to do so differs based on what OS X version you are running (and seems to have changed a number of times over the years). Search the web for instructions for how to do so if you desire. **Hint**: you will likely have to edit some .plist files.

#### 3.1.5 Running RS2 Programs

Some points to keep in mind while configuring and running RS2 programs:

- The RS2 code base contains a number of assertions which may be helpful as you are developing code. These are not enabled by default in Java. To enable them, add -ea as a VM argument. This is highly recommended.
- If you are running on a Mac, when running RS2 programs that use Qt (any of the built-in programs like **Device-Browser**) that are GUI-based, you will need to supply an extra JVM switch, -*XstartOnFirstThread*.

- A common error when running RS2 programs is failing to have your RAPIDSMITH\_PATH defined. If this is the case when you try to execute a program, an EnvironmentException will be thrown telling you that you forgot to set the variable.
- If you are running on Windows, only a 32-bit QT Jar file is included in the RS2 repository. This means that you will need to set your JRE to a 32-bit version when running the GUI programs. We are working on updating QT to the latest version, so this will no longer be an issue.
- For Linux command line usage, the CLASSPATH environment variable must point to both the full (uncompressed) RS2 jar in the *build/distributions* folder as well as all the jar files in the */lib* subdirectory. An example CLASSPATH could look like this:

RAPIDSMITH2-SNAPSHOT/\*:RAPIDSMITH2-SNAPSHOT/lib/\*

#### 3.1.6 Testing Your Installation

At this point you can test your installation by executing the java **DeviceBrowser** program:

java edu.byu.ece.rapidSmith.device.browser.DeviceBrowser

This can be done either from within Eclipse or from the command line, depending on how you are running RS2 (if running under OS X be sure to provide the *-XstartOnFirstThread JVM argument*. If all goes well you should see a graphical representation showing the details of a physical FPGA device as shown in Figure 2. You may initially be zoomed far in and might want to zoom out to see the entire chip layout.



Figure 2: DeviceBrowser Sample Display

## 3.2 Device Files For Use With RS2

Device files for one part (the *xc7a100tcsg324*) are included in the distribution so you can immediately start working with RS2 (initially, it will be the only device available when you run the **DeviceBrowser** program above). The device

files for this part can be found in the  $RAPIDSMITH\_PATH/devices/artix7$  directory. If you desire to work with additional parts, follow the instructions found in the file  $RAPIDSMITH\_PATH/doc/InstallingNewDevices.txt$ .

## 4 Devices

#### 4.1 Xilinx FPGA Architecture Overview

This section is intended to give a brief introduction to Xilinx FPGA architecture and terminology. The terminology introduced here is consistent with the terminology used in the Vivado Design Suite. If you are already familiar with Xilinx FPGA devices, then you can skip to subsection 4.2. As you read through this section, it may be helpful to open a sample device in Vivado's Device Browser. To do this, open a new command prompt and run Vivado in Tcl mode ("vivado -mode tcl"). Then, run the following commands in the Vivado prompt:

```
Vivado% link_design -part xc7a100tcsg324-3 -quiet Vivado% start_gui
```

After these commands are run, a GUI view should pop up showing the components of an Artix7 FPGA part. Use this to explore the Xilinx device architecture if needed.

#### **4.1.1** Tiles

Conceptually, a Xilinx FPGA can be thought of as a two dimensional array of Tiles. A Tile is a template section within a FPGA that performs a specific function (say, implementing logic equations). Templates are then duplicated across the device (all copies are identical) and different Tile types are wired together. As an example, Figure 3 displays three types of template Tiles in an Artix7 part.



**Figure 3:** Highlighted example Tiles in an Artix7 FPGA.

The two **VBRK** Tiles on the left are used for intermediate wiring. The two **INT\_L** Tiles on the right are switchbox Tiles. These are reconfigurable routing Tiles that allow a single wire to go multiple different locations within the FPGA. The two **CLBLL** Tiles in the middle are used to implement combinational and sequential digital logic. They are the basic building blocks of Xilinx FPGAs. Other Tile types include DSP, BRAM, and IOB.

#### 4.1.2 Primitive Sites

Each Tile can contain one or more Primitive Sites (often shortened to Site). As Figure 3 shows, CLBLL Tiles have two Sites, INTL Tiles have one Site, and VBRK Tiles have none. A Site is the part within a Tile that actually performs its "useful" function. The remainder of the Tile is used to wire signals to/from its Sites. Figure 4 shows a SLICEL, one of the Sites within a CLBLL Tile. As the figure shows, there are three main components to Sites in Xilinx FPGAs:

- Site PIPs: Reconfigurable routing PIPS used for intrasite routing (also called routing muxes). In Vivado, Site PIPs are usually configured automatically as cells in a design are being placed (based on cell properties and placement location).
- BELs: **B**asic **E**Lements are hardware components within the Site for implementing digital logic. For example, LUT BELs within a SLICEL are used to implement logic equations, and Flip Flop BELs are used as storage. In a synthesized netlist, design elements are mapped to physical BELs during implementation.
- Site Pins: Site input and output. These pins are connected to Wires of the parent Tile and typically drive general fabric.



**Figure 4:** (a) A highlighted example of a SLICEL Site in a CLBLL Tile of an Artix7 FPGA. (b) The basic components of a Site in a Xilinx FPGA.

4.2 Device Data Structures 14

#### 4.1.3 Wires and PIPs

FPGA components are connected together using metal Wires (called Nodes in Vivado). In order to make the FPGA reconfigurable, Wires are connected together through Programmable Interconnect Points (PIPs). Individual PIPs can be enabled or disabled as a design is being routed, and a string of enabled PIPs uniquely identify the used Wires of a physical route. PIPs are most commonly found in Switchbox Tiles, and enable a single wire to be routed to many different locations in the FPGA. Figure 5 shows an example of a switchbox.



**Figure 5:** An example of PIP wire connections in a device. The green wire represents the source wire, and the red wires represent all possible sink wires in the Switchbox. The highlighed white sections of the figure are PIP connections.

#### 4.2 Device Data Structures

In the original RapidSmith, the <code>Device</code> architecture stopped at the <code>Site</code> level. A <code>Site</code> was considered a black box who could be configured using string attributes, but the actual components were unknown. RapidSmith 2 extends the <code>Device</code> architecture to include all components within a <code>Site</code> as well. Figure 6 shows the new data structure hierarchy, which can be found in the <code>edu.byu.ece.rapidSmith.device</code> package.

4.2 Device Data Structures 15



Figure 6: RapidSmith2 Device data structure tree. Green arrows represent inheritance, and black arrows represent association. Classes and Interfaces bolded in blue are new to RapidSmith 2.

The classes and interfaces within *edu.byu.ece.rapidSmith.device* are named to reflect the terminology used by Xilinx. Many classes that exist in Vivado's Tcl interface have a direct map to a class in RapidSmith (such as a Tile). Because of this, most RapidSmith data structures represent a straightforward part of a Xilinx FPGA, The reader is referred to the documentation in the source code to learn more about each data structure. Also, The **DeviceBrowser** and **DeviceAnalyzer** programs described in section 9 illustrate how to load and browse a device down to the Tile and Site levels.

#### 4.2.1 Templates

As Figure 6 shows, there are several template classes in RapidSmith. Template classes are used to specify the configuration of certain structures only once, and then reuse the configuration across identical objects. The usefulness of templates is best shown with an example. In an Artix-7 xc7a100tcsg324 part, there are 11,100 Sites of type SLICEL. Each of these SLICELs have 215 BELs, Site Pins, and Site PIPs combined. In order to save memory, RapidSmith lazily creates these objects only when a SLICEL Site is being used. The alternative would be to create each of the objects when a device is loaded. Template classes should **NOT** be used by the normal user. When creating algorithms using RapidSmith's API, use the non-template version of classes.

#### 4.2.2 WireEnumerator

Wires with the same name can occur several times throughout a Xilinx FPGA device. For example, the wire "CLBLL-L." In order to make the device files small, each uniquely named Wire is assigned an integer enumeration. This avoids moving strings around in memory which would be costly in terms of both space and comparison times. RapidSmith manages all uniquely named wires in an FPGA family with a WireEnumerator. The WireEnumerator class has methods that convert to and from the wire name and enumeration of a Wire, and also stores other Wire information such as direction and type. In previous versions of RapidSmith, the user had to use the WireEnumerator extensively while building CAD tools. RapidSmith 2 has changed this, largely abstracting the WireEnumerator away in favor of more convenient methods in all classes and interfaces that deal with Wires. For example, the name or enumeration of a Wire can now be obtained with the function calls Wire.getWireName() and Wire.getWireEnum() respectively. A handle to the WireEnumerator still exists in the Device class for those who want to use it, but this is not recommended.

#### 4.2.3 TileWire and SiteWire

Wires in RapidSmith are uniquely identified not only by their name (or enumeration), but also by the Tile or Site in which they exist. RapidSmith 2 introduces the TileWire and SiteWire classes to encapsulate this information for the user. Many functions in RapidSmith now return a TileWire or SiteWire (wrapped in a generic Wire object) to the user instead of an integer wire enumeration.

#### 4.2.4 Wire and Connection Interfaces

As Figure 6 shows, there are two classes that inherit the Wire interface (described in subsubsection 4.2.3) and three classes that inherit the Connection interface (described in subsection 7.3). In general, most methods in RapidSmith return the generic interface instead of the subclasses. When creating CAD tools in RapidSmith, it is suggested that the user use these interfaces. Several of the examples referenced in section 9 demonstrate the use of these interfaces.

## 4.3 Supported Device Files

TODO section:

## 5 Designs

#### 5.1 Xilinx Netlists

During the synthesis stage of implementation, a digital circuit expressed using RTL (VHDL or Verilog) is translated to a lower level Xilinx netlist. This netlist describes a digital circuit in terms of primitive elements that can directly target hardware on a Xilinx FPGA. In terms of granularity, a Xilinx netlist is more abstract than gates and transistors, but more detailed than RTL. A list of valid primitives (also called Cells) that can be used within a Xilinx netlist can be found here for 7 Series devices and here for Ultrascale devices. It is suggested that users of RapidSmith become familiar with the primitives found in these documents, and how they are used. The primitives of a Xilinx netlist are wired together to create a digital circuit (such as an 32-bit adder) capable of being implemented on an FPGA. Figure 7 shows an example netlist of a 3-bit counter that has been synthesized in Vivado.



**Figure 7:** Schematic of a 3-bit counter in Vivado using LUT and FDRE cells. The yellow boxes are Cells, the green lines are Nets, and and the white figures on the edge of the diagram are Ports. An example of each of these netlist components have been highlighted.

As the figure shows, Vivado netlists are composed of three primary components: Cells, Nets, and Ports. Cells are **instances** of Xilinx primitives. They are the basic building blocks of a Xilinx netlist and implement the actual logic of a digital design. The most commonly used Cells include:

- Look Up Tables (LUTs): Implement logic equations such as  $O6 = (A1 + A2) \oplus A3$ .
- Flip Flops (FDxx): Single-bit storage elements. Figure 7 uses an FDRE cell which specifies a rising-edge Flip Flop with a reset port, but ties the clock enable port high. Other types of FDxx cells can be used to include a clock enable signal (replace xx with the corresponding letters).
- Block Ram (BRAMs): On-chip FPGA memory cell.
- Digital Signal Processing Units (DSPs): Perform complex arithmetic functions efficiently.
- **Buffers** (BUF): IO, clock, and other types of signal buffers.

There exists several other types of Cells, but the ones in the list above are the most prevalent. Nets are used to connect Cells together. In other words, the output of one Cell is wired to the input of another Cell using a Net. Ports are simply design Input/Output (IO). In terms of an FPGA design, Ports are connected to specific peripheral pins of the FPGA for chip IO. It is important to note that a Xilinx netlist is purely logical, there is no physical information within the netlist (i.e. there is not information about where the Cells have been placed, or how the Nets have been routed). When exporting a design from Vivado, the Xilinx netlist representation is converted to EDIF. In order to recreate the design in RapidSmith, the EDIF file is parsed and converted to a custom netlist structure described in the next section.

#### 5.2 RS2 Netlist Data Structures

RapidSmith netlists are modeled closely after Xilinx netlists. In fact, much of the terminology between the two are identical or very similar. For those that are familiar with Vivado designs, this should make the transition to RapidSmith straightforward. The data structures that constitute a RapidSmith netlist can be found in the package byu.edu.ece.rapidSmith.design.subsite. The package hierarchy can be seen in Figure 8.



Figure 8: RapidSmith 2 design data structure tree. Black arrows represent composition (i.e. a CellDesign contains an array of CellNets and an array of Cells). Green arrows represent inheritance (i.e. a CellPin can be either a BackedCellPin or a PseudoCellPin). Blue boxes represent physical implementation components of the netlist (i.e. what physical Bel a Cell is placed on, or what physical Wires are used in a CellNet).

A CellDesign is the top level design object in RapidSmith. It consists of a collection of Cell objects, interconnected by CellNets. RapidSmith Cells are equivalent to Xilinx Cells and RapidSmith CellNets are equivalent to Xilinx Nets. Cell objects have a template LibraryCell, which represents a Xilinx primitive. They also have a collection of CellPins, which can be thought of as Cell IO. During the placement phase of implementation, Cells may be mapped onto Bels and the corresponding CellPins mapped onto BelPins. Placement is described in more detail in section 6. CellNet objects connect Cells together via their CellPins. During the routing phase of implementation, a CellNet maps onto one or more RouteTrees. Routing is described in more detail in section 7. The best way to learn how to use these classes is to read through the Javadocs, but important aspects of each class is included in the following subsections.

#### 5.2.1 CellDesign

As previously mentioned, the CellDesign class is the top level netlist structure. A Vivado design is converted from a Tincr checkpoint to a CellDesign on import, and converted from a CellDesign to a Tincr checkpoint on design export. It contains the following:

- A list of uniquely named Cells in the design.
- A list of uniquely named CellNets in the design
- Global GND and VCC CellNets
- Cell placement information (where each cell is placed)
- Internal routing configuration for each **used** Site (in the form of Site PIPs)

• A list of XDC constraints imported from Vivado. See section 8 for more information about XDC constraints and how they are represented in RapidSmith.

The CellDesign class has a variety of methods to retrieve and manipulate the Cells and CellNets of a design, place Cells onto physical BELs, configure sub-site routing, and perform several other tasks. See the Javadocs for the exact API.

#### 5.2.2 Cell

This section contains a few things you should know about Cells in, no particular order.

• A Cell always contains a reference to a backing LibraryCell. A LibraryCell is equivalent to a Xilinx primitive cell (described in subsection 5.1), and serves as a template for instantiated Cell objects. The template is used to save memory when creating several Cells of the same type. Whenever a new Cell object is created, a corresponding LibraryCell must be specified in the constructor. Code Sample 1 demonstrates how to create a new cell in RapidSmith and filter cells based on their type.

#### Code Sample 1: How to create new cells in RapidSmith

The method call CellDesign.getCellsOfType (String, CellLibrary) can be used to get all Cells in the current design with a specific LibraryCell type.

- A Cell has a list of CellPins that can connect to CellNets. The methods Cell.getPins(), Cell. getInputPins(), and Cell.getOutputPins() can be used to get a handle to the pins of a Cell. If more pins are needed on a Cell, PseudoCellPin objects can be attached (see subsubsection 5.2.3).
- Cells can be placed onto BELs of the current Device. See section 6 for more information about cell placement
- Top-level Ports in Vivado (design input/output) are represented as Port Cells in RapidSmith. Specifically, there are three types of port cells: IPORT, OPORT, and IOPORT. The method CellDesign.getPorts() can be used to iterate through the port Cells in a design, and the method Cell.isPort() can be used to determine if a given Cell is actually a port.
- Cells can be MACRO cells or LEAF cells. Leaf cells are the types of cells described in subsection 5.1. Macro cells are described in more detail in subsubsection 5.2.5.

#### 5.2.3 CellPin

There are two types of CellPin objects in RapidSmith: (1) BackedCellPins and (2) PseudoCellPins. BackedCellPins are straightforward. They represent a LibraryCellPin, which is a pin attached to the corresponding LibraryCell of a Cell. These pins are used as signal input and signal output. On the other hand, PseudoCellPins are "fake" CellPin objects which are added to a Cell after the Cell has been created. They don't affect the functionality of the Cell, but are needed for physical implementation. An example of where a PseudoCellPin is needed can be seen in Figure 9.



**Figure 9:** An example where a PseudoCellPin in RapidSmith may be needed. Both a PseudoCellPin and BackedCellPin are pointed out. The red net in the image (going into the A6 pin) is the power net (VCC).

In the figure, a cell of type LUT2 has been placed onto a D6LUT BEL of a SLICEL. The two CellPins of the LUT2 have been mapped to the A1 and A4 pins of the BEL (which have been highlighted in blue), connected to Nets, and routed. This is expected behavior based on the netlist. Also, the global VCC net has been routed to the A6 BelPin. This is necessary due to the hardware implementation of LUTs. However, in the netlist view of the circuit, the fact that VCC needs to be routed to the A6 pin is not at all represented (a CellNet can only be connected to CellPins). This is where PseudoCellPins are helpful. In RapidSmith, a PseudoCellPin can be added to the LUT2 cell (called "VCCPin"), mapped to the A6 BelPin, and attached to the global VCC CellNet. This gives a more complete view of how the netlist needs to be physically implemented. PseudoCellPins are created and added automatically to Cells when importing a Tincr checkpoint. If you are creating Cells from scratch, a PseudoCellPin can be added with the method call Cell.attachPseudoPin(string,PinDirection). To determine if a CellPin is a PseudoCellPin, the method call CellPin.isPseudoPin() can be used. That being said, in general users do not need to worry about the different between pseudo and backed CellPins when creating a CAD algorithm. They function identically, so you can use the higher level handles to the CellPin interface instead.

Each CellPin object also has a CellPinType and PinDirection enumeration. Table 1 displays the possible values for both of these enumerations. The CellPinType field can be used to find all RESET CellPins in a design, determine if a CellNet is a clock net (it connects to pins of type CLOCK), and other useful functions. The PinDirection field is typically used to filter a list of CellPins by their direction. It it especially useful in determining if a pin on a Cell is of type INOUT.

#### 5.2.4 CellNet

CellNets are used to wire components of a logical netlist together. Specifically, a CellNet connects an output CellPin to several input CellPins with the purpose of transferring a signal from one Cell to another. The remainder of this section details important aspects about CellNets in RapidSmith.

- CellNets are routed using the RouteTree data structure. Routing in RapidSmith is described in more detail in section 7.
- All CellNets have a NetType enumeration. Possible values for NetType include VCC, GND, and WIRE.
   VCC is reserved for power nets, GND is reserved for ground nets, and WIRE represents all other nets in the design. Other NetTypes could be added to this list in the future (such as CLOCK), but for now they are not differentiated.
- VCC and GND nets (also called static nets) are treated specially in RapidSmith in two ways. (1) Vivado designs can have multiple nets that correspond to VCC or GND, but they all logically represent the same thing. This representation can be challenging to work with when designing CAD tools. RapidSmith takes a different approach. When a design is imported from Vivado, all VCC nets are collapsed into a single VCC net that is

| Enum         | Values       |  |  |
|--------------|--------------|--|--|
| CellPinType  | CLEAR        |  |  |
|              | CLOCK        |  |  |
|              | ENABLE       |  |  |
|              | PRESET       |  |  |
|              | RESET        |  |  |
|              | REUSED       |  |  |
|              | SET          |  |  |
|              | SETRESET     |  |  |
|              | WRITE_ENABLE |  |  |
|              | DATA         |  |  |
|              | PSEUDO       |  |  |
| PinDirection | IN           |  |  |
|              | OUT          |  |  |
|              | INOUT        |  |  |

Table 1: Enumerations for the CellPin class

sourced by a global VCC cell (the same applies to GND nets). We have found this representation much easier to work with. The function calls <code>CellDesign.getVccNet()</code> and <code>CellDesign.getGndNet()</code> can be used to retrieve these special nets from the design. If you add a VCC net to a design that already has a VCC net, the old net will be overridden. (2) When a static net is routed, it doesn't have a single source driving it. Rather, the net is partitioned into sections with each section having a single source. In an FPGA, each of these sources corresponds to a TIEOFF within a switchbox tile. This is shown in Figure 10.



**Figure 10:** Static net multiple sources example. The higlighted white wires are a part of the same GND net, and the circled red areas are the source TIEOFFs for the GND net. In this image, only two sources are shown.

RapidSmith handles this oddity by allowing CellNets to have more than one RouteTree object associated with it. In the case of Figure 10 the GND net would have two RouteTree objects associated with it, one for each source TIEOFF.

• Most CellNets have a single source pin and multiple sink pins (referred to as the "fanout" of the net). When routing a net that fits this description, only one RouteTree needs to be specified. However, there are two noticable exceptions to the common case: (1) static nets (described in the bullet point above) and (2) nets with multiple drivers (involving IO pins). In a Xilinx FPGA, this is most often seen in IOB pads. An example is shown in Figure 11. The highlighted net can be driven by both the OBUF output, and from an external source. RapidSmith 2 supports nets with multiple drivers. The function call CellNet.getAllSources() can be used to retrieve all possible source CellPins of a net.



Figure 11: An example of net (highlighted in white) that can be driven by more than once source.

• After a design has been placed, CellNets fall into one of two categories: INTRASITE or INTERSITE. Figure 12 shows an example of both types of nets. As can be seen, INTRASITE nets do not cross Site boundaries while INTERSITE nets stretch across multiple Sites. These types of nets are handled differently during routing. To determine if a CellNet is an INTRASITE net, the method call CellNet.isIntrasite() can be used.



Figure 12: Example of an INTRASITE net (left) and an INTERSITE net (right).

#### 5.2.5 Macro Cells

Most cells in RapidSmith or Vivado designs are leaf cells (LUTs, Flip Flops, etc.). However, macro cells can also exist. A macro cell can conceptually be thought of as a wrapper for related leaf cells. Figure 13 shows an example of an "IOBUF" macro cell in Vivado, which is a bidirectional buffer.



Figure 13: An IOBUF macro in Vivado.

As the figure shows, the macro consists of two internal cells: one of type "OBUFT" and another of type "IBUF". The two internal cells are encapsulated into an external macro cell with the name "MemDB\_IOBUF[0]\_inst", and wired together accordingly. External cell pins of the macro connect to one or more internal cell pins. Also, internal nets can exist in macros that connect two internal cells together. RapidSmith now supports importing macro cells from Vivado and adding them to a CellDesign. Code Sample 2 gives a brief introduction to using macro in RapidSmith.

## Code Sample 2: How to use macros in RapidSmith

```
// Get a handle to a design and cell library
  CellDesign design = getCellDesign();
   CellLibrary libCells = getCellLibrary();
   // Create a new macro cell and add it to a design
   Cell macro = new Cell("myMacro", libCells.get("IOBUF"));
   design.addCell(macro);
  // Connect the macro cell to a net
  CellPin pin = macro.getPin("IO");
   design.getNet("TmpNet").connectToPin(pin);
  // Iterate through a list of all cells (macro and leaf cells) of a design
   for (Cell cell : design.getCells) {
14
     if ( cell.isMacro() ) {
15
       List<Cell> internalCells = cell.getInternalCells();
16
       List<CellNet> internalNets = cell.getInternalNets();
       List<CellPin> externalPins = cell.getPins();
18
       // do something with the macro info
19
     }
2.0
21
     else {
       // do something with a regular leaf cell
24
   }
```

As the code example above shows, macro cells are generally used exactly like regular leaf cells. However, there are a few distinctions between macro cells and leaf cells.

- When a macro cell is added to a CellDesign, all of the internal cells and internal nets are automatically added to the design as well. You do not have to worry about adding these yourself. When a macro cell is removed from a CellDesign, the internal cells and nets are also removed.
- When a macro cell pin is connected to a CellNet, RapidSmith automatically connects the net to the corresponding internal pins. When a macro cell pin is disconnected from a CellNet, the internal cell pins are disconnected. Nets in RapidSmith only connect to leaf cells (i.e. it is essentially a flattened netlist).
- Internal cells and nets within a macro cannot be individually added or removed from a CellDesign. An exception will be thrown if you try to do this. Instead, the entire macro cell must be added or removed.
- Macros cannot be placed. Rather, the internal cells of a macro should be placed instead.
- When a CellDesign is exported from RapidSmith, macro cells are not exported. Only the internal cells and nets are exported and the macro is not rebuilt in Vivado.

#### 5.2.6 PropertyList

Most objects in Vivado's Tcl interface have attached properties. These properties can be used to describe attributes of the object (such as name, type, etc.), but they can also be used for configuring the object. Figure 14 shows a list of properties for a FDCE flip flop cell in Vivado.

| Property      | Tyne   | Read-only | Value               |
|---------------|--------|-----------|---------------------|
| BEL           |        | false     |                     |
| DEB           | Scring | Laibe     | NHICEH.DOIT         |
| CLASS         | string | true      | cell                |
| FILE_NAME     | string | true      | C:/Users/ecestudent |
| INIT          | binary | false     | 1'b0                |
| IS_BEL_FIXED  | bool   | false     | 0                   |
| IS_BLACKBOX   | bool   | true      | 0                   |
| IS_DEBUGGABLE | bool   | true      | 1                   |
| IS_LOC_FIXED  | bool   | false     | 0                   |
| IS_ORIG_CELL  | bool   | true      | 0                   |
| IS_PRIMITIVE  | bool   | true      | 1                   |
| IS_SEQUENTIAL | bool   | true      | 1                   |
| LINE_NUMBER   | int    | true      | 231                 |
| LOC           | site   | false     | SLICE_X5Y92         |
| NAME          | string | true      | arrow_addr_reg[9]   |

Figure 14: Properties of an FDCE cell in Vivado.

If you are interested in exploring object properties in Vivado, the following Tcl command can be used where "object" is a cell, bel, or other Tcl object.

```
vivado% report_property $object
```

Cells are the most interesting objects in terms of properties. This is because the function of a Cell is determined by how it is configured. For example, the memory width of a BRAM cell in Vivado is configured by setting the "READ\_WIDTH" and "WRITE\_WIDTH" properties of the cell. Possible values include 1, 2, 4, 9, 18, 36 and 72. The operation of the BRAM is different depending on how this property is set. Another example is a D flip flop cell (FDRE) and its "IS\_C\_INVERTED" property. This property indicates if the flip flop will be rising-edge or falling-edge triggered. On design export, the properties of cells, nets, and the top-level design are included in the output EDIF netlist of a Tincr Checkpoint (if they are not the default value).

When RapidSmith parses the EDIF file in a Tincr Checkpoint, the properties within are stored in a data structure called a PropertyList. Each CellDesign, Cell, and CellNet in RapidSmith has an associated PropertyList object. The PropertyList for each Cell in the design also has a list of default configuration properties. Configuration properties for Cells are always included in the PropertyList even if they are not explicitly set by the user. As previously stated, this is because the functionality of the cell is dependent on the values of its configuration properties. Code Sample 3 shows some basic property usage.

#### Code Sample 3: Using PropertyLists in RapidSmith

```
// Create a new FF cell with default properties
  CellLibrary libCells = getCellLibrary();
  LibraryCell libCell = libCells.get("FDRE");
  Cell cell = new Cell("myCell", libCell);
  // Get a handle to the cells properties
  PropertyList properties = cell.getProperties();
  // Print the configurable properties of the cell
  for(String propName : libCell.getConfigurableProperties()) {
     Property prop = properties.get(propName);
     System.out.println(propName + ":");
     System.out.println("\tDefault -> " + prop.getStringValue());
     System.out.println("\tPossible -> " + libCell.getPossibleValues(propName));
14
15
16
  // Iterating over a PropertyList
  for (Property prop : properties) {
18
     System.out.println(prop.getKey() " -> " + prop.getStringValue());
19
  }
20
  // Change the FF to a falling edge flop...this will override the default property
  properties.update("IS_C_INVERTED", PropertyType.EDIF, "1'b1");
```

Here are some additional notes about cell properties that may be helpful:

To learn what configuration properties a Cell has, open up a Vivado design and type the following into the
console,

```
Vivado% link_design -part xc7a100tcsg324-3 -quiet
Vivado% set lib_cell [get_lib_cells RAMB36E1]
Vivado% report_property $lib_cell
```

replacing "xc7a100tcsg324-3" with the part you are using and "RAMB36E1" with your cell of interest. All properties that start with "CONFIG" represent configuration properties that a user can set.

- Because EDIF properties only support String, Integer, and Boolean types, any properties imported from the
  EDIF file will be one of these types. However, from our testing, Vivado always exports its properties as strings.
  RapidSmith makes no attempt to parse the Vivado properties into their corresponding data structures.
  All Vivado properties are represented using Strings, and it is currently up to the user to parse the properties if they need to.
- To learn what configuration properties **affect the pin mappings of a cell**, open the *cellLibrary.xml* file you have, and search for the tag "libcellproperty." Pin mappings are described in more detail in section 6.
- Only properties of type PropertyType.EDIF will be exported from RapidSmith. If you create cells in your
  design and add properties to them, make sure that you mark the properties as EDIF properties if you want them
  to be exported All other properties will be ignored during design export.

• Some properties not only affect how a Cell is configured, but they can also affect how a Site is configured. For example, on SLICEL sites there exists a clock routing mux that chooses between a regular clock signal and an inverted signal (Figure 15). The output of this inverter is connected to all flip flops of the SLICEL, and so decides whether the flip flops are all rising-edge or all falling-edge triggered. The clock that is selected is decided by the property "IS\_C\_INVERTED" of the flip flops cells **that have been placed onto the SLICEL**. The clock inverter is programmed automatically based on the cell properties. There are other such properties, but they will not be listed here. When creating designs in RapidSmith, it is important to not place Cells together that may have conflicting properties. RapidSmith does not perform any error checking, and so an error in Vivado will be thrown if you violate property restrictions.



**Figure 15:** A SLICEL clock inverter mux. The highlighted white portion of the figure shows the connection of the clock inverter.

## 5.3 The Cell Library

The CellLibrary in RapidSmith stores important information about each of the LibraryCells compatible with a specific Xilinx FPGA part. This includes:

- The name of the library cell
- The name, direction, and type of each library cell pin
- Valid placement locations for instances of the library cell
- Possible logical-to-physical pin mappings for every library pin
- The type of library cell (macro or simple)
- Configuration properties of the library cell (TODO)

When doing any type of netlist modification, the CellLibrary is needed. Currently, each CellLibrary corresponds to a specific Xilinx part. This means that for each part you use in RapidSmith, a new CellLibrary needs

to be generated <sup>2</sup>. The rest of this section demonstrates how to get access to a cell library in RapidSMith, and how to create a new cell library XML file from Tincr.

## 5.3.1 Loading a Cell Library

When a Tincr checkpoint is imported into RapidSmith, a CellLibrary is automatically loaded. To get a handle to the CellLibrary, you can use the method call TincrCheckpoint.getLibCells() on the loaded checkpoint object. If you don't have a handle to a Tincr Checkpoint, then the cell library XML can be loaded directly from disk. Code Sample 4 demonstrates both methods of loading a CellLibrary.

#### Code Sample 4: How to load a CellLibrary

#### 5.3.2 Adding Custom Macros to a Cell Library

Customized macros can now be added to a CellLibrary if desired. This can be accomplished in two easy steps.

1. Create an XML specification of your macro that follows the format laid out below:

```
<?xml version="1.0" encoding="UTF-8"?>
<root.>
 <macros>
    <macro>
      <type>RAM128X1D</type>
      <!-- List of internal cells with name and leaf type of each -->
      <cells>
         <internal>
            <name>DP.HIGH</name>
            <type>RAMD64E</type>
         </internal>
         <internal>
            <name>DP.LOW</name>
            <type>RAMD64E</type>
         </internal>
         <internal>
            <name>F7.DP</name>
            <type>MUXF7</type>
         </internal>
      </cells>
      <!-- List of macro pins with name, direction, pin type, and internal
         connections -->
      <pins>
         <pin>
            <name>DPO</name>
            <direction>output</direction>
            <type>MACRO</type>
```

<sup>&</sup>lt;sup>2</sup>This may change to be family-specific in the future. Usually, parts in the same family can use the same CellLibrary. The part-specific requirement is very conservative, but is one that we know is right

```
<internalConnections>
               <pinname>F7.DP/O</pinname>
            </internalConnections>
         </pin>
         <pin>
            <name>SPO</name>
            <direction>output</direction>
            <type>MACRO</type>
            <internalConnections>
               <pinname>F7.SP/O</pinname>
            </internalConnections>
         </pin>
         <pin>
            <name>A[6]</name>
            <direction>input</direction>
            <type>MACRO</type>
            <internalConnections>
               <pinname>F7.SP/S</pinname>
               <pinname>DP.HIGH/WADR6</pinname>
               <pinname>DP.LOW/WADR6</pinname>
               <pinname>SP.HIGH/WADR6</pinname>
               <pinname>SP.LOW/WADR6</pinname>
            </internalConnections>
         </pin>
         . . .
      </pins>
      <!-- List of internal nets, and the internal cell pins they connect to -->
      <internalNets>
         <internalNet>
            <name>DPOO</name>
            <pins>
               <pinname>F7.DP/I0</pinname>
               <pinname>DP.LOW/O</pinname>
            </pins>
         </internalNet>
      </internalNets>
   </macro>
 </macros>
</root>
```

2. Import the macro into the CellLibrary using the API call shown in Code Sample 5.

## Code Sample 5: Adding new macros to the Cell Library

```
// Get a handle to a CellLibrary
CellLibrary libCells = getCellLibrary();

// Add the macros in an XML file.
libCells.loadMacroXML(Paths.get("myMacro.xml");
```

Once this is complete, you can use your custom macro in a CellDesign like a normal cell.

#### 5.3.3 Creating a New Cell Library

During runtime, a CellLibrary data structure is created by parsing a *cellLibrary.xml* file into memory. So, to create a new CellLibrary all you have to do is generate a new *cellLibrary.xml* file for the Xilinx part you are interested

- in. This can be done in a few easy steps.
  - 1. Open Vivado in Tcl mode, and type the following into the console:

```
::tincr::create_xml_cell_library xc7a100tcsg324-3 mycellLibrary.xml
```

Replace "xc7a100tcsg324-3" with the part you want to generate and "mycellLibrary.xml" with the location where you want to store the generated cell library XML. This will generate most of what you need in the cell library XML automatically.

- 2. Series7: If you are generating a cell library for a series7 device, open the generated XML file in a text editor and search for the "CARRY4" cell.
- 3. Series7: Scroll down to the "bels" XML element within the CARRY cell, and add the following lines to each pin that is named "CI":



You should have to insert this line in two places only.

- 4. Save your changes and exit the text editor
- 5. Copy the XML file to \$RapidSmith\_Path/device/family directory where "family" is replaced by the family of your part (such as Artix7), and "\$RapidSmith\_Path" is your RapidSmith repository location. Once this is done the new CellLibrary should be ready to use.

**NOTE**: Usually, parts in the same family can use the same CellLibrary.

## 6 Placement

In the original RapidSmith, placement occurred at the Site level. A collection of Cells were grouped together in what was called an Instance, and the Instance was assigned to a compatible site type. Where the individual Cells were actually placed within the Site was a blackbox. Because RapidSmith 2 breaks up a Site into its individual components, Cells can now be placed directly onto physical BELs within a Site. This gives Xilinx FPGA CAD developers finer grain control during the placement of a design, and allows sub-site algorithms (such as packing algorithms) to be explored. Code Sample 6 demonstrates the basic steps to placing a Cell in RapidSmith.

#### Code Sample 6: Steps for placing a Cell in RapidSmith

```
// Load the device and design
  TincrCheckpoint tcp = VivadoInterface.loadTcp("myCheckpoint.tcp");
  Device device = tcp.getDevice();
  CellDesign design = tcp.getDesign();
  // Get a handle to a Cell and Bel. The cell is of type LUT2
  Cell cell = design.getCell("MyCell");
  Bel bel = device.getSite("SLICE_X40Y137").getBel("D6LUT");
  // Place the cell onto the bel
10
  design.placeCell(cell,bel);
  // Two ways to map bel pins
  CellPin pin1 = cell.getPin("I0");
  CellPin pin2 = cell.getPin("I1");
  CellPin pin3 = cell.getPin("0")
16
  // First way
  pin1.mapToBelPin(bel.getPin("A1"));
  pin2.mapToBelPin(bel.getPin("A2"));
22
  // Second way
  List < BelPin > possible = pin3.getPossibleBelPins(bel);
  pin3.mapToBelPin( possible.get(0) );
```

As the code listing shows, there are two steps to placing a RapidSmith Cell. The first is straightforward: get a handle to a Bel object, and use the method call CellDesign.placeCell(Cell, Bel) (line 11). This will map a logical Cell onto a physical Bel. Once a Bel has been used, no other Cells can be mapped to it. No error checking is performed to ensure that a Cell is compatible with the Bel it is being placed on, that is up to the user. The second step is more complicated: each logical CellPin of the Cell needs to be mapped to a physical BelPin. This can be done by either (a) specifying the BelPin name (lines 19-20), or (b) using the function call CellPin. getPossibleBelPins (Bel) (line 23) to get a list of valid BelPin mappings. For most Cells each CellPin only maps to a single BelPin, and so option (b) is preferable. However, there are two noticable exceptions to this rule.

- 1. LUT input pins are permutable. This means that an input CellPin attached to a LUT Cell can be mapped to any input pin of a LUT Bel. Figure 16 shows an example of this functionality in Vivado. Once the input pins have been mapped, a configuration equation is created for and applied to the LUT Bel using the appropriate BelPin inputs. In order for a LUT Cell to be physically implemented, these pins need to be mapped. The function call CellPin.getPossibleBelPins (Bel) in this case will return all of the input BelPins of the LUT and the user can decide which ones to use.
- 2. Logical-to-physical pin mappings can change **based on how a cell is configured**. For example, on a RAMB36E1 Cell, the least-significant data input pins map to different physical BelPins when the width of the BRAM is set to 72. This is an important concept when performing netlist modifications in RapidSmith. The function call CellPin.getPossibleBelPins (Bel) only returns the pin mappings for the **default** Cell configuration. The correct pin mappings are always used when a Tincr checkpoint is imported into RapidSmith, but if



**Figure 16:** An example of LUT pin permutations. The left figure shows the schematic of a LUT2 cell. The middle figure shows the two input pins of the LUT cell being mapped to the A1 and A2 pins of a LUT6 bel. The right figure shows the same input pins being mapped to the A6 and A5 pins instead.

new logic is being added to a design it is up to the user to determine the proper pin mappings. Adding support for automatically determining a Cells pin mappings based on a its configuration is currently being worked on, but is not yet ready. For now, users can configure a cell in Vivado, place it, and use the TCL function shown in Code Sample 7 to determine the correct pin mappings for a Cell.

Code Sample 7: TCL script to print all logical-to-physical pin mappings of a cell

```
proc print_pin_mappings{cell} {

foreach cell_pin [get_pins -of $cell] {
 puts "$cell_pin -> [get_bel_pins -of $cell_pin]"
}

}
```

Some additional points about placement to be aware of include:

- VCC and GND Cells are not placed when implementing a design in Vivado. This distinction is applied to RapidSmith 2 as well. Rather than placing VCC or GND explictly, RouteTrees that are sourced by switchbox TIEOFFs are used to express their placement implicitly (VCC/GND is "placed" on the TIEOFF).
- A list of valid placement options for a Cell can be obtained with the function call Cell.getPossible—Anchors(). The sample program **CreateDesignExample** (described in subsection 9.6) gives a good example of how to use this function. The reader is referred to the source code and Javadocs for more details.
- If you plan on writing a placer in RapidSmith, there are several placement rules for a given device. A few examples include (a) CARRY4 Cells that are connected through the carry chain need to placed vertically to one another so they can use the dedicated carry wires, and (b) A RAMB36E1 cell cannot be placed in the same tile as a RAMB18E1 cell. If either of these rules are violated, an error will be thrown in Vivado. It is the responsibility of the user to determine all relevant placement rules because error checking is not performed on design export. The source code for a sample simulated annealing placer for the Artix-7 part xc7a100tcsg324 can be found in the package edu.byu.ece.rapidSmith.examples.placerDemo.
- Macro cells in RapidSmith cannot be placed. The internal cells of a macro should be placed instead.

## 7 Routing

After placement, all Cells of a design have been mapped to Bels, and their corresponding CellPins mapped to BelPins. The next (and final) step of the FPGA implementation flow is to physically wire together the used BelPins. This is known as routing. Routing involves taking each logical Net of a design, determining the BelPins they are connected to (based on the CellPins), and finding a list of physical wires that electrically connect the pins together. This section details how routing is done in RS2.

#### 7.1 Wires and WireConnections

Routing in RapidSmith is done using Wire objects, which are described in subsubsection 4.1.3. Wires are uniquely identified by their corresponding tile and wire name (i.e. tile1/wire1), and are connected through WireConnection objects. There are two types of wire connections:

- 1. **PIP Connections**: Connect two different wires through a Programmable Interconnect Point. Most PIP connections are found in switchbox tiles of an FPGA part (as shown in Figure 5). These types of connections are important to FPGA routing, because they dynamically configure the routing network for a given design.
- 2. Non-PIP Connections: Connect the same physical wire across two different tiles. In general, wires stretch across multiple tiles in a device, having a different name in each tile. This is demonstrated in Figure 17. The example wire shown in the figure spans 5 different tiles, but has a different name in each. To save space, only the source and sink wire segments are kept in RapidSmith data structures (INT\_X1Y1/E2BEG4, INT\_X2Y1/E2MID4, and INT\_X3Y1/E2END4). The source segment is connected to each sink segment through a non-PIP wire connection. It is also possible to have non-PIP connections within a Tile, but this is rare.



Figure 17: A wire in an FPGA illustrating how each part of the wire has a different name depending on the tile it is located in

#### 7.2 Traversing Wire Objects

Traversing through wires in a device is straightforward. Given a handle to a Wire object named "mywire" or a WireConnection object named "wc", the following function calls can be used:

- mywire.getWireConnections(): Returns a collection of all WireConnections with the source wire of mywire. This collection can be iterated through to find all places a specific wire goes (i.e. what wires it connects to).
- wc.isPip(): Returns true if the wire connection "wc" is a PIP connection. Returns false otherwise.
- wc.getSinkWire(): Returns the sink wire of a wire connection.

In general, these are the only three functions that are needed to search through the wires of a FPGA device. It is important to note however that the first wire in the route has to be either (a) created using a TileWire constructor, or (b) retrieved from a function call of another object (such as SitePin::getExternalWire()). Code Sample 8 demonstrates how to iterate through WireConnection objects. To gain a better understanding of how to use Wires and WireConnections, see the **HandRouter** example in the RapidSmith repository.

## 7.3 Other Types of Connections

Along with PIP and non-PIP wire connections, there are several other types of connections in RapidSmith. The source of the connection is always a Wire object, but the sink object differs. A description of these connections is found below:

- Site Pin Connections: Connects a Wire to a SitePin. The function call conn.getSitePin() can be used to get the sink SitePin object.
- Terminal Connections: Connects a Wire to a BelPin. The function call conn.getBelPin() can be used to get the sink BelPin object.
- Site Routethrough Connections: Connects an input site Wire to an output site Wire. A Site in Vivado can be configured to pass the signal on an input SitePin directly to an output SitePin. These connections are represented as routethroughs in RapidSmith and can be determined with the function call conn. isRoutethrough(). NOTE: Before using this type of connection when building a routing data structure, make sure the Site is unused.
- BEL Routethrough Connections: Connects an input BEL Wire to an output BEL Wire. Similarly to a Site routethrough, LUTs in Vivado can also be configured as routethroughs. In this case, the signal on one of the input pins of the LUT is passed directly to the output pin of the LUT. A BEL routethrough connection can be used to configure a LUT as a routethrough in RapidSmith. NOTE: Before using this type of connection, make sure the LUT is unused (i.e. no cell is placed on it).

When traversing through the device data structure, a generic Connection object is usually used. This connection can refer to any of the connections described so far in this documentation. Code Sample 8 demonstrates how to iterate through different Connection types in RapidSmith.

#### Code Sample 8: How to iterate over Connections in RapidSmith

```
// Get a handle to a wire
  Wire wire = sitePin.getExternalWire();
  // Iterate over all WireConnections
   for (Connection conn : wire.getWireConnections()) {
     Wire sinkWire = conn.getSinkWire();
     if (conn.isPip()) {
       // Do something with a PIP
10
     else if (conn.isRoutethrough()) {
       // Do something with a routethrough
14
     else {
       // Do something with a regular wire connection
15
16
  }
18
  // Iterate over all Pin Connections
19
  for (Connection conn : wire.getPinConnections() ) {
     SitePin pin = conn.getSitePin();
     // Do something with the SitePin
  }
24
  // Iterate over all Terminal Connections
  for (Connection conn : wire.getTerminals()) {
     BelPin bp = conn.getBelPin();
     // Do something with the BelPin
  }
```

7.4 RouteTrees 34

```
// Iterate over all connections at the same time
   Iterator<Connection> connIt = wire.getAllConnections().iterator();
33
   while (connIt.hasNext()) {
34
     Connection conn = connIt.next();
35
36
     if (conn.isPinConnection()) {
37
       //do something with the site pin
38
39
     else if (conn.isTerminal()) {
       //do something with the bel pin
41
42
     else if (conn.isPip()) {
43
       //do something with the pip wire connection
44
45
46
     else {
       //do something with regular wire connection.
47
48
49
   }
```

#### 7.4 RouteTrees

Wires and WireConnections are the fundamental objects used to specify and explore routing in RapidSmith, but they need to be organized in a higher-level data structure to give meaning to the route of a CellNet. In the original RapidSmith, creating this data structure was up to the user. RapidSmith 2 introduces the RouteTree, which can be see in Figure 18.



Figure 18: Visual representation of the RouteTree data structure.

As the figure shows, a RouteTree is a simple tree data structure. Each node in the tree represents a physical wire, and is connected to other nodes (wires). Edges in the tree represent wire connections (i.e. how one wire connects to another). A RouteTree can also be conceptually thought of as a graph, with a single "starting" node and several "sink" nodes. A RouteTree node contains the following members:

• Wire: The physical Wire object that the RouteTree node represents. This can be either a TileWire or

7.4 RouteTrees 35

SiteWire.

- Source: A link to the the parent RouteTree node
- Connection: The Connection taken from the parent RouteTree node to reach the current RouteTree node. In other words, it is the WireConnection object that was taken from the parent wire to reach the current wire
- Sinks: A list of child nodes. There is no limit to how many children a RouteTree node can have.
- Cost (not shown): An optional cost field for routers

A complete RouteTree specifies how the source of a CellNet is physically connected to all of its sinks. Figure 19 shows an example of a complete RouteTree in RapidSmith.



Figure 19: A visual representation of a RapidSmith2 RouteTree. Nodes represent Wires, red edges represent PIP wire connections, and black edges represent non-PIP wire connections. This specific RouteTree was created from the **AStarRouter** example.

7.5 Three Part Routing 36

In this case, the CellNet that is being routed has once source pin, and two sink pins. The source pin is connected to wire "CLBLM\_L\_X8Y97/CLBLM\_L\_DQ", and the sink pins are connected to the wires "CLBLM\_R\_X1Y97/CLBLM\_M\_AA". Starting from the source, wires are traversed downward (via wire connections) until the target wires are reached. Code Sample 9 demonstrates the basic usage of RouteTrees in RapidSmith. The **DesignAnalyzer**, **AStarRouter**, and **HandRouter** examples described in section 9 also demonstrate how to traverse and build a RouteTree.

#### **Code Sample 9:** Building a RouteTree

```
// Find a Wire to start the RouteTree at
  Site site = device.getSite("SLICE_X5Y84");
  SitePin pin = site.getSitePin("DQ");
  Wire startWire = sink.getExternalWire();
  // Create the first node in the RouteTree
  Queue<RouteTree> rtQueue = new LinkedList<RouteTree>();
  RouteTree start = new RouteTree(startWire);
  rtQueue.add(start);
10
  // Build up the RouteTree somehow
12
  while (!amDone()) {
     RouteTree current = rtQueue.poll();
13
     Wire wire = route.getWire();
14
     for (Connection conn : wire.getWireConnections()) {
       // add qualified connections to the RouteTree
       if (isQualified(wire)) {
18
         RouteTree tmp = current.addConnection(conn);
19
         rtQueue.add(tmp);
21
22
  }
```

When a design is imported from Vivado, the routing information is parsed and loaded into RouteTrees for each CellNet. On design export, the RouteTree for each CellNet is traversed and converted into a Vivado ROUTE string. A user can use custom data structures to route a design, but it needs to be **converted to an equivalent RouteTree data structure** before exporting the design to Vivado.

#### 7.5 Three Part Routing

In RapidSmith 2, there are three sections to a routed CellNet.

- 1. The portion of the net that starts at the source BelPin, and is routed to an output SitePin. This part of the route exists completely inside of Site boundaries.
- 2. The portion of the net that starts at the output SitePin of part (1), and is routed to several sink SitePins. This part of the route is called the "INTERSITE" route because it connects Sites together. A typical router is responsible for routing this section of the net<sup>3</sup>.
- 3. The portion of the net that start at an input SitePin from part (2), and is routed to a sink BelPin. Since there can be several sink pins in a CellNet, this section of the net can have more than one component. This part of the route also exists completely inside of Site boundaries.

Figure 20 shows an example of the three parts of a routed CellNet.

<sup>&</sup>lt;sup>3</sup>VCC and GND nets don't follow this pattern. The only difference for VCC and GND is that they can have multiple intersite nets.

7.5 Three Part Routing 37



**Figure 20:** The three sections of a RapidSmith route.

There is a RouteTree object associated with each section of the three-part routing. A source RouteTree represents the orange wires in the figure above (part 1). An intersite RouteTree represents the green wires in the figure above (part 2). A list of sink RouteTrees represents the purple wires in the figure above (part 3, with a different RouteTree object for each Site). It is important to note that INTRASITE nets only have a source RouteTree because they are completely contained within a Site. Code Sample 10 demonstrates how to utilize three part routing in RapidSmith. When a design is imported into RapidSmith, the routing sections of each CellNet are created automatically.

#### Code Sample 10: Three-part routing in RapidSmith

```
// Get a handle to a routed net in the design
  CellNet net = design.getNet("myNet");
  // Handling the source RouteTree
  RouteTree source = net.getSourceRouteTree();
  net.setSourceRouteTree(createSourceRoute());
  // Handling the intersite RouteTree
  RouteTree intersite = net.getIntersiteRouteTree();
  net.addIntersiteRouteTree(createIntersiteRoute());
  // Iterate over a list of sink RouteTrees
13
  for (RouteTree rt : net.getSinkSitePinRouteTrees()) {
     // do something with the RouteTree
14
15
16
  // Or, get a RouteTree based on a SitePin
  for(SitePin sitePin : net.getSitePins()) {
     if (sitePin.isInput()) {
```

7.6 Routing in Vivado 38

```
RouteTree sinkTree = net.getSinkTree(sitePin);

// do something with the RouteTree

}

// Add a new sink RouteTree that starts at a SitePin

net.addSinkRouteTree(sitePin, createSinkRouteTree(sp));
```

## 7.6 Routing in Vivado

The reason a three-part routing distinction is necessary in RapidSmith, is due to how routing is represented in Vivado. Inside Site boundaries, a route is represented using SitePIPs. A string of enabled PIPs determines what pins are connected together within the Site. Between Site boundaries, a route is instead represented using wires. The wires are formatted into a Vivado ROUTE string, which uniquely specifies the intersite route for a net. The three-part routing representation makes this distinction explicit to the users of RapidSmith (it also makes import/export easier). When a design is exported from Vivado, the intrasite portions of a net are exported as SitePIPs, and the intersite portion of the net is exported as wires.

```
% An example of a string of used site pips in Vivado
{IUSED:0 IBUFDISABLE_SEL:GND INTERMDISABLE_SEL:GND}

% An example of a Vivado ROUTE string
{ CLBLL_LL_AQ CLBLL_LOGIC_OUTS4 { NW6BEG0 NE2BEG0 WR1BEG1 IMUX_L34
IOI_OLOGICO_D1 LIOI_OLOGICO_OQ LIOI_OO } IMUX_L1 CLBLL_LL_A3 }
```

## 7.7 IntrasiteRouting

On design import, the SitePIP information extracted from Vivado is stored into RapidSmith data structures, and used to resconstruct the three-part routing view. This gives the user two options when dealing with intrasite routing in RapidSmith: (1) use the three-part routing data structures, or (2) use the set of enabled SitePIPs that exist in each Site. It is up to user preference for which representation to use when writing a CAD tool, but in order to export a design back into Vivado the three-part routing data structures need to be converted to SitePIPs and assigned to their corresponding Site. Code Sample 11 demonstrates how this is currently done. This step needs to be taken only when you have modified the intrasite routing for a Site. We are working on improving the API for SitePIPs, and creating a way to automate conversion from three-part routing to SitePIP information.

## **Code Sample 11:** Configuring Site Pips

# 8 Design Import/Export

Importing and exporting designs between Vivado and RS2 is fairly straightforward. It is done using Tincr Checkpoint (TCP) files, which contain a variety of information about a digital design. This section describes in detail the following:

- The contents of TCP files
- The small differences between TCP files for design import and export
- How to load a TCP into RS2, and how to export a RS2 design to a TCP
- Some additional information that is imported with a TCP

## 8.1 RapidSmith Checkpoints

RapidSmith checkpoints (RSCP) are Tincr checkpoints generated in Vivado from the command ::tincr::write-rscp. These checkpoints are capable of representing a Vivado design at any stage of implementation (post-synthesis, post-place, and post-route), and can be parsed and imported into RapidSmith. As we will see in subsection 8.2 below, the format of these checkpoints differ slightly than those of regular TCPs. This is because a great deal of additional information needs to be added to RapidSmith checkpoints in order to import a complete representation of a Vivado design. The remainder of this subsection describes each of the file types within a RapidSmith checkpoint, and what they represent.

#### 8.1.1 design.info

The *design.info* file within a RSCP is used to include any useful information about a design. Currently, it only stores the part name that the design is implemented on. This file is reserved to add more additional information in the future.

#### 8.1.2 netlist.edf

The *netlist.edf* file within a RSCP is an EDIF netlist representing the logical portion of a design. It details all of the Cells, Nets, and Ports within a design, and is generated from Vivado using the Tcl command write\_edif. A RapidSmith CellDesign is created by parsing the EDIF file, and converting it into the appropriate RS2 data structures described in subsection 5.2. Below is an example of a RSCP EDIF file.

```
(Library work
 (edifLevel 0)
 (technology (numberDefinition ))
(cell add (celltype GENERIC)
  (view add (viewtype NETLIST)
   (interface
    (port a (direction INPUT))
    (port b (direction INPUT))
    (port cin (direction INPUT))
    (port cout (direction OUTPUT))
    (port s (direction OUTPUT))
   (contents
     (instance GND (viewref netlist (cellref GND (libraryref hdi_primitives))))
     (instance VCC (viewref netlist (cellref VCC (libraryref hdi_primitives))))
     (instance a_IBUF_inst (viewref netlist (cellref IBUF (libraryref
        hdi_primitives))))
     (instance b_IBUF_inst (viewref netlist (cellref IBUF (libraryref
        hdi_primitives))))
     (instance cin_IBUF_inst (viewref netlist (cellref IBUF (libraryref
        hdi_primitives))))
     (instance cout_OBUF_inst (viewref netlist (cellref OBUF (libraryref
        hdi_primitives))))
```

```
(instance cout_OBUF_inst_i_1 (viewref netlist (cellref LUT3 (libraryref
   hdi_primitives)))
  (property INIT (string "8'hE8"))
)
```

#### 8.1.3 placement.rsc

The placement.rsc file within a RSCP stores all of the placement information of a Vivado design. This includes which package pin every Port is mapped to, which Bel each Cell is placed on, and the logical-to-physical pin mappings for each CellPin. If a design has not yet been placed, this file will be empty. Below is an example of a RSCP placement file.

```
LOC a_IBUF_inst R10 IOB33 INBUF_EN LIOB33_SING_X0Y50
PINMAP a_IBUF_inst O:OUT I:PAD
LOC b_IBUF_inst T10 IOB33 INBUF_EN LIOB33_X0Y51
PINMAP b IBUF inst O:OUT I:PAD
LOC cin_IBUF_inst T9 IOB33 INBUF_EN LIOB33_X0Y51
PINMAP cin_IBUF_inst O:OUT I:PAD
LOC cout_OBUF_inst U13 IOB33 OUTBUF LIOB33_X0Y53
PINMAP cout_OBUF_inst O:OUT I:IN
LOC cout_OBUF_inst_i_1 SLICE_X0Y51 SLICEL A6LUT CLBLL_L_X2Y51
PINMAP cout_OBUF_inst_i_1 0:06 I0:A4 I1:A5 I2:A6
LOC s_OBUF_inst T13 IOB33 OUTBUF LIOB33_X0Y53
PINMAP s_OBUF_inst O:OUT I:IN T:TRI
PACKAGE_PIN R10 a
PACKAGE_PIN T10 b
PACKAGE_PIN T9 cin
PACKAGE_PIN U13 cout
PACKAGE PIN T13 s
```

#### 8.1.4 routing.rsc

The routing rsc file within a RSCP stores all of the routing information of a Vivado design. This includes:

- The used Site PIPs within each Site (which specifies the internal routing)
- A list of LUT Bels that are acting as static sources (see subsection 8.3 for more details)
- A list of LUT Bels that are being used as routethroughs (see subsection 8.3 for more details)
- The name of each INTRASITE net
- The name, connecting SitePins, and used TileWires of each INTERSITE net.
- Special routing information for the VCC and GND nets.

If Vivado has not yet been routed, this file will be mostly empty. Below is an example of the contents within a RSCP routing file.

```
SITE_PIPS SLICE_X9Y80 SRUSEDMUX:0 CEUSEDMUX:IN COUTUSED:0 CLKINV:CLK DCY0:DX ...
SITE_PIPS SLICE_X13Y80 PRECYINIT:AX SRUSEDMUX:0 CEUSEDMUX:IN COUTUSED:0 ...
SITE_PIPS SLICE_X15Y80 SRUSEDMUX:0 CEUSEDMUX:IN COUTUSED:0 CLKINV:CLK DCY0:DX ...
SITE_PIPS M17 OUSED:0

...
STATIC_SOURCES SLICE_X2Y106/D6LUT/O6 SLICE_X2Y106/C6LUT/O6 SLICE_X4Y106/D6LUT/O6 ...
```

```
LUT_RTS SLICE_X5Y101/B6LUT/A6/06 SLICE_X2Y100/A5LUT/A4/05 SLICE_X5Y100/C6LUT/A6/06 ...

INTRASITE AddSub[10]
INTERSITE AngStep1[0] SLICE_X2Y82/AX SLICE_X2Y87/AQ SLICE_X2Y84/A1
ROUTE AngStep1[0] CLBLM_R_X3Y87/CLBLM_M_AQ CLBLM_R_X3Y87/CLBLM_LOGIC_OUTS4 ...

VCC INT_L_X2Y107/VCC_WIRE INT_L_X2Y107/VCC_WIRE INT_L_X2Y107/VCC_WIRE ...
START_WIRES INT_L_X2Y107/VCC_WIRE INT_L_X2Y106/VCC_WIRE INT_R_X3Y106/VCC_WIRE ...
GND INT_R_X3Y106/GND_WIRE INT_R_X3Y106/GND_WIRE INT_R_X3Y105/GND_WIRE ...
START_WIRES INT_R_X3Y106/GND_WIRE INT_L_X4Y106/GND_WIRE INT_R_X3Y105/GND_WIRE ...
```

#### 8.1.5 contraints.rsc

The *contraints.rsc* file within a RSCP stores all XDC constraints on a Vivado design. XDC constraints are similar to UCF files for ISE designs. They can be used to set the clock frequency, constrain a top-level port to a specific package pin on the device, and set other physical implementation details. An example of a RSCP constraints file can be seen below.

```
create_clock -period 5.000 -name sysClk -waveform {0.000 2.500} set_property IOSTANDARD LVCMOS18 [get_ports clk] set_property IOSTANDARD LVCMOS18 [get_ports ena] set_property PACKAGE_PIN E15 [get_ports {Yin[12]}] set_property PACKAGE_PIN H17 [get_ports {Xin[14]}] set_property PACKAGE_PIN D18 [get_ports {Xin[7]}]
```

As can be seen, a Vivado constraints file is essentially a list of Tcl commands to execute before generating a bitstream. When importing a RSCP into RapidSmith, each constraint is parsed into a XdcConstraint object and stored in the top-level CellDesign. Each XdcConstraint has two fields, a String command and a String representing all command arguments. Code Sample 12 demonstrates how to manipulate XDC constraints in RapidSmith. Future RS2 plans include creating a more intelligent way of handling XDC constraints.

#### **Code Sample 12:** How to manipulate XDC constraints in RS2

#### 8.1.6 macros.xml

The *macros.xml* file within a RSCP contains template information about macro cells in a Vivado design that (a) were not fully flattened, and (b) do not exist in the default Vivado cell library. Using the information in the XML file, a LibraryCell can be created and added to the CellLibrary so that the design can import correctly. More information about macro cells in RapidSmith can be found in subsubsection 5.2.5.

8.2 Tincr Checkpoints 42

```
<?xml version="1.0" encoding="UTF-8"?>
<root.>
 <macros>
   <macro>
      <type>IOBUF</type>
      <cells>
         <internal>
            <name>IBUF</name>
            <type>IBUF</type>
         </internal>
         <internal>
            <name>OBUFT</name>
            <type>OBUFT</type>
         </internal>
      </cells>
      <pins>
         <pin>
            <name>IO</name>
            <direction>inout</direction>
            <type>MACRO</type>
            <internalConnections>
                <pinname>IBUF/I</pinname>
                <pinname>OBUFT/O</pinname>
            </internalConnections>
         </pin>
         . . .
      </pins>
   </macro>
 </macros>
</root>
```

## 8.2 Tincr Checkpoints

Tincr checkpoints are very similar to RapidSmith checkpoints, but they can be used to import designs back into Vivado after they have been modified in RapidSmith. The main different between TCPs and RSCPs, is that the *placement.rsc*, *routing.rsc*, and *constraints.rsc* files turn into *placement.xdc*, *routing.xdc*, and *constraints.xdc* respectively. Vivado is able to parse the XDC files and apply physical information to the design. On design export, RapidSmith produces a TCP that is compatible with Vivado. The file formats of the three XDC files will not be included in this documentation. If you are interested in their format, the best way to learn is to consult the Vivado user guide and generate a TCP from RapidSmith and view the individual files.

#### **8.3** Additional Information

The reason RSCP and TCP checkpoints are distinguished, is that there are several design implementation aspects in Vivado that aren't explictly represented. In order to accurately represent a design in RapidSmith, they must be included somehow. This section describes the parts of the design that aren't explicitly represented in Vivado, and how RapidSmith handles them. subsection 8.4 describes how to get a handle to the additional information after a checkpoint has been imported.

## 8.3.1 LUT Routethroughs

Besides their use in implementing logic equations, LUT BELs can also act as signal routethroughs. In Vivado, a LUT is marked as a routethrough when its configuration equation (CONFIG.EQN) maps the value of a single input pin directly to the output pin. These LUTs are not explicitly represented in the logical netlist because there is no cell placed on the corresponding BEL. Figure 21 shows two examples of LUTs in Vivado that are configured as routethroughs.



**Figure 21:** Two examples of LUTs being used as routethroughs in the Vivado GUI. The RT on the left uses the A6 input pin, while the RT on the right uses the A4 input pin. The net highlighted in red represents VCC.

When a design is exported from Vivado, all LUTs that are acting as routethroughs are included in the *routing.rsc* file of a RSCP. When the file is parsed, each routethrough is stored in a BelRoutethrough object which contains the physical Bel, the input BelPin, and the output BelPin that uniquely describes the routethrough. It is important to note that if a Bel is being used as a routethrough, no Cell can be placed there. When creating CAD tools, BelRoutethroughs give the user a more complete picture of what Bels are actually available.

As described in subsection 7.3, there is a Connection object in RapidSmith from every input LUT pin to the LUTs output pin. These Bel routethrough connections, are actually just WireConnections with the isRoute-through () method returning true. In order to use a LUT as a routethrough in RapidSmith, you simply have to include the corresponding WireConnection within an INTRASITE RouteTree. When a design is exported from RapidSmith, the routethrough WireConnections are automatically detected and converted to a passthrough LUT Cell. The netlist is rewired appropriately.

## 8.3.2 Static Source LUTs

LUTs can also act as a static power or ground source. This occurs when the configuration equation is set to 1 or 0. An example of a LUT that exhibits this behavior is shown in Figure 22. When a design is exported from Vivado, all LUTs that are acting as static sources are included in the *routing.rsc* file of a RSCP. As the file is being parsed, a list of static sources Bels are recorded and stored in a list data structure. Just like routethroughs, if a Bel is being used as a static source no Cell should be placed on it.



Figure 22: A LUT Bel that has been configured as a GND source. Notice that there are no input pins being driven.

#### 8.3.3 Pseudo CellPins

PseudoCellPins are described in great detail in subsubsection 5.2.3. When a RSCP is loaded, PseudoCellPins are automatically detected, created, and attached to their corresponding Cells.

## 8.4 Importing and Exporting TCPs

After a Vivado design has been converted to a TCP using the ::tincr::write\_rscp command, it can be loaded into RapidSmith. Code Sample 13 demonstrates the basix syntax of design import and export.

## **Code Sample 13:** How to import and export TCP files to and from RS2

```
// Loading a Tincr Checkpoint
TincrCheckpoint tcp = VivadoInterface.loadTcp("pathToCheckpoint.tcp");
CellDesign design = tcp.getDesign();
Device device = tcp.getDevice();
CellLibrary libCells = tcp.getLibCells();

// Insert CAD Tool Here
// Exporting the modified design to a Tincr Checkpoint
VivadoInterface.writeTCP("pathToStore.tcp", design, device, libCells);
```

While a design is being imported into RapidSmith, several useful data structures are built up. If you want to gain access to those data structures, you can pass an additional argument into the VivadoInterface.loadTCP method. This is shown in Code Sample 14.

#### **Code Sample 14:** Importing a TCP with additional information

```
// Loading a Tincr Checkpoint with additional info
TincrCheckpoint tcp = VivadoInterface.loadTcp("PathToCheckpoint.tcp", true);
Collection<BelRoutethrough> belRts = tcp.getRoutethroughObjects();
Collection<Bel> staticSources = tcp.getStaticSourceBels();
Map<BelPin, CellPin> belPinToCellPinMap = tcp.getBelPinToCellPinMap()
```

After a design is exported from RapidSmith, a Tincr Checkpoint (TCP) is produced. To import the TCP back into Vivado, simply open up Vivado in Tcl mode and use the following command:

```
Vivado% tincr::read_tcp myCheckpoint.tcp
```

# 9 Example Programs

A variety of example programs can be found in the edu.byu.edu.rapidSmith.examples package of the RS2 installation. They have been heavily commented to provide a means to learn the RS2 API by example. We believe this approach is better than reading through a block of text while trying to understand the data structures and what they do. There is a *README.txt* file in that directory to provide an overview of each example. The order that they appear in the *README.txt* is also the suggested learning order for beginners. In addition, the subsections below describe one or more built-in RS2 programs which you might find useful.

## 9.1 Sample Vivado Designs

To enable new users of RS2 to quickly start running the example programs, a small set of pre-compiled Vivado designs have been included in the distribution. They are located in the *exampleVivadoDesigns* directory of the repository, and consist of 3 designs:

• add.tcp: synthesized only

• cordic.tcp: synthesized and placed

• count16.tcp: synthesized, placed and routed

There is a sample design to demonstrate each possible tool flow between Vivado and RS2. Equivalent Vivado checkpoints files (.dcp) are also included in the same directory as *add.dcp*, *cordic.dcp*, and *count16.dcp* files. To open these checkpoints in Vivado, you can either (a) double click on the .dcp file in a file explorer, or (b) use the command open\_checkpoint when a Vivado terminal is open. It is suggested that you have the equivalent Vivado designs open when going through the example programs listed below.

If you want to recompile the designs from scratch, the source code for each design has also been included in the same directory. The Tcl script called **compile.tcl** can be used for this purpose. Simply open Vivado in Tcl mode, and type the following commands to re-build one of the example designs.

```
Vivado% cd <path to exampleVivadoDesigns directory>
Vivado% compile_hdl_to_checkpoint_files add
Vivado% close project
```

This will synthesize, place, and route a design and, from that compiled design, generate the .tcp directory and the .dcp file. For example programs that only explore the architecture, opening the device browser in Vivado can also be helpful.

## 9.2 DeviceBrowser

The DeviceBrowser is a GUI program located in the edu.byu.ece.rapidSmith.device.browser package. It lets you browse parts at the tile level, and is useful for becoming more familiar with FPGA architecture. As long as a valid device file exists, then the DeviceBrowser can operate (no design required). A screenshot from the DeviceBrowser can be seen in Figure 2. On the left, the user may choose the desired part by navigating the tree menu and double-clicking on the desired part name. This will load the part in the viewer pane on the right (the first available part is loaded at startup). The status bar in the bottom left displays which part is currently loaded. Also displayed is the name of the current tile which the mouse is over, highlighted by a yellow outline in the viewer pane. The user may navigate inside the viewer pane by using the mouse. By right-clicking and dragging the cursor, the user may pan. By using the scroll-wheel on the mouse, the user may zoom. If a scroll-wheel is unavailable, the user may zoom by clicking inside the viewer pane and pressing the minus(-) key to zoom out or the equals(=) key to zoom in.

The device browser also allows the user to follow the various connections found in the FPGA. By double clicking a wire in the wire list, the application will draw the connection on the tile array (as shown in Figure 23). By hovering the mouse pointer over the connection, the wire becomes red and a tooltip will appear describing the connection made by declaring the source tile and wire followed by an arrow and the destination tile and wire. By clicking on the wire, the application will redraw all the connections that can be made from the currently selected wire. By repeating this action, the user can follow connections and discover how the FPGA interconnect is laid out. Thanks to Chris Lavin for originally creating this application.

9.3 DeviceAnalyzer 46



Figure 23: DeviceBrowser Screen Shot Showing Wire Connections

## 9.3 DeviceAnalyzer

The DeviceAnalyzer is designed as a simple getting started program and demonstrates how to use some of the Device data structures in RS2. This includes how to query for and print tiles in a device, how to use wires and wire connections, and other useful device functions.

## 9.4 ImportExportExample

The ImportExportExample demonstrates how to load a Tincr checkpoint into RapidSmith, and how to export a Rapid-Smith netlist back into a Tincr checkpoint. This is a very important step in passing digital designs back and forth between Vivado and RapidSmith.

## 9.5 DesignAnalyzer

The DesignAnalyzer loads a Tincr checkpoint into RapidSmith, walks the design data structures, and prints what it finds as it goes in a readable format. As such, it provides a nice example of a number of things which would be useful for getting started with RS2 including:

- How to enumerate the Cells in a design, determine and print their placement information, and determine and print their properties.
- How to enumerate the logical nets in a design and print out their source and sink pins.
- How to traverse and print out the physical route for a logical net (if it is routed)

## 9.6 CreateDesignExample

The CreateDesignExample program builds a RapidSmith netlist from scratch (using Cells and CellNets) and then places the design. While this is certainly not recommended for substantial designs, it does demonstrate how to do the following useful tasks in RapidSmith:

- Create new Cells and add them to an existing netlist
- Create new CellNets, and connect them to CellPins
- Modify the properties on Cells
- Place Cells onto a Bels
- Find compatible Bel placement for a given Cell

## 9.7 Other Test Programs

The programs introduced in this section are designed for beginners of RS2. Once you start becoming more comfortable with RapidSmith and its data structures, there are several other more advanced examples. These examples include the **HandRouter**, **AStarRouter**, and **SimulatedAnnealingPlacer** programs. See the README.txt file for more information about each of these examples.

# 10 Installing New Device Files

The device files included with the RapidSmith2 installation (listed in ??) have been well-tested, and are great starting points for new users. If you are a new user, it is *strongly encouraged* that you learn RapidSmith's APIs and data structures by first implementing CAD tools on these devices. However, RapidSmith2 also supports installing new device files for parts not listed in ??. This section contains the following: (1) a description of the required intermediate files that need to be generated from Tincr to support a new device, and (2) the necessary steps to transform the intermediate files into compact device files that can be loaded into RapidSmith2.

#### **10.1 XDLRC**

The Device data structures in RapidSmith are mainly created from XDLRC files. XDLRC files contain a complete physical description of a Xilinx device (including tiles, wires, sites, etc.). For older Xilinx parts (series 7 and below), these files can be created in ISE with the xdl command. For newer parts (UltraScale and above), XDLRC files can be created in Vivado using the Tincr command tincr::write xdlrc. Because XDLRC files can grow to be hundreds of Gigabytes in size, RapidSmith compresses them into much smaller device files (usually in the tens of Megabytes). Appendix A contains a more detailed description of XDLRC syntax for those who are interested.

## 10.2 FamilyInfo

A *familyInfo.xml* file contains useful information that is not included in the XDLRC files for a given family of devices. Vivado currently supports 8 different family types: Artix7, Virtex7, Kintex7, Zynq, Kintex UltraScale, Virtex UltraScale, Kintex UltraScale+, and Virtex UltraScale+. Only one *familyInfo.xml* file is required for each of these families (all devices within a family share the same family info). The remainder of this section describes the important parts of a *familyInfo.xml* file, and how they are represented in RapidSmith2.

• A list of **compatible types** for each site. Site A is said to be compatible with site B if the logical cells placed on site A can *always* be placed on site B as well. For example, as shown in Figure 24, SLICEL sites are compatible with SLICEM sites. The cells placed on the SLICEL in the figure can be moved to the SLICEM and function indentically. SLICEM sites, however, are *not* compatible with SLICEL sites. This is because SLICEM sites support LUT RAM cells, which cannot be placed on SLICEL sites.



Figure 24: The same group of cells placed on a SLICEL site (left) and a SLICEM site (right).

Compatible sites are important to site-level placers in determining where a group of cells can be placed. **NOTE**: In some instances of compatibility, you have to first change the type of the compatible site before placing cells on it. For example, a RAMB36 site is compatible with a RAMBFIFO36 site. However, the site type of the RAMBFIFO36 site **must be changed to a RAMB36** before it is truly compatible.

10.2 FamilyInfo 49

• A list of **routethrough connections** for each LUT BEL (including the input bel-pin and output bel-pin of the routethrough). These routethrough connections are turned into Connection objects in RapidSmith2 that can be used for internal site routing. Sections 7 and 8.3 give more information about LUT routethroughs.

• A list of **alternate types** for each site. Each physical site on the device has an associated default type. Some sites, however, can be configured to be one of many types. An example for an UltraScale BITSLICE\_RX\_TX site is shown in Figure 25. As the figure shows, a BITSLICE\_RX\_TX site can also be configured to be of type BIT-SLICE\_COMPONENT\_RX\_TX, BITSLICE\_RXTX\_RX, or BITSLICE\_RXTX\_TX. RapidSmith2 parses the alternate site information from the XML and applies it to the corresponding Site data structure. This allows users to change site types based on what they need.



**Figure 25:** Vivado GUI showing alternate types for a BITSLICE\_RX\_TX site.

• A list of **site pip corrections**. In the XDLRC files that are parsed into RapidSmith, site pips (or routing muxes) are not distinguished from the functional BELs (such as LUTs and Flip Flops) of a site. The Family Info XML identifies the site pips of a site and marks them as either a "mux" or a "polarity\_selector." RapidSmith2 transforms these routing muxes into individual pips as shown in Figure 26 so that they are no longer represented as BELs. After the decomposition, each pip is a routing resource of the site.



**Figure 26:** RapidSmith2 site pip decomposition.

• A list of **pin direction corrections**. For ISE-generated XDLRC files, all bel-pins are given a direction of either INPUT or OUTPUT. However, there are several bel-pins in Xilinx devices that are actually of direction INOUT (bidirectional). The Family Info file marks INOUT bel-pins so that their direction can be corrected in RapidSmith2.

## 10.3 Creating New Device Files for Supported Families

?? gives a list of currently supported families in RapidSmith2. If the device you want to install is **not** within a supported family, see subsection 10.4 for how to add support for a new family in RapidSmith. Otherwise, a new device file can be added in two easy steps:

1. Open Vivado in Tcl mode, and execute the Tincr command ::tincr::write\_xdlrc. An example command usage is given below for the Artix7 part xc7a100tcsg324-3.

```
[ttown523@CB461-EE09968:] vivado -mode tcl

****** Vivado v2014.2 (64-bit)

**** SW Build 928826 on Thu Jun 5 17:55:10 MDT 2014

**** IP Build 924643 on Fri May 30 09:20:16 MDT 2014

** Copyright 1986-2014 Xilinx, Inc. All Rights Reserved.

Vivado% ::tincr::write_xdlrc -part xc7al00tcsg324-3 -max_processes 4
   -primitive_defs xc7al00tcsg324_full.xdlrc
```

The "-max\_processes" option is used to parallelize the operation so that it will execute faster. If you have problems with the parallel generation however (it can sometimes hang), then setting this option to "1" will prevent the process from hanging. **NOTE**: This Tcl command can take a very long time to run (more than 24 hours for very large devices). It is suggested that you run it on a remote machine so that you can continue work on your regular work machine. Also, be aware that these XDLRC files are massive. 100 GB for the largest XDLRC files is not uncommon. Make sure you have enough space on your hard drive before generating the XDLRC for a device.

Once the XDLRC creation is complete, run the device installer in RapidSmith2 and pass the newly created XDLRC as an argument. An example command line usage is shown below. The device installer can also be run in an IDE.

```
[ttown523@CB461-EE09968:] java -ea -Xmx4096m
edu.byu.ece.rapidSmith.util.Installer --generate file
xc7a100tcsg324_full.xdlrc
```

The device installer parses the verbose XDLRC file, and creates compact RapidSmith device files (Megabytes instead of Gigabytes) that represent a Xilinx device. Notice the two JVM command line arguments used in the command above. The first option ("-ea") enables assertions for the code. It is important to include this flag so that device file errors can be caught during parsing. If you are creating a new device file for a supported family however, there should be no errors during the installation process. The second option ("-Xmx4096m") sets how much memory the JVM can use while running the installer. Since XDLRC files are so large, the memory usage of the installer can grow very quickly. In the command above, the JVM is set to use 4 GB of memory. If the device installer fails with an out of memory exception, you will need to increase the number of this parameter and re-run the installer (you may need up to 32 GB of memory here).

Once the device installer is done executing, the compact devices files are stored in the corresponding family directory of the RapidSmith2 "devices" folder. For example, the device files generated from the example part xc7a100tcsg324-3 are stored in the "artix7" sub-directory. The listing below shows the two device files that are created after the device installer is run (the device files are bolded). The file ending in "\_db.dat" contains the serialized Device data structures for RapidSmith. The file ending in "\_info.dat" contains additional serialized data (such as reverse wire connections) that can be optionally loaded with the device.

```
[ttown523@CB461-EE09968:artix7] ls cellLibrary.xml familyInfo.xml xc7a100tcsg324_db.dat xc7a100tcsg324_info.dat
```

## **10.4** Supporting New Device Families

1.

- 1.) Look in the tincr/cache folder of your Tincr Installation. Then try to find the family folder for you device (i.e. Artix7). If you look inside the family folder and there is no "primitive defs" folder (or the family folder does not exist), then you will need to generate the primitive defs before creating the XDLRC file. To do this, you will need to use the VSRT GUI. The process to create a new set of primitive defs is given in the VSRTUserGuide.pdf of the RS2 repository.
  - 2.) Once the primitive defs are created, put them in the tincr cache folder as described in step (1).
- 5.) Generate a new Family Info. If you are generating a new familyInfo.xml, then that means that most likely you also generated the primitive defs using VSRT. In the VSRT folder you used, there is a file called "addedBels.txt". To generate a new family info file, run the Vivado command ::tincr::create\_xml\_family\_info "familyInfo.xml" family addedBels.txt. This will generate most of the information you need for the family info file. However, there are some compatible sites that are NOT included with this file. For ultrascale, we needed to manually add these two compatible types manually: !!put pictures here with highlighted text!! For your device, you will have to determine what compatible types are missing (SLICEL-¿SLICEM will always be one, and there should be some IOB sites that are compatible, but the rest is up to you to determine). It is best practice to go through each site in the family info file, and make sure there are no corrections that need to be made before creating a new device file in RApidSmith. Once the family info is created, store it in rapidSmithPath/devices/family/familyInfo.xml.

1.

## 11 Bitstreams in RS2

In the original RapidSmith, bitstreams can be parsed, manipulated, and exported for Virtex 4, Virtex 5 and Virtex 6 Xilinx FPGA families. Because of the proprietary nature of Xilinx bitstreams, RapidSmith provided only documented functionality when working with bitstreams (and was limited mainly to manipulation at the frame level including helping to assemble sequences of configuration commands which are interpreted by the FPGA configuration controller circuitry). While this has proven valuable to many researchers, it does not provide the ability to create your own bitstream from scratch because it does not provide the specific meaning of each bit in a bitstream.

If you desire to use RapidSmith's bitstream manipulation features, you should download and work with Rapid-Smith instead of RS2 (the RapidSmith bitstream packages have been removed from RS2). If you do so, note that RapidSmith's bitstream packages have not been tested beyond Virtex 6. The authors would be interested in upgrading RapidSmith's bitstream functionality to device families beyond Virtex 6 if users create it and are willing to contribute it to us for inclusion.

# 12 License

RS2 is released under GPL version 3 with the following license:

BYU RapidSmith Tools

Copyright (c) 2010-2016 Brigham Young University

BYU RapidSmith Tools is free software: you may redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 2 of the License, or (at your option) any later version.

BYU RapidSmith Tools is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

A copy of the GNU General Public License is included with the BYU RapidSmith Tools. It can be found at doc/gpl2.txt. You may also get a copy of the license at http://www.gnu.org/licenses/.

# 13 Included Dependency Projects

RS2 includes the Caucho Technology Hessian implementation which is distributed under the Apache License. A copy of this license is included in the doc directory in the file APACHE2-LICENSE.txt. This license is also available for download at:

http://www.apache.org/licenses/LICENSE-2.0

The source for the Caucho Technology Hessian implementation is available at: http://hessian.caucho.com

RS2 also includes the Qt Jambi project jars for Windows, Linux and Mac OS X. Qt Jambi is distributed under the LGPL GPL3 license and copies of this license and exception are also available in the /doc directory in files LI-CENSE.GPL3.TXT and LICENSE.LGPL.TXT respectively. These licenses can also be downloaded at: http://www.gnu.org/licenses/licenses.html

Source for the Qt Jambi project is available at: http://qtjambi.org/downloads, and https://sourceforge.net/projects/qtjambi/files/

RS2 also includes the JOpt Simple option parser which is released under the open source MIT License which can be found in this directory in the file MIT\_LICENSE.TXT. A copy of this license can also be found at: http://www.opensource.org/licenses/mit-license.php

A copy of the source for JOpt Simple can also be downloaded at: http://jopt-simple.sourceforge.net/download.html

RS2 also includes the JDOM jars. JDOM is available under an Apache-style open source license, with the acknowledgment clause removed. This license is among the least restrictive license available, enabling developers to use JDOM in creating new products without requiring them to release their own products as open source. This is the license model used by the Apache Project, which created the Apache server. The license is available at the top of every source file and in LICENSE.txt in the root of the JDOM distribution.

The user is responsible for providing copies of these licenses and making available the source code of these projects when redistributing these jars.

# A Appendix A: XDLRC Files and Syntax

In general, users of RapidSmith do not need to understand the syntax of XDLRC files to create CAD tools in Rapid-Smith. The syntax is introduced here for those who are interested, and for those who want to modify the XDLRC parser in some way. If these don't apply to you, then go ahead and skip this section. XDLRC files are textual descriptions of Xilinx FPGA devices and can be very verbose (which is why they get so large). This section highlights the main parts of an XDLRC file with accompanying images. As you will see, much of the terminology is the same as subsection 4.1.

#### **Tiles**

Figure 27: Tile syntax in XDLRC files

A tile in an XDLRC file corresponds to the same thing as the Tile described in subsection 4.1. Each tile is declared with a "(tile" directive as shown above followed by the unique row and column index of where the tile fits into the grid of tiles found on the FPGA. The tile declaration also contains a name followed by a type with the final number being the number of primitive sites found within the tile. The tile ends with a "tile\_summary" statement repeating the name and type with some other numbered statistics. Tiles can contain three different sub components, primitive sites, wires, and PIPs.

## **Primitive Sites**

Figure 28: Primitive site syntax in XDLRC files

Primitive site declarations in XDLRC files contain a list of pinwires which describe the name and direction of pins on the primitive site. The first pinwire declared in the example above is the BX input pin which is the internal name to the SLICEL primitive site. Pinwires have an external name as well to differentiate the multiple primitive sites that may be present in the same tile. In this case, BX of SLICE\_X9Y127 has the external name BX\_PINWIRE3. In RapidSmith, only the first pin name (i.e. BX above) is used.

#### Wire

Figure 29: Wire syntax in XDLRC files

A wire as declared in XDLRC is a routing resource that exists in the tile that may have zero or more connections leaving the tile. In the example above, the wire "E2BEGO" connects to 5 neighboring tiles. These connections (denoted by "conn") are described using the unique tile name and wire name of that tile to denote connectivity. The connections are not programmable, but hard wired into the FPGA. Wire portions of the XDLRC file are included in the definition of every tile (even if the same tile type has already been printed), which has a big impact on the final size of XDLRC files. How RapidSmith handles wire duplication is described in subsubsection 4.2.2. The WireConnection objects that are created from this part of the XDLRC are described in subsection 7.1.

#### **PIP**

Figure 30: PIP syntax in XDLRC files

A PIP (programmable interconnect point) is a possible connection that can be made between two wires. In the example above, the PIP is declared in the tile and repeats the tile name for reference. It specifies two wires by name that both exist in that same tile ("BEST\_LOGIC\_OUTS0" and "BYP\_INT\_B5") and declares that the wire "BEST\_LOGIC\_OUTS0" can drive the wire "BYP\_INT\_B5". A collection of these PIPs in a net define how a net is routed and is consistent with saying that those PIPs are turned on. subsection 7.1 describes in detail how PIPs are represented in RapidSmith.

#### **Primitive Definitions**

The Primitive Definition portion of an XDLRC file textually describes the components found within a Primitive Site (a SLICEL for example) and how they are connected. This includes:

- BELs
- Site Pins
- Site Pips (Routing Muxes)
- Configuration options (both site and BEL)
- Site Routethroughs (configurable connections from a site input pin to a site output pin)
- Site Wire Connections

An example of a complete primitive definition file of type BUFHCE can be seen in Figure 31. The sub-site data structures in RapidSmith (Bels, SiteWires, etc.) are built by parsing this section of the XDLRC file. For a more detailed description of primitive definitions, view the VSRT User Guide in the RapidSmith2 repository.

```
(primitive def BUFHCE 3 7
                                                                    (pin CE B input)
       (pin CE CE input)
                                                                    (cfg CE CE B)
                                                                    (conn CEINV OUT ==> BUFHCE CE)
(conn CEINV CE <== CE CE)
       (pin I I input)
       (pin 0 0 output)
       (element CE 1
                                                                    (conn CEINV CE_B <== CE CE)
                (pin CE output)
                (conn CE CE ==> CEINV CE)
                                                            (element BUFHCE 3 # BEL
               (conn CE CE ==> CEINV CE_B)
                                                                    (pin CE input)
                                                                    (pin I input)
        (element I 1
                                                                    (pin O output)
               (pin I output)
                                                                    (conn BUFHCE CE <== CEINV OUT)
                                                                    (conn BUFHCE I <== I I)
                (conn I I ==> BUFHCE I)
                                                                    (conn BUFHCE O ==> 0 0)
        (element 0 1
               (pin O input)
                                                            (element CE TYPE 0
                (conn O O <== BUFHCE O)
                                                                    (cfg SYNC ASYNC)
        (element CEINV 3
                                                            (element INIT_OUT 0
               (pin OUT output)
                                                                   (cfg 0 1)
               (pin CE input)
               (pin CE_B input)
```

Figure 31: Primitive Def sections of XDLRC files

# **B** Appendix

TBD