# AWS FPGA Starter's Guide (Rev 2)

#### Introduction

This guide explains how to work with AWS FPGA Hardware/Software Development Kit (HDK/SDK) and deploy your FPGA application on AWS. Developing FPGA application in cloud flows like this.

- 1) Write your custom logic (CL) that goes into FPGA in SystemVerilog.
- 2) RTL Simulate your CL for debugging and validation.
- 3) Create Amazon FPGA image (AFI).
- 4) Write your host program in C/C++.
- 5) Remote into AWS FPGA instance and load AFI into FPGA.
- 6) Compile and run the C/C++ application.



Figure 1. Programming model

# **Creating AWS Account**

First, sign up for an AWS account with your UW email account at <a href="https://aws.amazon.com">https://aws.amazon.com</a>. You will have to enter a valid credit card info, but you will not be charged for anything (Uncle Bezos got you covered this time). Once the sign up is complete, go to <a href="https://aws.amazon.com/education/awseducate">https://aws.amazon.com/education/awseducate</a> to apply for free AWS credit. You want to choose the option to have the free credit applied to the account you just created (Do not pick the "AWS Starter Account" option). Be sure to do this step first, because this process may take up to 2-3 days at most. You can check whether your free credit has been applied to AWS at Billing Dashboard:

https://console.aws.amazon.com/billing/home#/.



# **Writing Custom Logic**

Custom Logic (CL) is the core of your FPGA application that actually does something inside FPGA. HDK provides the shell template (SH) which encapsulates CL and provides I/O interface to the host and peripherals.

### 1) Setting up HDK

1) Log into one of EE Linux machines where Vivado is installed (remotely or in person). Those machines are named 'linux-lab-001.ee.washington.edu' through 'linux-lab-040.ee.washington.edu'.

You need to have EE account. Follow this link, if you don't have one yet. https://www2.ee.washington.edu/computing/fag/new\_account.html

Once you have obtained EE account, follow this link that explains how you can remote into EE machines and setting up X11 forwarding for GUI.

https://www2.ee.washington.edu/computing/fag/xming.html

2) Run the following line. 'settings64.sh' script will allow you call 'vivado' from command line, and appending this line to '.bashrc' will make sure that this script is called whenever you open a bash terminal.

```
$ "source /homes/lab.apps/xilinx_vivado_v2017.1_sdx/SDx/2017.1.op/settings64.sh" >> ~/.bashrc
```

- 3) Type 'bash' to switch terminal.
- 4) Create a directory where you want to store your project and cd in there.
- 5) Clone git repository from aws-fpga. It should create a directory named "aws-fpga".

```
$ git clone https://github.com/aws/aws-fpga.git
```

6) cd into aws-fpga directory, and you will find hdk\_setup.sh file.

Figure 2. inside aws-fpga directory

7) Run source hdk\_setup.sh.

You should be able to run vivado, and validate that you have required licenses. This will open up Vivado License Manager. Once vivado application is launched, go to Help->Manage License. Make sure you have following licenses.

# vivado &



# 2) Hello World (Virtual LED/DIP switch)

Using the virtual LED and DIP switch is the most basic way for the host to communicate with FPGA. The host can pass some configuration or command via virtual DIP, and the FPGA can return the result or raise an alert to the host via virtual LED. It's called 'virtual', because we are remotely programming an FPGA board somewhere inside AWS datacenter, not a local FPGA board.

The example custom logics are located under 'aws-fpga/hdk/cl/examples/'. Those examples are provided by AWS, and you are free to take a look. Instead, we are going to use the 'cl\_led\_dip' custom logic in aws-fpga-starterpack.



Figure 3. aws-fpga-starterpack directory structure

- Clone aws-fpga-starterpack repository in your project directory.
   git clone https://github.com/tommydclung/aws-fpga-starterpack.git
- Copy the 'cl/cl\_led\_dip' folder from aws-fpga-starterpack to 'aws-fpga/hdk/cl/examples/'.

Figure 4. Inside aws-fpga/hdk/cl/examples/ directory

3) Inside the 'cl\_led\_dip' directory, you will find four directories.

```
[dcjung@linux-lab-019 cl_led_dip]$ ls
build design software verif
```

'build' will be used for creating AFI later, and 'verif' will be used for simulating CL. All the custom logic Verilog code should be written in design.

4) cd into 'design', and you will find 'cl\_led\_dip.sv' file. This is the top-level Verilog module for your CL. As mentioned before, CL is encapsulated within shell (SH) to abstract I/O for the developer. The input and output ports of this top-level module are defined in the Verilog header file called 'cl\_ports.vh', which is located in 'aws-fpga/hdk/common/shell\_stable/design/interfaces/'.

```
// Hello World virtual DIP/LED
logic[15:0] vdip_q;
always_ff @ (posedge clk_main_a0 or negedge rst_main_n_sync)
    if (!rst_main_n_sync) begin
        vdip_q <= 16'b0;
    end
    else begin
        vdip_q <= sh_cl_status_vdip[15:0];
    end

assign cl_sh_status_vled = {vdip_q[3:0], vdip_q[7:4], vdip_q[11:8], vdip_q[15:12]};</pre>
```

Figure 5. core functionality of cl\_led\_dip

The ports for virtual DIP and LED are called "sh\_cl\_status\_vdip" and "cl\_sh\_status\_vled" respectively, and they are each 16-bit wide. In the code snippet above, the input "sh\_cl\_status\_vdip" flopped into the register called "vdip\_q" every clock cycle, and the "cl\_sh\_status\_vled" is assigned the reversed hex of "vdip\_q".

### Validating and Debugging CL by RTL simulation

It is a good idea to simulate your custom logic before you build and create the AFI. This provides a chance to validate that your design works as you intended. If there is a bug in your logic, this can be an invaluable tool for debugging. We will write the simulation code in SystemVerilog.

1) Go to 'verif/tests/' directory. There is a file called 'test\_led\_dip.sv'.

```
[dcjung@linux-lab-019 cl_led_dip]$ cd verif/
[dcjung@linux-lab-019 verif]$ ls
scripts sim tests
[dcjung@linux-lab-019 verif]$ cd tests/
[dcjung@linux-lab-019 tests]$ ls
test_led_dip.sv
```

2) Open 'test\_led\_dip.sv', and you will see that the test is setting the DIP switch to "0x0000", and then to "0xbeef". When you read the LED value, it should be "0xfeeb". More details about the test bench can be found here: https://github.com/aws/aws-fpga/blob/master/hdk/docs/RTL Simulating CL Designs.md

```
dule test led dip();
   import tb_type_defines_pkg::*;
   logic [15:0] vdip value;
   logic [15:0] vled value;
   initial begin
       tb.power_up();
       tb.set_virtual_dip switch(.dip(16'h0000));
       vdip_value = tb.get_virtual_dip_switch();
                        rdip = 0x%x", $realtime, vdip value);
       vled_value = tb.get_virtual_led();
       $display("[%t] vled = 0x%x", $realtime, vled_value);
       tb.set virtual dip_switch(.dip(16'hbeef));
       vdip_value = tb.get_virtual_dip_switch();
                              0x%x", $realtime, vdip_value);
       $display('
       #10ns: // wait 10ns
       vled_value = tb.get_virtual_led();
       $display("[%t] vled = 0x%x", $realtime, vled_value);
       if (vled_value == 16'hfeeb)
    $display("[%t] Test passed.", $realtime);
       else
           $display("[%t] Test failed.", $realtime);
       tb.kernel reset();
       tb.power down();
  end
endmodul<mark>e</mark>
```

Figure 6. test\_led\_dip.sv

- 3) In order to run the test, cd into "verif/scripts". There are three files of our interests.
  - a. Makefile we are using this file to build and tun the test.
  - b. top.vivado.f if you add create a new module or include a new IP, make sure to include it in this file.
  - c. waves.tcl during the simulation you can capture waveforms of some of the signals in your custom logic. The first line of 'wave.tcl' is "add\_wave /card/fpga/CL". This statement will tell the simulation to capture all the signal in the top-level module (in this case 'cl\_led\_dip'). Note that it will only signals at the top-level, but not the signals at a lower level. For example, if there was an instance named "MY\_MODULE", you will have to add 'add\_wave /card/fpga/CL/MY\_MODULE' to capture the signals inside "MY\_MODULE" instance. You can also use "add\_wave -recursive /card/fpga/CL" to capture all the signals at lower levels recursively, but it could result in a very large waveform output, so use with discretion.
- 4) While you are in "verif/scripts", run "make TEST=test\_led\_dip". The parameter after "TEST=" should match the name of the test file under "verif/tests" without the file extention (.sv). The test script will compile and run the test program, and it should print the output like this in the end.

Figure 7. test\_led\_dip test result

5) Now, let us view the waveform captured during the simulation. If you go back to "verif" directory, you will notice that "verif/sim" directory is created. The simulation results are stored under this directory, with the name of the test you just ran. cd into "verif/sim/test\_led\_dip". There is a file called "tb.wdb". That is the waveform database file. Create a text file called "open\_waves.tcl" inside this directory, and add the following two lines and save.

```
current_fileset
open_wave_database tb.wdb
```

Now, run vivado -source open\_waves.tcl

6) Under 'Scope' tab, you will see the hierarchy of modules. Click down to "/tb/card/fpga/CL". In 'Objects' panel, you will see the signals defined under each hierarchy. Right-click 'clk\_main\_a0', and click "Add to Wave Window". Also, add 'rst\_main\_n', 'sh\_cl\_status\_vdip[15:0]' and 'cl\_sh\_status\_vled[15:0]'. Drag and click the waveform window to zoom into where the interesting events are happening.

You should see that virtual DIP starts out as '0x0000' and becomes '0xbeef'. Then, one clock cycle after, virtual LED value becomes '0xfeeb'.



Figure 8. Waveform of cl\_led\_dip.

# Creating Amazon FPGA Image

Now that we have validated the functionality of cl\_led\_dip through RTL simulation, it is time to create Amazon FPGA Image (AFI). First, we have to build a tarball from our CL design. Then, we have to run various AWS CLI commands to create AFI.

- 1. Creating tarball from CL design
  - 1) cd into "aws-fpga/hdk/cl/examples/cl\_led\_dip"
  - 2) Set the environment variable. When you type echo \$CL\_DIR, it should print out the path to your cl led dip directory.

# \$ export CL\_DIR=\$(pwd)

- 3) cd into "cl led dip/build/scripts" and run ./aws build dcp from cl.tcl -foreground
- 4) The build process will take a long time even for the simplest application (~2-3 hours), so you can either move onto "Writing C/C++ Host Application" or go out for a walk and grab some coffee.
- 5) If the build was successful, cd into "cl\_led\_dip/build/checkpoints/to\_aws/". You shall find a tarball file generated with timestamp as prefix (e.g. 17\_11\_16-183421.Developer\_CL.tar).

- 2. Configuring AWS CLI / Creating AFI
  - We are going to use AWS Command Line Interface (CLI) to create AFI. If you are working from your own computer you will have to install AWS CLI. <a href="https://aws.amazon.com/cli/">https://aws.amazon.com/cli/</a>
     If you are working with EE dept machines, AWS CLI should have been installed.
  - 2) We have to configure the AWS CLI so that it works with your AWS account credential. In order to do that we need to create access key pair. <a href="https://console.aws.amazon.com/iam/home?#/security\_credential">https://console.aws.amazon.com/iam/home?#/security\_credential</a> Click Access keys-> Create New Access Key. It will give you "Access Key ID" and "Secret Access Key". Save those two strings somewhere secure. Do not share the *secret* access key with others.



3) Now back to the terminal. Run the following command. Enter the access key ID and secret access key. Default region name should be 'us-west-2' (Oregon). Just hit enter for default output format.

# \$ aws configure

```
[dcjung@linuxsrv01 ~]$ aws configure
AWS Access Key ID [None]:
AWS Secret Access Key [None]:
Default region name [None]: us-west-2
Default output format [None]:
[dcjung@linuxsrv01 ~]$
```

- 4) AWS S3 is a cloud-based file storage service. We have to upload our tarball to S3, which then will be picked up by AFI generation service to be processed.
- 5) Create a S3 bucket (analogous to a file directory)

```
$ aws s3 mb s3://my-fpga-cl --region us-west-2
```

6) Upload the tarball to the bucket just created.

```
$ aws s3 cp ./17_11_16-183421.Developer_CL.tar s3://my-fpga-cl/
```

7) Create a S3 bucket for AFI generation log.

```
$ aws s3 mb s3://my-fpga-log --region us-west-2
```

You should be able to verify that bucket has been created and the tarball uploaded as expected here on the AWS management portal: <a href="https://s3.console.aws.amazon.com/s3/home?region=us-west-2#">https://s3.console.aws.amazon.com/s3/home?region=us-west-2#</a>

8) Start AFI generation.

```
$ aws ec2 create-fpga-image --region us-west-2 --input-storage-location
Bucket=my-fpga-cl,Key=17_11_16-183421.Developer_CL.tar --logs-storage-
location Bucket=my-fpga-log,Key=/
```

Calling this command will return json output like this.

You will need to save these IDs somewhere to load AFI to FPGA later.

9) Check if AFI generation has completed.

```
aws ec2 describe-fpga-images --fpga-image-ids <your afi id>
```

Once it's completed FpgaImages. State. Code will eventually become "available". Only then you are able to load this AFI to FPGA. It takes about 20~30 minutes.

# Writing C/C++ Host Application

Developing host application has to be done on AWS virtual machine that runs FPGA Developer AMI (Amazon Machine Images). AWS provides C library for interacting with FPGA. This interaction is done through PCI Express Bus. Shell (SH) that encapsulates our CL handles the communication. This communication is generally divided into two physical function (PF): MgmtPF and AppPF. MgmtPF deals with the management functionality of FPGA: loading/unloading FPGA image, peeking/poking virtual DIP switch and LED. AppPF deals the functionalities that are more specific to your CL, such as sending and receiving stream of data over DMA.

Host application is separate from CL. In the starterpack repo, there is a directory called '/host'. It contains an example host application for cl\_led\_dip. '/host/common' contains C++ header and class files for wrapping C libraries provided in aws-fpga repo (Refer to aws-fpga/sdk/userspace/include'). Make sure to call 'source sdk\_setup.sh' before you compile the example host applications. Makefile inside host/cl\_led\_dip defines how the host application is compiled.



Figure 9. There are many ways to interact with FPGA.

- 1. Setting up FPGA Development AMI VM
  - 1) Go to <a href="https://us-west-2.console.aws.amazon.com/ec2/v2/home?region=us-west-2">https://us-west-2.console.aws.amazon.com/ec2/v2/home?region=us-west-2</a> Click "Launch Instance".



2) Click "AWS Marketplace" on the left tab, and search for "FPGA Developer AMI". Click "Select". Click "Continue" when it shows you a menu.



3) Select "c4.large", and click "Next".



4) Leave the settings as they are, and click "Next".



5) Leave the storage settings as they are, and click "Next"



7) Click "Add Rule". Select type "RDP" for remote desktop. Select source "Anywhere". Click "Review and Launch"



8) Click "Launch"



9) Select "Create a new key pair". Download your key pair (.pem). Click "Launch Instances".



10) You can see that your instance is up and running. Make sure that the instance state is "Stopped", when you are not using. You will get billed for just leaving the machine running idly. You can click "Actions" dropdown bar, and select instance state.



# 2. Connecting to VM

- 1) You can follow the instruction here to connect to VM on Windows using PuTTY here: https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/putty.html?icmpid=docs\_ec2\_console\_
- 2) You can also connect using ssh: https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/AccessingInstancesLinux.html

Note: your login will be 'centos'

3. Setting up SDK

1) Once you are logged into the VM, we need to set up the SDK. You want to store all your files in /home/centos/src/project\_data, because the files stored there will be persist even after the session ends.

cd ~/src/project\_data

2) Clone the aws-fpga repository.

git clone https://github.com/aws/aws-fpga.git

3) cd into aws-fpga, and run

source sdk\_setup.sh

4) You should see message like this.

4. Cloning aws-fpga-starterpack repository

I have created C++ wrapper around C library in the SDK.

- 1) cd back into ~/src/project\_data
- 2) git clone <a href="https://github.com/tommydcjung/aws-fpga-starterpack.git">https://github.com/tommydcjung/aws-fpga-starterpack.git</a>
- 3) Inside the repository, there is aws-fpga-starterpack/host/common directory where C++ wrapper headers/classes are located.
- 4) You can go into aws-fpga-starterpack/host/cl\_led\_dip directory, where the source code for the host application for cl\_led\_dip is located. It also contains Makefile that builds the app.

Figure 10. app.cpp

Run make. That should compile the program and create a file called app.

6) From here, you can develop your own FPGA host application in C++. You should create your own github repository and start from there.

### Loading AFI in AWS FPGA instance

Finally, we have CL and host application written. We are ready to run our application on the real FPGA instance.

Setting up F1 instance
 Amazon Elastic Compute Cloud (EC2) now provides F1 instances, which are equipped with Xilinx FPGA (VU9P).
 The details of this FPGA can be found under 'doc/' in aws-fpga-starterpack repository. (ds890-ultrascale-overview.pdf).

#### F1 Instance Details

| Instance Type | FPGA Cards | vCPUs | Instance Memory (GiB) | SSD Storage (GB) | Enhanced Networking | EBS Optimized |
|---------------|------------|-------|-----------------------|------------------|---------------------|---------------|
| f1.2xlarge    | 1          | 8     | 122                   | 470              | Yes                 | Yes           |
| f1.16xlarge   | 8          | 64    | 976                   | 4 x 940          | Yes                 | Yes           |

For F1.16xlarge instances, the dedicated PCI-e fabric lets the FPGAs share the same memory space and communicate with each other across the fabric at up to 12 GBps in each direction.

1. The steps for setting up f1 instance is nearly identical to setting up FPGA Development AMI VM as above except we are picking f1.2xlarge as instance type. We will still be using FPGA Development AMI as the machine image.

# Running C/C++ host application

- 1. Setting up aws-fpga and aws-fpga-starterpack repo
  - 1) Once you are logged into f1.2xlarge instance, do the same as done above. Go to ~/src/projectdata. Git clone both aws-fpga, and aws-fpga-starterpack repositories.
  - 2) Run source sdk\_setup.sh from aws-fpga directory.
  - Run make to build the host application from aws-fpga-starterpack/host/cl led dip.
  - 4) Clear the AFI slot 0.

```
$ sudo fpga-clear-local-image -S 0
```

```
[centos@ip-172-31-0-242 cl_led_dip]$ sudo fpga-clear-local-image -S 0
AFI 0 none cleared 1 ok 0 0x071417d3
AFIDEVICE 0 0x1d0f 0x1042 0000:00:1d.0
[centos@ip-172-31-0-242 cl_led_dip]$ |
```

5) Load the AFI you created in the section "<u>Creating Amazon FPGA Image</u>" section above. Use the FPGA image global ID that started with 'agfi-' that you saved somewhere.

```
$ sudo fpga-load-local-image -S 0 -I {{ Your AFGI ID }}
```

6) Run the host application.

```
$ sudo ./app
```

Figure 11. It works.

7) Make sure that your instances are turned off when you are done.

# **Useful Links**

https://github.com/aws/aws-fpga

https://forums.aws.amazon.com/forum.jspa?forumID=243&start=0

https://github.com/tommydcjung/aws-fpga-starterpack

https://github.com/

https://aws.amazon.com/console/

https://aws.amazon.com/education/awseducate/

https://www2.ee.washington.edu/computing/faq/index.html