Lab 3

FPGAs as Accelerators

# Introduction

\*\*LSAL Explanation

\*\*File structure

# Software (x86 & Arm)

\*\*workflow

## Unoptimized

## Code Profiling

## Optimized

\*\*valgrind

## Comparison

# Hardware (FPGA)

*During* the last part of the lab we use ***Vitis HLS*** to implement the *LSAL algorithm* on *FPGA* hardware.

In order to have the optimum outcome we defined a *streamlined* ***workflow***:

* *First* our team tries different ***optimization methods*** on *Visual Studio Code IDE*.
* *Then* *functionality* of the *C code* is tested using *Vitis HLS’s* ***C Simulation***.
* *At this point synthesis* and ***Synthesis Design Report*** of *Vitis HLS* is used to analyze the *resources in-use and the time estimations.*
* *Next* we use the provided *makefile* to ***emulate*** the *kernel design* using ***QEMU***
* *Last*ly we test the design ***natively*** on the FPGA and *extract* the ***timing*** of *our implementation*

It is important to note that both *C Simulation*, *which tests functionality of the code before it gets synthesised*, and *software emulation using QEMU* are really important as a simple mistake can be detected and fixed long before the time it takes to have the kernel implemented into a bitstream.

Also *Synthesis Design Report* is a great tool to have in order to make sure the HLS optimizations you use have been implemented in the correct way. *For example* an easy mistake to make is placing a *#pragma* above a loop instead of inside it, which could result in vastly different implementations.

## Algorithmic Optimizations

## HLS Optimizations

## Comparison

# Results

## FPGA vs CPU