`Accelerate Interpretable Deep Neural Networks`

This project attempts to accelerate the IAIA-BL code (https://github.com/Ruchit-rt/interpretable_neural_networks) using SYCL kernels on Intel DevCloud FPGA nodes. Results were obtained via emulation because of a compilation bug on DevCloud.

Understanding the Codebase

The `scripts` directory contains examples of how to compile and emulate the code. The main source code is under the `src` directory. To understand how the kernels implement the different layers, start with `src/model_unoptimised.cpp`.

The `test` directory contains NumPy arrays, grouped into 3 classes, that are used as test data.

The OpenCV dependency must be built before the code can be run.

Setting Optional Flags

| Variable | Description |
| --- | --- |
| `USER_HARDWARE_FLAGS` | This semicolon-separated list of flags applies only to flows that generate FPGA hardware (i.e. report, simulation, hardware). You can specify flags such as `-Xsclock` or `-Xshyper-optimized-handshaking=off` |
| `USER_FLAGS` | This semicolon-separated list of flags applies to all flows, including emulation. You can specify flags such as `-v` or define macros such as `-DYOUR_OWN_MACRO=3` |
| `USER_INCLUDE_PATHS` | This semicolon-separated list of include paths applies to all flows, including emulation. Specify include paths relative to the `CMakeLists.txt` file, or using absolute paths in the filesystem. |
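As a rough illustration of how these variables might be supplied, the sketch below assembles them as `-D` arguments for `cmake`. It assumes that `CMakeLists.txt` reads each variable as a CMake cache entry, which this README does not confirm, and the include path `src` is a hypothetical value chosen only for the example. The script is a dry run: it prints the arguments rather than invoking `cmake`.

```shell
# Semicolon-separated lists, as described in the table above.
# Single quotes stop the shell from treating ';' as a command separator.
USER_FLAGS='-v;-DYOUR_OWN_MACRO=3'
USER_HARDWARE_FLAGS='-Xshyper-optimized-handshaking=off'
USER_INCLUDE_PATHS='src'   # hypothetical include path, relative to CMakeLists.txt

# Assumption: CMakeLists.txt picks these up as -D cache entries.
set -- \
  "-DUSER_FLAGS=${USER_FLAGS}" \
  "-DUSER_HARDWARE_FLAGS=${USER_HARDWARE_FLAGS}" \
  "-DUSER_INCLUDE_PATHS=${USER_INCLUDE_PATHS}"

# Dry run: print the configure command instead of executing it.
printf 'cmake ..'
printf ' "%s"' "$@"
printf '\n'
```

To apply the settings for real, the same quoted arguments would be passed to `cmake ..` from inside the `build` directory.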

Building Options

This design uses CMake to generate a build script for GNU Make.

Configure the build system for the Stratix10 device family, which is the default.
```
mkdir build
cd build
cmake ..
```

Compile the design through the generated Makefile.

| Target | Expected Time | Output | Description |
| --- | --- | --- | --- |
| `make fpga_emu` | Seconds | x86-64 binary | Compiles the FPGA device code to the CPU. Use the Intel® FPGA Emulation Platform for OpenCL™ software to verify your SYCL code’s functional correctness. |
| `make report` | Minutes | RTL + FPGA reports | Compiles the FPGA device code to RTL and generates an optimization report that describes the structures generated on the FPGA, identifies performance bottlenecks, and estimates resource utilization. This report will include the interfaces defined in your selected Board Support Package. |
| `make fpga_sim` | Minutes | RTL + FPGA reports + x86-64 binary | Compiles the FPGA device code to RTL and generates a simulation testbench. Use the Questa*-Intel® FPGA Edition simulator to verify your design. |
| `make fpga` | Multiple Hours | Quartus Place & Route (Full accelerator) + FPGA reports + x86-64 host binary | Compiles the FPGA device code to RTL and compiles the generated RTL using Intel® Quartus® Prime. If you specified a BSP with `FPGA_DEVICE`, this will generate an FPGA image that you can run on the corresponding accelerator board. |
| `make fpga_ip_export` | Minutes | RTL + FPGA reports | Compiles the FPGA device code to RTL that may be exported to Intel® Quartus Prime software. |

Run the `fpga_template` Executable

Run the sample on the FPGA emulator (the kernel executes on the CPU).
```
./fpga_template.fpga_emu
```

Run the sample on the FPGA simulator device.

CL_CONTEXT_MPSIM_DEVICE_INTELFPGA=1 ./fpga_template.fpga_sim

Alternatively, run the sample on the FPGA device (only if you ran `cmake` with `-DFPGA_DEVICE=<board-support-package>:<board-variant>`).
```
./fpga_template.fpga
```
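The three run commands above can be summarised in a small helper that maps a build target to the command that launches its binary. This is only a sketch: the function name `run_cmd_for` is hypothetical and not part of the repository, and it prints the command rather than executing it.

```shell
# Map a build target to the command that runs its binary.
# run_cmd_for is a hypothetical helper, not part of this repository.
run_cmd_for() {
  case "$1" in
    fpga_emu) echo "./fpga_template.fpga_emu" ;;
    # The simulator flow needs the environment variable set for the run.
    fpga_sim) echo "CL_CONTEXT_MPSIM_DEVICE_INTELFPGA=1 ./fpga_template.fpga_sim" ;;
    fpga)     echo "./fpga_template.fpga" ;;
    *)        echo "unknown target: $1" >&2; return 1 ;;
  esac
}

# Example: show how the simulator binary would be launched.
run_cmd_for fpga_sim
```

Printing the command first makes it easy to check which binary and environment a given target expects before running it.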


Ruchit-rt/fgpa_hardware_accelerateNN
