News: Our paper has been accepted at DATE 2024.
This work is an extension of the fpgaConvNet toolflow. We introduce a novel memory management methodology for the layer-wise pipelining architecture that exploits both on-chip and off-chip memory for weight storage.
`conda` is required before starting. Then set up the environment named `fpgaconvnet-autows` using the following script:

```shell
source setup.sh
```
A Python script is available to perform Design Space Exploration (DSE). It takes ~10 minutes to finish; afterwards, `report.json` and `config.json` can be found at `./output/resnet18/w4a5_bfp_zcu102`:

```shell
python example.py --arch resnet18 --device zcu102 --quantization w4a5 --output_path output/resnet18
```
The `report.json` provides estimations of performance and resource utilization. For example, the accelerator latency estimate can be found under `["network"]["performance"]["latency (s)"]`.
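The latency lookup above can be sketched as a small helper. This is a minimal sketch: the `read_latency` function name and the report path in the comment are illustrative, and only the key path described above is taken from the report layout.

```python
import json

def read_latency(report_path: str) -> float:
    """Return the estimated accelerator latency in seconds from a DSE report.

    Assumes the nested layout described above:
    report["network"]["performance"]["latency (s)"].
    """
    with open(report_path) as f:
        report = json.load(f)
    return report["network"]["performance"]["latency (s)"]

# Example usage (path from the example invocation above):
# latency = read_latency("output/resnet18/w4a5_bfp_zcu102/report.json")
```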
The `config.json` provides the configuration of the pipeline, and the file is used to generate the hardware. For the proposed weight streaming methodology, each convolutional layer has a field called `stream_weights`, which specifies the depth of weights that are evicted to off-chip memory.
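To inspect the per-layer `stream_weights` depths, a generic walk over the parsed config can be used. This sketch deliberately avoids assuming the surrounding JSON layout, since only the per-layer `stream_weights` field is documented above; the `"name"` key used to label each layer is an assumption.

```python
def stream_weights_depths(node):
    """Recursively collect {layer name: stream_weights depth} from a parsed config.

    Walks dicts and lists generically rather than assuming a fixed file
    layout; any dict carrying a "stream_weights" field is treated as a
    convolutional layer entry (the "name" key is an assumption).
    """
    found = {}
    if isinstance(node, dict):
        if "stream_weights" in node:
            found[node.get("name", "<unnamed>")] = node["stream_weights"]
        for value in node.values():
            found.update(stream_weights_depths(value))
    elif isinstance(node, list):
        for item in node:
            found.update(stream_weights_depths(item))
    return found

# Example usage:
# import json
# with open("output/resnet18/w4a5_bfp_zcu102/config.json") as f:
#     print(stream_weights_depths(json.load(f)))
```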