GitHub - midiareshadi/systolic_array_RTL_implementation

Systolic Array

This repository provides an RTL implementation of a fundamental systolic array, a core computation unit for matrix multiplication. Systolic arrays are known for their ability to perform multiplication and addition operations at maximum concurrency, achieving low computation time and high efficiency.

RTL implementation

I described a baseline systolic array in both Verilog and SystemC. The SystemC code is parametric, allowing the dimensions of PE arrays to be specified as constants. In contrast, the Verilog code is written for a fixed base 2x2 dimension. While the iVerilog (Icarus) compiler supports parametric descriptions for any nxn dimensions, this code fails to compile correctly when using the ISim (ISE) compiler. However, the Verilog code is synthesizable and has been verified using Vivado.To construct a higher-dimensional systolic array in Verilog, we can employ basic 2x2 systolic array instances as building blocks. The directory tree is shown as:

.
|-- Accelergy_Configuration
|   `-- systolic_array_2x2
|-- systemC_RTL
|   |-- input_matrices
|   `-- vcd_file
|-- testbench_File_Gen
|   |-- dense_mat_gen
|   `-- sparse_generator
`-- verilog
    `-- systolic_array_2x2

SystemC Description

In the SystemC description, input matrices are stored in text files where A_TB.txt and B_TB.txt are zero-padded versions of the original A and B matrices, and the original format of A and B matrices is located in the input_matrices folder. The testbench generates a VCD file containing the simulation output, which is stored in the vcd_file folder. This VCD file can be visualized using the GTKWave application or online tools like surfer-project.

Verilog Description

The Verilog description closely mirrors the SystemC description, but with the limitation that Verilog code is not as parametric as SystemC. The basic design for creating higher dimensional systolic arrays is demonstrated with a 2x2 systolic array.

Using Accelergy

The Accelergy_Configuration folder includes the architecture definition and activity file (action count) of a 2x2 systolic array which are defined in YAML file format. We can use Accelergy to estimate the energy consumption. To run the Accelergy configuration, first, the Accelergy needs to be installed using pip install .. To run Accelergy configuration:

cd Accelergy_Configuration/systolic_array_2x2
accelergy -o output/ input/*.yaml input/components/*.yaml -v 1

PE state diagram

The processing element (PE) is the heart and soul of a systolic array. It is responsible for performing the computations that make the systolic array so efficient. Like any digital logic circuit, the PE's control path is based on a finite state machine (FSM). Both the SystemC and Verilog descriptions of the systolic array use an FSM with six states.

graph LR
A((reset)) --> B((calc_0))
B((calc_0)) --result_ld = 1--> C((ld_result))
B((calc_0)) --result_ld = 0--> D((calc_1))
D((calc_1)) --result_ld = 0--> B((calc_0))
D((calc_1)) --result_ld = 1--> C((ld_result))
C((ld_result)) --> E((drain_0))
E((drain_0)) --> F((drain_1))
F((drain_1)) --> E((drain_0))

Name		Name	Last commit message	Last commit date
Latest commit History 99 Commits
Accelergy_Configuration/systolic_array_2x2		Accelergy_Configuration/systolic_array_2x2
SystemC_RTL		SystemC_RTL
testbench_File_Gen		testbench_File_Gen
verilog		verilog
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Accelergy_Configuration/systolic_array_2x2

Accelergy_Configuration/systolic_array_2x2

SystemC_RTL

SystemC_RTL

testbench_File_Gen

testbench_File_Gen

verilog

verilog

.gitignore

.gitignore

README.md

README.md

Repository files navigation

Systolic Array

RTL implementation

SystemC Description

Verilog Description

Using Accelergy

PE state diagram

About

Releases

Packages

Languages

midiareshadi/systolic_array_RTL_implementation

Folders and files

Latest commit

History

Repository files navigation

Systolic Array

RTL implementation

SystemC Description

Verilog Description

Using Accelergy

PE state diagram

About

Resources

Stars

Watchers

Forks

Languages