SPARTA

SPARse acceleration on Tensor Architecture

The project investigates new data structures and compression algorithms that exploit the capabilities of new architectures, specifically those designed for deep learning, to accelerate sparse and irregular applications such as graph analytics and arbitrary sparse DNNs and GCNs. SPARTA also targets productivity and performance portability across different AI accelerators by providing an abstraction layer.

The repository contains stable code for reordering and compressing sparse matrices into dense-block data structures. The reordering algorithm matches rows (or columns) with similar sparsity patterns and builds dense blocks from them. Pattern similarity is first estimated with a hash function, then refined with a tunable algorithm that matches patterns with high cosine similarity.
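As a minimal sketch of the refinement step, the cosine similarity of two rows' binary sparsity patterns can be computed from their sorted column-index lists. The helper below is illustrative only, not the repository's actual implementation (which hashes patterns first and compares only candidate pairs):

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Cosine similarity between two binary row patterns, each given as a
// sorted list of column indices (1 where the row has a nonzero).
// Rows whose similarity exceeds a tunable threshold are grouped so that
// their union forms a dense block.
double pattern_cosine_similarity(const std::vector<int>& a,
                                 const std::vector<int>& b) {
    if (a.empty() || b.empty()) return 0.0;
    std::size_t i = 0, j = 0, common = 0;
    while (i < a.size() && j < b.size()) {       // count shared columns
        if (a[i] == b[j])     { ++common; ++i; ++j; }
        else if (a[i] < b[j]) { ++i; }
        else                  { ++j; }
    }
    // For 0/1 vectors: dot product = |A ∩ B|, norms = sqrt(|A|), sqrt(|B|).
    return common / (std::sqrt((double)a.size()) * std::sqrt((double)b.size()));
}
```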

The repository also contains code for sparse-dense matrix-matrix multiplication that exploits this dense-block data structure.

Input sparse matrices are stored in Compressed Sparse Row (CSR) or Compressed Sparse Column (CSC) format. A variant of the Variable Block Compressed Sparse Row (or Column) format is used to store block-sparse matrices.
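For concreteness, here is a minimal sketch of what such a variable-block storage and the block-wise multiplication it enables might look like. All type and field names are illustrative assumptions, not the repository's actual definitions:

```cpp
#include <vector>

// Illustrative variable-block CSR: nonzero blocks are stored as dense
// column-major tiles and indexed CSR-style at block granularity.
struct VBS {
    std::vector<int>    row_part; // block-row r spans rows [row_part[r], row_part[r+1])
    std::vector<int>    col_part; // column partition, same convention
    std::vector<int>    blk_ptr;  // start of each block-row in blk_col (CSR-style)
    std::vector<int>    blk_col;  // block-column index of each nonzero block
    std::vector<float*> blk_val;  // dense values of each nonzero block
};

// C += A * B, where A is block-sparse and B, C are dense row-major with
// n columns. Each nonzero block contributes a small dense GEMM, which is
// the operation that tensor hardware accelerates.
void vbs_spmm(const VBS& A, const float* B, float* C, int n) {
    int block_rows = (int)A.row_part.size() - 1;
    for (int r = 0; r < block_rows; ++r) {
        int r0 = A.row_part[r], rh = A.row_part[r + 1] - r0;
        for (int idx = A.blk_ptr[r]; idx < A.blk_ptr[r + 1]; ++idx) {
            int c0 = A.col_part[A.blk_col[idx]];
            int cw = A.col_part[A.blk_col[idx] + 1] - c0;
            const float* blk = A.blk_val[idx];   // rh x cw tile, column-major
            for (int i = 0; i < rh; ++i)
                for (int k = 0; k < cw; ++k)
                    for (int j = 0; j < n; ++j)
                        C[(r0 + i) * n + j] += blk[k * rh + i] * B[(c0 + k) * n + j];
        }
    }
}
```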

SPARTA requires CUDA >= 10.0.

CUDA install

PRELIMINARY RESULTS

We have compared our routine with cusparse_spmm and cublas_gemm, the two main CUDA routines for sparse and dense matrix multiplication, respectively.

CUSPARSE COMPARISON: Preliminary results show that our routine is faster than cusparse_spmm when the density inside blocks is greater than roughly 2% (since the total density is the in-block density times the fraction of nonzero blocks, in this experiment the 2% threshold corresponds to a total density of 0.2%).

CUBLAS COMPARISON: Preliminary results show that our routine is faster than cublas_gemm when fewer than 20% of the blocks are nonzero. (Both cuBLAS and our routine treat nonzero blocks as dense, so changing the density inside the blocks does not affect this result.)

PERFORMANCE LANDSCAPE: The image below shows the fastest algorithm for each data point as both the density of nonzero blocks and the overall matrix density vary. For matrices in the green zone, SPARTA is the fastest choice.

STRUCTURE

The files are organized as follows:

SPARTA

  • include
  • obj
  • programs
  • src
  • test

Each folder contains:

  • general: files needed by all versions
  • cuda: files needed by the CUDA version
  • mkl: files needed by the MKL version

RUNNING A TEST

Use make to build the CUDA test executable. The executable will be placed in programs/cuda; you can run it with different command-line arguments to test different features (an example invocation is given after the option list below).
Use 'source ./scripts/synthetic.sh' from the main folder to run and save a set of experiments.

Options for the cuda_test:

  • -i: select input example

    • 1: Random CSR
    • 3: Matrix Market (MTX) file
    • 4: Random Variable Block matrix
  • -a: algorithm selection

    • -1: all
    • 1: cublas gemm
    • 2: VBSmm
    • 3: VBSmm with no zeros
    • 4: VBSmm with asymmetric hash-angle reordering
    • 5: cusparse spmm
  • -b: density of blocks (% of nonzero blocks) (only for i = 4)

  • -f: source file (only for i = 2, 3)

  • -m: first matrix rows

  • -n: second matrix columns

  • -k: first matrix columns

  • -p: size of VBS blocks

  • -q: entry density (% of nonzero entries; for i = 4, % of nonzeros inside each nonzero block)

  • -r: number of experiment repetitions

  • -s: scramble input matrix.

    • 0: no scramble.
    • 1: scramble rows
  • -S: random seed

  • -v: verbosity level (-1: repeated-experiment format)

  • -w: warmup repetitions
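For example, a run on a random variable-block matrix might look like the following (the exact executable name and the percentage units for -b and -q are assumptions; check programs/cuda after building):

```
./programs/cuda/cuda_test -i 4 -m 2048 -k 2048 -n 2048 -p 64 -b 10 -q 20 -a -1 -r 5 -w 2
```

This asks for a 2048x2048 random variable-block matrix with blocks of size 64, 10% of blocks nonzero, 20% density inside each nonzero block, and runs all algorithms for 5 repetitions after 2 warmup runs.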

ROADMAP

  • Support for the NVIDIA architecture (DONE!)
  • New algorithm for matrix reordering and compression that supports non-coherent blocks (DONE!)
  • New algorithm for sparse-sparse multiplication
  • Bring back support for MKL
  • Support for other AI accelerators

Stay Tuned!
