High performance computing with GPU
Updated Jan 31, 2016 - Cuda
Code for sparse matrix-vector multiplication, parallelised using CUDA and MPI.
This project is part of my thesis, which researches and applies the general-purpose graphics processing unit (GPGPU) in high performance computing. In this project, I use GPU computing and the CUDA parallel programming model to solve the diffusion equation.
This repo contains a program for matrix multiplication using CUDA.
Level 3 matrix multiplication using both cuBLAS and MKL.
A simple and understandable CUDA kernel for the batch-matmul operation.
Workshop 4 Evaluation: CUDA
Classical and Strassen's Matrix Multiplication in CUDA and OpenMP
Machine problems
Lab exercise on CUDA programming for the Parallel Processing course at NTUA
Inline PTX Assembly in CUDA example
CUDA kernel functions