Skip to content

GPU Acceleration in Python Using Elementwise Kernels

Notifications You must be signed in to change notification settings

z-ryan1/gtc_spring_2021

Repository files navigation

GTC Spring 2021

Use this repository to parallelize operations on the GPU by using Elementwise Kernel functions from the CuPy API. Definitions of Elementwise Kernels follow a distinct template with four essential parameters and their goal is to provide elementwise operations that target the NVIDIA GPU without the need of CUDA code. The benefit of Elementwise Kernel functions will be shown through the comparison of serial Python, Elementwise Kernel functions, and CuPy’s Raw Kernels performing the same functionality. Raw Kernels are a user-defined kernel using raw CUDA source. Performance analysis can be done using NVIDIA’s Nsight Systems system-wide profiler.

Results on Quadro RTX 6000

Results

About

GPU Acceleration in Python Using Elementwise Kernels

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published