Use this repository to parallelize operations on the GPU with Elementwise Kernel functions from the CuPy API. An Elementwise Kernel definition follows a distinct template with four essential parameters: the input argument list, the output argument list, the operation applied to each element, and the kernel name. Its goal is to provide elementwise operations that target NVIDIA GPUs without the need to write a full CUDA kernel. The repository shows the benefit of Elementwise Kernel functions by comparing serial Python, Elementwise Kernel functions, and CuPy’s Raw Kernels performing the same operation. A Raw Kernel is a user-defined kernel written in raw CUDA source. Performance can be analyzed with NVIDIA’s Nsight Systems system-wide profiler.
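As a minimal sketch of the three approaches being compared, the block below writes one operation (a squared difference, chosen here purely for illustration and not taken from the repository) as serial Python, as a CuPy `ElementwiseKernel`, and as a CuPy `RawKernel`. The function names, the `float32` dtype, and the 256-thread block size are assumptions. The two GPU functions import CuPy lazily, so the serial baseline runs on any machine; the GPU versions need CuPy and an NVIDIA GPU, and the Raw Kernel version assumes 1-D input.

```python
def squared_diff_serial(x, y):
    """Serial Python baseline: one element at a time on the CPU."""
    return [(a - b) * (a - b) for a, b in zip(x, y)]


def squared_diff_elementwise(x, y):
    """Same operation as a CuPy ElementwiseKernel (needs CuPy and an NVIDIA GPU)."""
    import cupy as cp  # imported lazily so the baseline works without a GPU

    kernel = cp.ElementwiseKernel(
        'float32 x, float32 y',   # 1. input argument list
        'float32 z',              # 2. output argument list
        'z = (x - y) * (x - y)',  # 3. operation applied to each element
        'squared_diff',           # 4. kernel name
    )
    return kernel(cp.asarray(x, dtype=cp.float32),
                  cp.asarray(y, dtype=cp.float32))


def squared_diff_raw(x, y):
    """Same operation as a CuPy RawKernel written in raw CUDA source."""
    import cupy as cp

    kernel = cp.RawKernel(r'''
    extern "C" __global__
    void squared_diff_raw(const float* x, const float* y, float* z, int n) {
        int i = blockDim.x * blockIdx.x + threadIdx.x;
        if (i < n) {  // guard threads that fall past the end of the array
            float d = x[i] - y[i];
            z[i] = d * d;
        }
    }
    ''', 'squared_diff_raw')

    xg = cp.asarray(x, dtype=cp.float32)
    yg = cp.asarray(y, dtype=cp.float32)
    zg = cp.empty_like(xg)
    n = xg.size
    threads = 256                            # assumed block size
    blocks = (n + threads - 1) // threads    # enough blocks to cover n elements
    kernel((blocks,), (threads,), (xg, yg, zg, cp.int32(n)))
    return zg


if __name__ == "__main__":
    print(squared_diff_serial([1.0, 2.0, 3.0], [3.0, 2.0, 1.0]))
```

With the Elementwise Kernel, CuPy handles indexing and the launch configuration, whereas the Raw Kernel has to compute the thread index, check bounds, and choose grid and block sizes itself. To inspect the GPU versions in Nsight Systems, a script that calls them can be run under the profiler, for example `nsys profile python my_script.py`, where `my_script.py` is a hypothetical file name.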
z-ryan1/gtc_spring_2021
GPU Acceleration in Python Using Elementwise Kernels