🌟 Vertex Centric approach for building GNN/TGNNs
-
Updated
May 27, 2024 - Python
🌟 Vertex Centric approach for building GNN/TGNNs
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
bilibili视频【CUDA 12.1 并行编程入门(Python语言版)】配套代码
Scripts to manage rocprof tracing of multi-process, multi-node program runs.
Fast deterministic all-Python Lennard-Jones particle simulator that utilizes Numba for GPU-accelerated computation.
A Taichi component for automatically compiling and launching compute graph.
simple ray tracer implemented in Python, capable of rendering 3D scenes with basic shapes, materials, and lighting.
Fundamentals of heterogeneous parallel programming with CUDA C/C++ at the beginner level.
A Bifrost plug-in for the Tensor-Core Correlator.
vgg16 inference implementation using tensorflow, numpy and pycuda
Boilerplate for GPU-Accelerated TensorFlow and PyTorch code on M1 Macbook
Implementation of a Transformer, but completely in Triton
PRACE Summer of HPC 2020, Performance of Parallel Python Programs on New HPC Architectures
Introduction to PyCuda GPU programming.
Efficient and Scalable Physics-Informed Deep Learning and Scientific Machine Learning on top of Tensorflow for multi-worker distributed computing
Lesson material for the HIP101 workshop on porting CUDA codes to HIP
A helper package to easily time Numba CUDA GPU events ⌛
pyCUDA implementation of forward propagation for Convolutional Neural Networks
CUDA accelerated raytracer using PyCUDA in Python
Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions, implementation in Python.
Add a description, image, and links to the gpu-programming topic page so that developers can more easily learn about it.
To associate your repository with the gpu-programming topic, visit your repo's landing page and select "manage topics."