A General-purpose Parallel and Heterogeneous Task Programming System
-
Updated
Jun 10, 2024 - C++
A General-purpose Parallel and Heterogeneous Task Programming System
CUDA C++ Core Libraries
Thin, unified, C++-flavored wrappers for the CUDA APIs
This is an archive of materials produced for an introductory class on CUDA programming at Stanford University in 2010
TinyChatEngine: On-Device LLM Inference Library
An implementation of HIP that works on CPUs, across OSes.
simple GPU ransac fitting of multiple lines on 2d/3d point cloud
μ-Cuda, COVER THE LAST MILE OF CUDA. With features: intellisense-friendly, structured launch, automatic cuda graph generation and updating.
YOLOv9 Tensorrt deployment acceleration,provide two implementation methods: C++and Python🔥🔥🔥
Viewshed Analysis leveraging general-purpose computing on graphics processing units using CUDA .
Face Transformation app and library using python/c++/cuda
A parallel and GPU-accelerated Code for Real-Space All-Electron Linear-Scaling Density Functional Theory
📀NVIDIA DeepStream integrated GStreamer Plugin. It can blur objects with cuda cores on Jetson boards. Fast and smooth since everything is done on NVMM.🏎
High-Performance Memory Optimal CNN
📀NVIDIA DeepStream integrated GStreamer Plugin. Mask objects with cuda cores on Jetson boards. Fast and smooth since everything is done on NVMM.🏎
Cloth sim with Lighthouse 2 framework for real-time ray tracing
CUDA solutions for the lab assignments in the UIUC-ECE408 Applied Parallel Programming course.
CUDA Gemm Convolution implementation
Learning CUDA Programming
Reconstruct mesh from point cloud data generated by 3D scanner
Add a description, image, and links to the cuda-programming topic page so that developers can more easily learn about it.
To associate your repository with the cuda-programming topic, visit your repo's landing page and select "manage topics."