CUDA Core Compute Libraries
-
Updated
Jul 11, 2024 - C++
CUDA Core Compute Libraries
This is an archive of materials produced for an introductory class on CUDA programming at Stanford University in 2010
Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
🚀 TensorRT-YOLO: Supports YOLOv3, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv9, YOLOv10, and PP-YOLOE using TensorRT acceleration with EfficientNMS, CUDA Kernels and CUDA Graphs!
Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research
Astrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees
(REOS) Radar and Electro-Optical Simulation Framework written in C++.
Quantum-inspired evolutionary algorithms for Optimization problems
Some CUDA design patterns and a bit of template magic for CUDA
Parallel implementation of NW algorithms with NVIDIA GPU and CUDA C++
A CUDA project that implements optimizations of neural network operations on the GPU.
Surface triangle mesh smoothing w.r.t. vertex normals to preserve surface curvature. Runs parallel on OpenMP or CUDA if available. Vertex indices are reordered s.t. they are localized into patches to improve CUDA blocking into shared memory & spatial cache locality.
CUDA Gemm Convolution implementation
Simple CUDA Kernel for applying Sobel Filter on the image.
sharpen images using cuda and c++
Explore performance implications of various matrix multiplication approaches using GPU/CUDA compared to CPU side processing
An modern formatting library with CUDA support (fork of fmt)
Solving the N-Queens problem with OpenMP- and CUDA-implemented approaches (Edinburgh Napier University, Concurrent and Parallel Systems module coursework 2)
Hand-crafted Cuda Canny Edge Detector on top of your webcam feed, using OpenCV, modern CMake and Conan third parties
Optimization of Attention layers for efficient inferencing on the CPU and GPU. It covers optimizations for AVX and CUDA also efficient memory processing techniques.
Add a description, image, and links to the cuda-kernels topic page so that developers can more easily learn about it.
To associate your repository with the cuda-kernels topic, visit your repo's landing page and select "manage topics."