CUDA C++ Core Libraries
-
Updated
Jun 13, 2024 - C++
CUDA C++ Core Libraries
μ-Cuda, COVER THE LAST MILE OF CUDA. With features: intellisense-friendly, structured launch, automatic cuda graph generation and updating.
A parallel and GPU-accelerated Code for Real-Space All-Electron Linear-Scaling Density Functional Theory
A General-purpose Parallel and Heterogeneous Task Programming System
Thin, unified, C++-flavored wrappers for the CUDA APIs
ThreadPoolManager is a C++ project that implements an efficient multi-threading system using a thread pool for generic functions of the same type and different tasks. It includes task management, synchronization mechanisms, and thread-safe logging to demonstrate concurrent task execution.
TinyChatEngine: On-Device LLM Inference Library
fast parallel visualization of julia sets with CUDA and OpenMP
My research, playground, techniques with Parallel Programming
Eikonal CUDA implementation for the Advanced Methods for Scientific Computing (AMSC) Course @POLIMI
A simple ray-tracing program implemented with CUDA.
C++ framework for deep neural networks
Parallel matrix inversion using Gauss-Jordan method
Parallel LiDAR Point Cloud Preprocessing for Autonomous Driving Applications
Repositório do trabalho prático no âmbito da UC de Computação Paralela (CP) - Mestrado em Engenharia Informática (MEI/MIEI) - Universidade do Minho (UMinho)
An implementation of HIP that works on CPUs, across OSes.
YOLOv9 Tensorrt deployment acceleration,provide two implementation methods: C++and Python🔥🔥🔥
A modern C++ reimplementation of Darknet with CUDA support for efficient neural network inference
Add a description, image, and links to the cuda-programming topic page so that developers can more easily learn about it.
To associate your repository with the cuda-programming topic, visit your repo's landing page and select "manage topics."