GPGPU-based SVD solver for large dense matrices.
-
Updated
Jul 6, 2021 - C++
GPGPU-based SVD solver for large dense matrices.
Parallel Kakuro Solver with both OpenMP and CUDA
High-performance CUDA C++ implementation of Graph Convolutional Networks
ThreadPoolManager is a C++ project that implements an efficient multi-threading system using a thread pool for generic functions of the same type and different tasks. It includes task management, synchronization mechanisms, and thread-safe logging to demonstrate concurrent task execution.
A modern C++ reimplementation of Darknet with CUDA support for efficient neural network inference
CaTS: Calorimeter and Tracker Simulation is a flexible and extend-able framework for the simulation of various detector systems. CaTS replaces G4OpticksTest and serves as an example that demonstrates how to use opticks from within Geant4 for the creation and propagation of optical photons.
Solving the N-Queens problem with OpenMP- and CUDA-implemented approaches (Edinburgh Napier University, Concurrent and Parallel Systems module coursework 2)
Repositório do trabalho prático no âmbito da UC de Computação Paralela (CP) - Mestrado em Engenharia Informática (MEI/MIEI) - Universidade do Minho (UMinho)
Le projet consiste en une simulation de foule sur une grille, avec des versions parallélisées sur carte graphique. L'objectif est de modéliser le mouvement des individus dans un environnement en utilisant des paramètres tels que la dimension de la grille, le nombre d'individus et exporte de résultat de chaque frame dans unfichier bin pour analyse.
Deepstream/Gstreamer custom element to access the buffer in gpu memory and map it to GpuMat. Purpose of the element is to use it for preprocessing where it has been written using basic cuda programming.
Introduction to parallel scientific computing
Parallel LiDAR Point Cloud Preprocessing for Autonomous Driving Applications
fast parallel visualization of julia sets with CUDA and OpenMP
A Comprehensive Implementation and Analysis of Heat Simulation by MPI, thread, OpenMP and CUDA
Code to solve the Quadratic Assignment Problem running the Differential Evolution algorithm, using CUDA acceleration.
A simple ray-tracing program implemented with CUDA.
Simple C++ implementation of a sparsely connected multi-layer neural network using OpenMP and CUDA for parallelization.
Add a description, image, and links to the cuda-programming topic page so that developers can more easily learn about it.
To associate your repository with the cuda-programming topic, visit your repo's landing page and select "manage topics."