gpu
Here are 341 public repositories matching this topic...
GPU Accelerated t-SNE for CUDA with Python bindings
Updated Oct 2, 2024 - Cuda
cuGraph - RAPIDS Graph Analytics Library
Updated Nov 10, 2024 - Cuda
FlashInfer: Kernel Library for LLM Serving
Updated Nov 10, 2024 - Cuda
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
Updated Nov 8, 2024 - Cuda
CUDA Kernel Benchmarking Library
Updated Oct 25, 2024 - Cuda
Graphics Processing Units Molecular Dynamics
Updated Nov 8, 2024 - Cuda
PopSift is an implementation of the SIFT algorithm in CUDA.
Updated Aug 15, 2024 - Cuda
A simple GPU hash table implemented in CUDA using lock free techniques
Updated Feb 7, 2024 - Cuda
Several optimization methods for half-precision general matrix multiplication (HGEMM) using Tensor Cores with the WMMA API and MMA PTX instructions.
Updated Sep 8, 2024 - Cuda
SDK for GPU accelerated genome assembly and analysis
Updated May 3, 2024 - Cuda
GPU-accelerated triangle mesh processing
Updated Oct 29, 2024 - Cuda
My own implementation of the WCSPH, DFSPH, and PBD fluid solvers using CUDA and C++
Updated Jul 11, 2019 - Cuda
cuVS - a library for vector search and clustering on the GPU
Updated Nov 10, 2024 - Cuda
CUDA Matrix Factorization Library with Alternating Least Squares (ALS)
Updated Aug 14, 2018 - Cuda