Instant neural graphics primitives: lightning fast NeRF and more
Updated Apr 18, 2024 - CUDA
CUDA® is a parallel computing platform and programming model developed by NVIDIA for general-purpose computing on graphics processing units (GPUs). With CUDA, developers can dramatically speed up computing applications by harnessing the power of GPUs.
cuGraph - RAPIDS Graph Analytics Library
RAFT contains fundamental, widely used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for writing high-performance applications more easily.
Fuse multiple depth frames into a TSDF voxel volume.
GPU Accelerated t-SNE for CUDA with Python bindings
PopSift is an implementation of the SIFT algorithm in CUDA.
Graphics Processing Units Molecular Dynamics
🎉CUDA notes / hand-written CUDA kernels for large models / C++ notes, updated sporadically: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.
SDK for GPU accelerated genome assembly and analysis
FlashInfer: Kernel Library for LLM Serving
CUDA Kernel Benchmarking Library
MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment
Zero-knowledge template library
Several optimization methods for half-precision general matrix multiplication (HGEMM) using Tensor Cores with the WMMA API and MMA PTX instructions.
cuVS - a library for vector search and clustering on the GPU
CUDA Matrix Factorization Library with Alternating Least Squares (ALS)
Created by NVIDIA
Released June 23, 2007