gpu

Here are 324 public repositories matching this topic...

rapidsai / raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.

Updated Jul 11, 2024
Cuda

flashinfer-ai / flashinfer

Star

FlashInfer: Kernel Library for LLM Serving

gpu cuda pytorch tvm llm-inference flash-attention large-large-models

Updated Jul 12, 2024
Cuda

morousg / cvGPUSpeedup

Star

A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!

computer-vision gpu cuda deeplearning speedup

Updated Jul 11, 2024
Cuda

rapidsai / cugraph

Star

cuGraph - RAPIDS Graph Analytics Library

graph graph-algorithms gpu cuda nvidia complex-networks graph-analysis graphml graph-framework rapids

Updated Jul 11, 2024
Cuda

rapidsai / cuvs

Star

cuVS - a library for vector search and clustering on the GPU

machine-learning information-retrieval statistics clustering gpu distance cuda sparse nearest-neighbors similarity-search vector-similarity anns vector-search llm vector-store neighborhood-methods

Updated Jul 12, 2024
Cuda

brucefan1983 / GPUMD

Star

Graphics Processing Units Molecular Dynamics

machine-learning neural-network simulation gpu cuda molecular-dynamics neuroevolution high-performance-computing molecular-dynamics-simulation phonon physics-simulation natural-evolution-strategies heat-transport gpumd machine-learning-potential

Updated Jul 11, 2024
Cuda

pyscf / gpu4pyscf

Star

A plugin to use Nvidia GPU in PySCF package

gpu

Updated Jul 11, 2024
Cuda

GooFit / GooFit

Star

Code repository for the massively-parallel framework for maximum-likelihood fits, implemented in CUDA/OpenMP

gpu physics cuda fitting gpu-computing root-cern thrust omp

Updated Jul 8, 2024
Cuda

salvatore-dimartino / mcp-gpu

Star

Maximum clique solver running on GPUs

gpu parallel-computing maximum-clique

Updated Jul 8, 2024
Cuda

NVIDIA-Merlin / HierarchicalKV

Star

HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of HierarchicalKV is to store key-value feature-embeddings on high-bandwidth memory (HBM) of GPUs and in host memory. It also can be used as a generic key-value storage.

gpu cuda recommender-system hashtable key-value-store dynamic-embedding embedding-storage

Updated Jul 7, 2024
Cuda

AntonioBerna / nvidia-devices

Star

Tool for get information about NVIDIA GPUs

cpp gpu cuda nvidia

Updated Jul 4, 2024
Cuda

hrshl212 / Preconditioned-Conjugate-Gradient-Method-in-CUDA

Star

Preconditioned conjugate gradient method with ILU preconditioner implemented in CUDA

gpu cuda conjugate-gradient preconditioning incomplete-lu-factorizations

Updated Jul 3, 2024
Cuda

rob147147 / CUDA-Riesel-Sieve

Star

A CUDA based sieve for numbers of the form k*b^n-1. This project is heavily based on SR2Sieve.

gpu cuda sieve prime-numbers discrete-logarithm bsgs baby-step-giant-step sr2sieve montmul riesel

Updated Jun 29, 2024
Cuda

hrishi-13 / Optimized-Kronecker-Product-and-Matrix-Computation-for-GPUs

Star

Enhanced the computation runtime for (C = A⊗B^T ) and (AB + CD^T ) by effectively leveraging Memory Coalescing and Shared Memory optimization techniques, while working with a 1024x1024 sized matrix.

c programming algorithms gpu cuda high-performance-computing

Updated Jun 27, 2024
Cuda

hrishi-13 / GPU-Accelerated-Facility-Reservation-System

Star

Designed efficient room reservation system for facility rooms with flexible slot booking (1-24 time slots), leveraged GPU parallel processing techniques to concurrently process a large number of user requests.

c programming algorithms cpp gpu cuda high-performance-computing

Updated Jun 27, 2024
Cuda

hrishi-13 / Hierarchical-Graph-Activation-Model-for-NVIDIA-GPUs

Star

Implemented activation rules and a depth-first hierarchy strategy, to efficiently optimize activation point requirements for each node within a large-scale graph with 10 Million vertices and 100 Million edges.

c programming algorithms gpu cuda high-performance-computing

Updated Jun 27, 2024
Cuda

CyprienBosserelle / BG_Flood

Star

Numerical model for simulating shallow water hydrodynamics on the GPU using an Adaptive Mesh Refinment type grid. The model was designed with the goal of simulating inundation (River, Storm surge or tsunami). The model uses a Block Uniform Quadtree approach that runs on the GPU but the adaptive/multi-resolution/AMR is being implemented and not y…

storm gpu surge adaptive flood inundation rain river tsunami