NVIDIA Corporation

All

582 repositories

Fuser
Public
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
C++
•
Other
•61•341•260•173•Updated Jul 5, 2025Jul 5, 2025
cuCollections
Public
datastructures cpp gpu cuda hashmap cpp17 hashset hashtable
C++
•
Apache License 2.0
•96•546•60•23•Updated Jul 5, 2025Jul 5, 2025
cuda-quantum
Public
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
python cpp quantum quantum-computing hacktoberfest quantum-programming-language quantum-algorithms quantum-machine-learning unitaryhack
C++
•
Other
•258•738•376•75•Updated Jul 5, 2025Jul 5, 2025
Megatron-LM
Public
Ongoing research training transformer models at scale
transformers model-para large-language-models
Python
•
Other
•2.9k•13k•325•205•Updated Jul 5, 2025Jul 5, 2025
NeMo-Skills
Public
A project to improve skills of large language models
Python
•
Apache License 2.0
•78•448•14•6•Updated Jul 5, 2025Jul 5, 2025
NeMo
Public
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation large-language-models
Python
•
Apache License 2.0
•3k•15k•50•102•Updated Jul 5, 2025Jul 5, 2025
cccl
Public
CUDA Core Compute Libraries
cpp hpc gpu modern-cpp parallel-computing cuda nvidia gpu-acceleration cuda-kernels gpu-computing
C++
•
Other
•231•1.7k•993•133•Updated Jul 5, 2025Jul 5, 2025
bionemo-framework
Public
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
machine-learning gpu pytorch drug-discovery
Jupyter Notebook
•
Other
•74•458•61•66•Updated Jul 4, 2025Jul 4, 2025
nvidia-resiliency-ext
Public
NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to failures and interruptions.
Python
•
Other
•26•184•1•14•Updated Jul 4, 2025Jul 4, 2025
TensorRT-LLM
Public
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.
C++
•
Apache License 2.0
•1.6k•11k•651•320•Updated Jul 4, 2025Jul 4, 2025
aistore
Public
AIStore: scalable storage for AI applications
kubernetes sds erasure-coding object-storage software-defined multiple-backends batch-jobs distributed-shuffle linear-scalability etl-offload
Go
•
MIT License
•213•1.5k•1•1•Updated Jul 4, 2025Jul 4, 2025
torch-harmonics
Public
Differentiable signal processing on the sphere for PyTorch
machine-learning signal-processing sphere pytorch
Jupyter Notebook
•
Other
•48•475•4•4•Updated Jul 4, 2025Jul 4, 2025
context-aware-rag
Public
Context-Aware RAG library for Knowledge Graph ingestion and retrieval functions.
Python
•
Apache License 2.0
•2•19•0•0•Updated Jul 4, 2025Jul 4, 2025
kvpress
Public
LLM KV cache compression made easy
python transformers inference pytorch kv-cache large-language-models llm long-context kv-cache-compression
Python
•
Apache License 2.0
•45•526•6•2•Updated Jul 4, 2025Jul 4, 2025
mlperf-common
Public
NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions
Python
•
Apache License 2.0
•14•27•0•1•Updated Jul 4, 2025Jul 4, 2025
JAX-Toolbox
Public
JAX-Toolbox
Python
•
Apache License 2.0
•61•320•79•39•Updated Jul 4, 2025Jul 4, 2025
Megatron-Energon
Public
Megatron's multi-modal data loader
Python
•
Other
•25•215•14•5•Updated Jul 4, 2025Jul 4, 2025
NVFlare
Public
NVIDIA Federated Learning Application Runtime Environment
python decentralized pet privacy-protection federated-learning federated-analytics federated-computing
Python
•
Apache License 2.0
•200•755•11•10•Updated Jul 4, 2025Jul 4, 2025
cuEquivariance
Public
cuEquivariance is a math library that is a collective of low-level primitives and tensor ops to accelerate widely-used models, like DiffDock, MACE, Allegro and NEQUIP, based on equivariant neural networks.
Python
•17•250•5•3•Updated Jul 4, 2025Jul 4, 2025
NV-Kernels
Public
Ubuntu kernels which are optimized for NVIDIA server systems
C
•
Other
•38•47•0•2•Updated Jul 4, 2025Jul 4, 2025
grove
Public
Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
Go
•
Apache License 2.0
•6•11•5•1•Updated Jul 4, 2025Jul 4, 2025
nvidia-container-toolkit
Public
Build and run containers leveraging NVIDIA GPUs
Go
•
Apache License 2.0
•367•3.4k•383•19•Updated Jul 4, 2025Jul 4, 2025
NeMo-Guardrails
Public
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
Python
•
Other
•497•4.8k•116•31•Updated Jul 4, 2025Jul 4, 2025
cloudai
Public
CloudAI Benchmark Framework
Python
•
Apache License 2.0
•30•68•0•13•Updated Jul 4, 2025Jul 4, 2025
recsys-examples
Public
Examples for Recommenders - easy to train and deploy on accelerated infrastructure.
pytorch recommender-system recommenders generative-recommenders
Python
•
Other
•17•68•20•8•Updated Jul 4, 2025Jul 4, 2025
k8s-dra-driver-gpu
Public
NVIDIA DRA Driver for GPUs
Go
•
Apache License 2.0
•78•387•47•22•Updated Jul 4, 2025Jul 4, 2025
cuopt
Public
NVIDIA cuOpt is an open-source GPU-accelerated optimization engine delivering near real-time solutions for complex decision-making challenges.
Cuda
•
Apache License 2.0
•37•255•48•18•Updated Jul 4, 2025Jul 4, 2025
warp
Public
A Python framework for accelerated simulation, data generation and spatial computing.
python gpu cuda nvidia gpu-acceleration differentiable-programming nvidia-warp
Python
•
Apache License 2.0
•328•5.3k•211•7•Updated Jul 4, 2025Jul 4, 2025
KAI-Scheduler
Public
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
Go
•
Apache License 2.0
•72•681•20•18•Updated Jul 4, 2025Jul 4, 2025
NeMo-speech-data-processor
Public
A toolkit for processing speech data and creating speech datasets
Python
•
Apache License 2.0
•29•124•5•22•Updated Jul 4, 2025Jul 4, 2025