Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    NVIDIA cuOpt is an open-source GPU-accelerated optimization engine delivering near real-time solutions for complex decision-making challenges.

    Cuda 278 44

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 340 46

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 16k 1.4k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.6k 212

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 3.4k 372

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.3k 782

Repositories

Showing 10 of 583 repositories
  • TensorRT-LLM Public

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.

    NVIDIA/TensorRT-LLM’s past year of commit activity
    C++ 11,001 Apache-2.0 1,582 678 333 Updated Jul 14, 2025
  • audio-flamingo Public

    PyTorch implementation of Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.

    NVIDIA/audio-flamingo’s past year of commit activity
    499 30 3 7 Updated Jul 14, 2025
  • k8s-operator-libs Public

    A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.

    NVIDIA/k8s-operator-libs’s past year of commit activity
    Go 24 Apache-2.0 19 1 2 Updated Jul 14, 2025
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    NVIDIA/cuda-quantum’s past year of commit activity
    C++ 745 260 377 (18 issues need help) 65 Updated Jul 14, 2025
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    NVIDIA/Megatron-LM’s past year of commit activity
    Python 12,873 2,927 335 211 Updated Jul 14, 2025
  • NeMo Public

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

    NVIDIA/NeMo’s past year of commit activity
    Python 15,075 Apache-2.0 2,990 48 111 Updated Jul 14, 2025
  • recsys-examples Public

    Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

    NVIDIA/recsys-examples’s past year of commit activity
    Python 75 18 20 (1 issue needs help) 10 Updated Jul 14, 2025
  • spark-rapids Public

    Spark RAPIDS plugin - accelerate Apache Spark with GPUs

    NVIDIA/spark-rapids’s past year of commit activity
    Scala 909 Apache-2.0 255 1,656 (45 issues need help) 28 Updated Jul 14, 2025
  • Fuser Public

    A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

    NVIDIA/Fuser’s past year of commit activity
    C++ 343 61 255 (11 issues need help) 172 Updated Jul 14, 2025
  • NeMo-Skills Public

    A project to improve skills of large language models

    NVIDIA/NeMo-Skills’s past year of commit activity
    Python 458 Apache-2.0 84 28 3 Updated Jul 13, 2025