NVIDIA Corporation
- 9.2k followers
- 2788 San Tomas Expressway, Santa Clara, CA, 95051
- http://www.nvidia.com
Repositories
- TensorRT-LLM Public
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
- monai-cloud-api Public
Development of the MONAI Cloud API for intelligent imaging and learning tools, supporting medical imaging and AI-driven services.
- TensorRT-Model-Optimizer Public
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.