Pinned Loading
Repositories
- TensorRT-LLM Public Forked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
binarybrainiacs/TensorRT-LLM’s past year of commit activity - NeMo Public Forked from NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
binarybrainiacs/NeMo’s past year of commit activity - pytorch Public Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
binarybrainiacs/pytorch’s past year of commit activity - XNNPACK Public Forked from google/XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
binarybrainiacs/XNNPACK’s past year of commit activity - pthread-win32 Public Forked from GerHobbelt/pthread-win32
clone of pthread-win32 (a.k.a. pthreads4w) + local tweaks (including MSVC2008 - MSVC2022 project files)
binarybrainiacs/pthread-win32’s past year of commit activity