A nearly-live implementation of OpenAI's Whisper.
-
Updated
Jun 7, 2024 - Python
A nearly-live implementation of OpenAI's Whisper.
A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
Chat With RTX Python API
大模型推理框架加速,让 LLM 飞起来
Add-in for new Outlook that adds LLM new features (Composition, Summarizing, Q&A). It uses a local LLM via Nvidia TensorRT-LLM
Add a description, image, and links to the tensorrt-llm topic page so that developers can more easily learn about it.
To associate your repository with the tensorrt-llm topic, visit your repo's landing page and select "manage topics."