tensorrt-llm

Star

Here are 6 public repositories matching this topic...

collabora / WhisperLive

Star

A nearly-live implementation of OpenAI's Whisper.

text-to-speech translation voice-recognition openai obs dictation whisper tensorrt tensorrt-llm whisper-tensorrt

Updated Jun 7, 2024
Python

huggingface / optimum-benchmark

Star

A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.

benchmark pytorch openvino onnxruntime text-generation-inference neural-compressor tensorrt-llm

Updated May 30, 2024
Python

rpehkone / Chat-With-RTX-python-api

Star

Chat With RTX Python API

tensorrt llm llm-inference tensorrt-llm mistral-7b llama2-13b chat-with-rtx nvidia-chat-with-rtx

Updated May 19, 2024
Python

zRzRzRzRzRzRzR / lm-fly

Sponsor

Star

大模型推理框架加速，让 LLM 飞起来

mlx tgi openvino llm vllm llm-inference tensorrt-llm

Updated May 10, 2024
Python

fgblanch / OutlookLLM

Star

Add-in for new Outlook that adds LLM new features (Composition, Summarizing, Q&A). It uses a local LLM via Nvidia TensorRT-LLM

outlook-addin tensorrt-llm

Updated Feb 24, 2024
Python

lix19937 / llm-deploy

Star

AI Infra LLM infer/ tensorrt-llm/ vllm

llm llm-inference tensorrt-llm

Updated Mar 29, 2024
Python

Improve this page

Add a description, image, and links to the tensorrt-llm topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the tensorrt-llm topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tensorrt-llm

Here are 6 public repositories matching this topic...

collabora / WhisperLive

huggingface / optimum-benchmark

rpehkone / Chat-With-RTX-python-api

zRzRzRzRzRzRzR / lm-fly

fgblanch / OutlookLLM

lix19937 / llm-deploy

Improve this page

Add this topic to your repo