EmbeddedLLM
Pinned Loading
Repositories
- vllm Public Forked from vllm-project/vllm
vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
- vllm-rocmfork Public Forked from ROCm/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
- LMCache Public Forked from LMCache/LMCache
ROCm support of Ultra-Fast and Cheaper Long-Context LLM Inference
- JamAIBase Public
The collaborative spreadsheet for AI. Chain cells into powerful pipelines, experiment with prompts and models, and evaluate LLM responses in real-time. Work together seamlessly to build and iterate on AI applications.
- vllmtests Public
This is a repository containing the tools for testing vLLM correctness and perf regression
- aiter-api-watcher Public
This is a repository to monitor the fast changing ROCm/aiter repository to alert user that AITER function of interests e.g. in vLLM, in SGLang has been updated at certain commit.
Top languages
Loading…
Most used topics
Loading…