An Easy-to-use, Scalable and High-performance RLHF Framework (supports 70B+ full tuning, LoRA, Mixtral, and KTO)
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you can run inference with any open-source language model, speech recognition model, or multimodal model, whether in the cloud, on-premises, or even on your laptop.
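The "single line of code" works because servers like Xinference and vLLM expose an OpenAI-compatible HTTP API, so a client only needs to point at a different base URL. A minimal stdlib sketch of building such a request, assuming a hypothetical local server on port 9997 (Xinference's commonly documented default) and a hypothetical registered model name:

```python
import json
from urllib.request import Request

def build_chat_request(base_url, model, messages):
    """Build an OpenAI-style /chat/completions request for any
    OpenAI-compatible server (Xinference, vLLM, or api.openai.com)."""
    payload = json.dumps({"model": model, "messages": messages}).encode()
    return Request(
        f"{base_url}/chat/completions",
        data=payload,
        headers={"Content-Type": "application/json"},
    )

# Swapping providers means changing only this base URL (and model name);
# both values here are assumptions for illustration.
req = build_chat_request(
    "http://localhost:9997/v1",
    "llama-2-chat",
    [{"role": "user", "content": "Hello"}],
)
print(req.full_url)  # http://localhost:9997/v1/chat/completions
```

The same request shape is what official OpenAI SDKs emit, which is why a one-line `base_url` change is enough to redirect an existing app.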
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
A production-ready REST API for vLLM
Examples of serving LLM on Modal.
Run code inference-only benchmarks quickly using vLLM
Chat with Lex! A RAG app using HyDE, with Milvus as the vector store, vLLM for LLM inference, and FastEmbed for embeddings.
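HyDE (Hypothetical Document Embeddings) retrieves by embedding an LLM-drafted hypothetical answer rather than the raw question, on the idea that an answer-shaped text lands closer to the relevant documents in embedding space. A minimal sketch of the pattern, with `generate` and `embed` as toy stand-ins for the real LLM (e.g. vLLM) and embedder (e.g. FastEmbed), and an in-memory dict standing in for a vector DB like Milvus:

```python
def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = sum(x * x for x in a) ** 0.5
    nb = sum(x * x for x in b) ** 0.5
    return dot / (na * nb) if na and nb else 0.0

def hyde_search(question, generate, embed, index):
    """Return the doc id whose stored vector best matches the embedded
    hypothetical answer. `index` maps doc id -> embedding vector."""
    hypothetical_answer = generate(question)  # LLM drafts a plausible answer
    query_vec = embed(hypothetical_answer)    # embed the draft, not the question
    return max(index, key=lambda doc_id: cosine(query_vec, index[doc_id]))
```

In a real deployment the dict lookup becomes a Milvus similarity query, but the control flow (generate, then embed, then search) is the same.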
Preserving entities through the integration of knowledge graphs, Llama 2, vLLM, and LangChain.
Genshin Impact character chat models, LoRA-tuned on LLMs
Carbon Limiting Auto Tuning for Kubernetes
Standardized spec and vendor-specific transforms for ChatML
llm-inference is a platform for publishing and managing LLM inference, providing a wide range of out-of-the-box features for model deployment, such as a UI, a RESTful API, auto-scaling, computing resource management, monitoring, and more.
A large-scale simulation framework for LLM inference
Evaluate open-source language models on agent, formatted-output, instruction-following, long-text, multilingual, coding, and custom task capabilities.
Accelerating LLM inference frameworks to make LLMs fly
A collection of completed LLM projects and a good starting point for learning about LLMs.