Gradio based tool to run opensource LLM models directly from Huggingface
python
streaming
opensource
cpu
chatbot
cuda
openai
gradio
huggingface
model-conversion
websockets-chat
llm
safetensors
langchain
llamacpp
llm-inference
ollama
llama-cpp-python
gguf
-
Updated
Jun 27, 2024 - Python