#

llm-inference

Here are 5 public repositories matching this topic...

substratusai / runbooks

Finetune LLMs on K8s by using Runbooks

kubernetes kubernetes-operator mlops ml-platform llmops llm-serving llm-training llm-inference

Updated Nov 21, 2023
Go

adalkiran / llama-nuts-and-bolts

A holistic way of understanding how LLaMA and its components run in practice, with code and detailed documentation.

go golang unicode machine-learning deep-learning utf-8 ml transformers llama educational-project large-language-models llm llms llm-inference llama2 llama2-7b llms-book

Updated Apr 29, 2024
Go

Hoshinonyaruko / Gensokyo-llm

开源的智能体项目支持6种聊天平台 Onebotv11一对多连接流式信息 agent 对话keyboard气泡生成支持6种大模型接口(持续增加中) 具有将多种大模型接口转化为带有上下文的通用格式的能力.

chatbot qqbot ai-agents onebot onebot-plugin llm onebot11 llm-inference ai-agents-framework llm-api

Updated Jun 20, 2024
Go

Climatik-Project / Climatik-Project

Carbon Limiting Auto Tuning for Kubernetes

kubernetes sustainability kepler kubernetes-operator power-capping green-computing keda kserve llm vllm llm-inference

Updated Jun 28, 2024
Go

beam-cloud / beta9

The open-source serverless GPU container runtime.

gpu distributed-computing cuda self-hosted fine-tuning ml-platform large-language-models llm generative-ai llm-inference

Updated Jun 28, 2024
Go

Improve this page

Add a description, image, and links to the llm-inference topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llm-inference topic, visit your repo's landing page and select "manage topics."