
Starred repositories
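Ollama load-balancing server | A high-performance, easy-to-configure open-source load-balancing server that optimizes Ollama workloads. It helps improve your application's availability and response speed while ensuring efficient use of system resources.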
Analyzes resource usage and performance characteristics of running containers.
The ultimate LLM/AI application development framework in Golang.
Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including OpenAI Agents SDK, CrewAI, Langchain, Autogen, AG2, and CamelAI
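Building large language models that understand social etiquette and interpersonal nuance | Covers tutorials on prompt engineering, RAG, agents, and LLM fine-tuning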
Examples of AI Multi-Agent Solutions
Cost-efficient and pluggable Infrastructure components for GenAI inference
Manage Kubernetes in the lightest and most convenient way ☸️
Kubernetes-like control planes for form-factors and use-cases beyond Kubernetes and container workloads.
Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec
A stress testing tool for the scheduler in a large-scale scenario.
SGLang is a fast serving framework for large language models and vision language models.
Load watcher is a cluster-wide aggregator of metrics, developed for Trimaran: Real Load Aware Scheduler in Kubernetes.
A popular & widely deployed Open Source Container Native Storage platform for Stateful Persistent Applications on Kubernetes.
Talk to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face, running locally across platforms
An extended suite for running Hadoop YARN jobs in K8s with Koordinator.
Prometheus-based Kubernetes Resource Recommendations
🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫
Materials of My Kubernetes Deep Dive Sessions
Awesome-LLM: a curated list of Large Language Models
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
Dragonfly is an open source P2P-based file distribution and image acceleration system. It is hosted by the Cloud Native Computing Foundation (CNCF) as an Incubating Level Project.
Prometheus ephemeral storage metrics exporter