-
-
-
llmaz Public
Forked from InftyAI/llmazA Kubernetes operator helps to manage Llmaz gracefully.
Go Apache License 2.0 UpdatedMar 11, 2025 -
github-workflow-as-kube Public
Following the same workflows as Kubernetes.
-
template-repo Public
Forked from InftyAI/template-repoA template repo.
Apache License 2.0 UpdatedMar 11, 2025 -
Awesome-LLMOps Public
Forked from InftyAI/Awesome-LLMOps🎉 An awesome & curated list of best LLMOps tools.
UpdatedMar 10, 2025 -
-
lws Public
Forked from kubernetes-sigs/lwsLeaderWorkerSet: An API for deploying a group of pods as a unit of replication
Go Apache License 2.0 UpdatedMar 6, 2025 -
aibrix Public
Forked from vllm-project/aibrixCost-efficient and pluggable Infrastructure components for GenAI inference
Jupyter Notebook Apache License 2.0 UpdatedMar 3, 2025 -
kubernetes Public
Forked from kubernetes/kubernetesProduction-Grade Container Scheduling and Management
-
website Public
Forked from kubernetes/websiteKubernetes website and documentation repo:
-
llama.cpp Public
Forked from ggml-org/llama.cppLLM inference in C/C++
C++ MIT License UpdatedFeb 26, 2025 -
PUMA Public
Forked from InftyAI/PUMAA lightweight, high-performance inference engine for heterogeneous devices.
Rust Apache License 2.0 UpdatedFeb 25, 2025 -
org Public
Forked from kubernetes/orgMeta configuration for Kubernetes Github Org
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedFeb 21, 2025 -
test-infra Public
Forked from kubernetes/test-infraTest infrastructure for the Kubernetes project.
-
-
enhancements Public
Forked from kubernetes/enhancementsEnhancements tracking repo for Kubernetes
-
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedFeb 13, 2025 -
inftyai-scheduler-plugins Public
Forked from InftyAI/scheduler-pluginsA Kubernetes scheduler designed for smart scheduling with llmaz.
Go UpdatedFeb 13, 2025 -
llmchat Public
Forked from InftyAI/llmchatA building block for users to build their own LLM from A to Z.
Python Apache License 2.0 UpdatedFeb 8, 2025 -
-
kueue Public
Forked from kubernetes-sigs/kueueKueue: Kubernetes-native Job Queueing
-
-
-
-
Manta Public
Forked from InftyAI/MantaA simplified P2P-based cache system for model distributions.
Go Apache License 2.0 UpdatedDec 16, 2024 -
volcano Public
Forked from volcano-sh/volcanoA Cloud Native Batch System (Project under CNCF)
Go Apache License 2.0 UpdatedDec 12, 2024 -
inftyai-community Public
Forked from InftyAI/communityThe InftyAI community.
Apache License 2.0 UpdatedOct 23, 2024 -
community Public
Forked from kubernetes/communityKubernetes community content