pacoxu
🥊
no big project yet, still low review qualities 💪

Starred repositories

Multi-tenancy and policy-based framework for Kubernetes.

Go 1,765 172 Updated Mar 11, 2025

This project is designed to simulate GPU information, making it easier to test scenarios where a GPU is not available.

C++ 29 1 Updated Mar 5, 2025
Go 127 25 Updated Mar 11, 2025

An open-source runtime for composable workflows. Great for AI agents and CI/CD.

Go 12,832 673 Updated Mar 11, 2025

aibrix static site

1 Updated Mar 11, 2025
Go 4 Updated Feb 11, 2025
2 2 Updated Feb 27, 2025

Drivers plans and notes

4 3 Updated Mar 5, 2025
Go 10 2 Updated Mar 4, 2025

Reference implementations of MLPerf™ inference benchmarks

Python 1,328 545 Updated Mar 10, 2025
Python 12 4 Updated Nov 27, 2023

reflect api without runtime reflect.Value cost

Go 780 74 Updated Jul 10, 2024

GaussDB driver and toolkit for Go

Go 4 1 Updated Feb 8, 2025

Start the main process of a pod only if elected via Kubernetes leader election. While this was developed for LINSTOR, it may prove useful for other use cases.

Go 69 3 Updated Feb 14, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 80,915 11,855 Updated Mar 11, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,129 617 Updated Mar 11, 2025

Hydrophone is a lightweight Kubernetes conformance tests runner

Go 77 33 Updated Mar 5, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Jupyter Notebook 3,103 282 Updated Mar 11, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,728 199 Updated Mar 4, 2025

🤖💬 extra-small AI SDK for Browser, Node.js, Deno, Bun or Edge Runtime.

TypeScript 243 16 Updated Mar 11, 2025

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

Go 825 67 Updated Mar 11, 2025

An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises

Go 10 Updated Dec 3, 2024
Go 3 7 Updated Aug 28, 2024

A containerd snapshotter with data deduplication and lazy loading in P2P fashion

Go 182 105 Updated Jan 26, 2025

📍 A map of my life, where each week I've been alive is a little box.

HTML 441 74 Updated Feb 19, 2025

GenAI inference performance benchmarking tool

Python 19 8 Updated Mar 10, 2025

InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.

Python 1,188 399 Updated Mar 10, 2025

☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

Go 89 15 Updated Mar 11, 2025