-
Microsoft
- San Francisco
- https://ritazh.com
- @ritazzhang
Stars
Next Generation Agentic Proxy for AI Agents and MCP servers
Home of the out-of-tree KAITO plugin for Headlamp Kubernetes UI
The Security Toolkit for LLM Interactions
Set of tools to assess and improve LLM security.
A comprehensive social media management tool designed to help you create, format, and post content across multiple platforms including LinkedIn, Twitter/X, Bluesky, and Mastodon. Features advanced …
OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)
OPA Gatekeeper provider for GitHub Artifact Attestations
This repositories contains examples and best practices for AI workloads on Azure
The main repo for NLWeb, implemented in Python.
A TTS model capable of generating ultra-realistic dialogue in one pass.
Model Context Protocol (MCP) server for Kubernetes and OpenShift
Composio equips your AI agents & LLMs with 100+ high-quality integrations via function calling
Cloud Native Agentic AI | Discord: https://bit.ly/kagentdiscord
mcp-use is the easiest way to interact with mcp servers with custom agents
⚡ Guidance, samples, and tools for HPC workloads on AKS clusters with RDMA and InfiniBand support, including GPUDirect RDMA.
GenAI inference performance benchmarking tool
Constrain, log and scan your MCP connections for security vulnerabilities.
A comprehensive security checklist for MCP-based AI tools. Built by SlowMist to safeguard LLM plugin ecosystems.
An open source DevOps tool for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI artifact.
📦️ A fast, secure MCP server that extends its capabilities through WebAssembly plugins.
Chat with your Kubernetes Cluster using AI tools and IDEs like Claude and Cursor!
hyperlight-wasm is a rust library crate that enables Wasm Modules and components to be run inside lightweight Virtual Machine backed Sandbox. It is built on top of Hyperlight.
A Datacenter Scale Distributed Inference Serving Framework
SGLang is a fast serving framework for large language models and vision language models.
Kubernetes RBAC authorizing HTTP proxy for a single upstream.
Cost-efficient and pluggable Infrastructure components for GenAI inference